惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

S
Securelist
O
OpenAI News
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
T
Threat Research - Cisco Blogs
D
Darknet – Hacking Tools, Hacker News & Cyber Security
Google Online Security Blog
Google Online Security Blog
C
CXSECURITY Database RSS Feed - CXSecurity.com
N
News and Events Feed by Topic
S
Security Affairs
SecWiki News
SecWiki News
Project Zero
Project Zero
L
Lohrmann on Cybersecurity
P
Proofpoint News Feed
P
Palo Alto Networks Blog
L
LINUX DO - 最新话题
H
Hacker News: Front Page
Recent Commits to openclaw:main
Recent Commits to openclaw:main
I
Intezer
Simon Willison's Weblog
Simon Willison's Weblog
W
WeLiveSecurity
T
The Exploit Database - CXSecurity.com
K
Kaspersky official blog
The GitHub Blog
The GitHub Blog
I
InfoQ
云风的 BLOG
云风的 BLOG
雷峰网
雷峰网
B
Blog
IT之家
IT之家
AWS News Blog
AWS News Blog
Jina AI
Jina AI
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
Google DeepMind News
Google DeepMind News
Spread Privacy
Spread Privacy
N
News and Events Feed by Topic
Security Latest
Security Latest
美团技术团队
C
Check Point Blog
WordPress大学
WordPress大学
T
Tenable Blog
S
Security @ Cisco Blogs
Last Week in AI
Last Week in AI
博客园 - 聂微东
月光博客
月光博客
博客园 - 【当耐特】
S
Schneier on Security
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
S
Secure Thoughts
Schneier on Security
Schneier on Security
C
Cisco Blogs
Cyberwarzone
Cyberwarzone

Hacker News: Show HN

PurrrrrFocus: Pomodoro Timer App - App Store Workflow Engine — Multi-Step Orchestration for Bun RapidPhoto: Pro Photo Editor App - App Store GitHub - amitb-quantum/roboapi: The unified API layer for robotics. Connect any robot, any brand, with one SDK. Like Stripe, but for robots. GitHub - manankharwar/fusioncore: ROS 2 sensor fusion SDK: UKF, 3D native, proper GNSS, zero manual tuning. Apache 2.0. Show HN: Wayland Wlroot Hjkl Everywhere Show HN: Codex context bloat? 87% avg reduction on SWE-bench Verified traces GitHub - NoahCristino/llmcat: A simple CLI that transforms your code into clean, structured text for feeding into LLMs. GitHub - actuallyepic/background-computer-use GitHub - h4ckf0r0day/obscura: The headless browser for AI agents and web scraping GitHub - anthonybudd/Express-ts-API-Template: Production-ready minimal REST API boilerplate using Express.js, Sequelize and MySQL. GitHub - molefrog/lilmd: Agent-friendly CLI for reading large Markdown files Show HN: I built a simple site to reduce tool overload and improve focus GitHub - JonathanRosado/claude-anyteam: Native Claude Code teammates, any LLM. Codex today. Gemini, Kimi, GLM, DeepSeek next. GitHub - abi/lilo Show HN: ffmpeg-render-pro – Parallel video rendering with live dashboard GitHub - adam-s/HNswered: Notifies you when someone replies to your Hacker News posts and comments. GitHub - arian-gogani/nobulex: The accountability primitive for AI agents. Cryptographic behavioral commitments with trustless verification. GitHub - Developing-Gamer/roids: Steroids for your AI agents GitHub - nellavio/nellavio: 🚀 Next.js dashboard starter with auth, i18n, 18 pages, 60+ charts and 90+ UI components Spotify – Web Player GitHub - tillahoffmann/cctape: Claude proxy to record all interactions in a local database, allowing you to browse and search sessions, track usage, and let Claude search its own history over MCP. Show HN: Rook, a macOS notes app for developers GitHub - donchuru/mr-links: Chrome extension that shows referenced links above comments on Marginal Revolution assorted links posts GitHub - anubhavgupta/whisper-npu: Speech to text at cursor using NPU. (shortcut -> win+/) SHOW HN: I built a marketplace agency that treats profit as the actual metric GitHub - trycua/cua: Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full desktops (macOS, Linux, Windows). GitHub - montanaflynn/headless-terminal: Headless terminal — puppeteer for TUIs (vim/emacs/htop/nethack) with a Go CLI backed by libghostty-vt The Genesis of TurboPentest.com: Bridging the Gap in an AI-Code Explosion Era GitHub - VincenzoManto/Doxa: A YAML-driven multi-agent simulation platform for economic and social systems. It combines LLM-backed agents, market microstructure, relation graphs, and world events behind a FastAPI API and a React client. GitHub - IdreesInc/PokeMesh: Collaborative Pokémon over a decentralized network, no internet required! GitHub - tamnd/python-one: Historical Python source tarballs (1.0.1 through 2.0c1) mirrored from legacy.python.org, extracted and committed at their original release dates. GitHub - dixalex/decision-linter: Decision Linter — like ESLint for your thinking. Claude Code plugin that scores assumptions, runs consider-the-opposite, and outputs judgment memos. GitHub - pitermarx/Virgulas: An Workflowy Inspired Outliner Show HN: A standalone YouTube Music popup for OBS (no plugins required) GitHub - polotno-project/render-tag: Render HTML string into canvas using 2d native API libgoc/bench/README.md at 81798b3dd5e27d7f1e752b6cff0bdf3a95d381b7 · libgoc/libgoc Show HN: Llm.sql – Run a 640MB LLM on SQLite, with 210MB peak RSS and 7.4 tok/s GitHub - RivoLink/leaf: Terminal Markdown previewer — GUI-like experience. [China AI News] Eight Chinese chip families ran DeepSeek V4 on launch day GitHub - michaelaz774/decision-engine: A decision operating system for startup founders, powered by Claude Code. Synthesizes wisdom from 25+ legendary founders and investors into interactive AI-driven decision frameworks. Sheetflow App - Webflow Apps & Integrations Questly — The Third Rail GitHub - crufter/safer: Sleep better while AI agents have shell access. gitrails/README.md at main · maxawzsinger/gitrails Show HN: RustNmap | Hacker News GitHub - 0x0funky/agent-sprite-forge: Agent Skill for generating 2D sprite sheets, transparent PNG frames, and animated GIFs from prompts. Submissions from github.com/darshanfofadiya | Hacker News GitHub - seb3773/ntfs-repair-rfc: A legally defensible, industry-grade blueprint for building an open-source NTFS structural repair engine from scratch. TOSTask - Chrome 应用商店 GitHub - Fergana-Labs/stash: Shared memory for your team's coding agents GitHub - AdirAmsalem/easl: Instant hosting for AI agents — turn output into pages worth sharing Show HN: Pdfnative – zero-dependency TypeScript PDF engine GitHub - EricNelson12/retrocycles-hilbert GitHub - ONSARI/payclaw-skill Earful — A voice-only social network GitHub - Chrilleweb/dotenv-diff: Validate environment variable usage in your codebase llms-wordpress-plugin-benchmark/README.md at main · guilamu/llms-wordpress-plugin-benchmark GitHub - agentdmai/teamfuse: Fuse Claude Code agents into a working team. AgentDM-powered template with a cabinet-style control panel, five starter roles Thedex — AI-Native Log Intelligence Introducing Universal Deploy (+server) | Vike GitHub - thomas-vilte/mls-go: MLS Protocol (RFC 9420) implementation in Go. Secure group key exchange with forward secrecy and post-compromise security for E2EE messaging. GitHub - tamnd/python-0.9.1: Python 0.9.1 from 1991, Guido van Rossum's first public release, patched to compile on modern systems GitHub - orchidfiles/ungate: Use your Claude and ChatGPT subscriptions in Cursor instead of paying for API tokens. Show HN: Rusty Browser – AI rust service spinning up AI browsers Show HN: Seleci – Pre-built AI agents that keeps your business running GitHub - SkardiLabs/skardi: Spark for Agents — a data platform that gives AI agents full data autonomy so every dataset in your stack becomes something an agent can actually use. sss/Design.md at share-hn · ekipan/sss GitHub - TwillAI/agentbox-sdk: The open-source TypeScript SDK for running AI coding agents in sandboxes. One unified API — swap agents and infrastructure providers without changing your code. GitHub - cheprasov/ts-jsbt: JavaScript Binary Transfer (JSBT) – a binary serialization format designed for JavaScript → JavaScript communication. GitHub - AI-Colleagues/skill-mgr: Agent Skill manager. Install a skill for all supported or specified agents When Your Repo Moves, Your AI Coding History Doesn’t Buffer zoom GitHub - janaraj/tnl: Structured English contracts for AI coding agents — proposed by the agent, approved by you, saved on disk, read by every future session. Show HN: Slopify – An AI agent skill to slopify a codebase GitHub - tinyhumansai/openhuman: Your Personal AI super intelligence. Private, Simple and extremely powerful. Show HN: Reducing a 66-node dependency cycle to 13 in Scrapy GitHub - NV404/gova GitHub - latitude-dev/eval-skills: LLM eval skills for developers. Free tools to find failure patterns, build evals, and improve AI quality in production GitHub - BadC-mpany/lilith-zero: ⚸ Lilith Zero - Security Middleware for MCP tool calls written in Rust. GitHub - pumpkin-bit/Flux3n1: Music generation using the Collatz conjecture formula GitHub - Lumen-Labs/brainapi2: BrainAPI is a knowledge graph–powered AI memory layer that transforms unstructured data into structured knowledge, enabling intelligent search, recommendations, and contextual memory for AI agents and applications. An agent-native static host for AI-generated sites · VibeDrop GitHub - k38f/envsleuth: 🕵️ Detective for env vars in Python code. Finds os.getenv/os.environ usages via AST and checks them against your .env file. whodb/cli at main · clidey/whodb GitHub - Higangssh/winclipshot: Windows clipboard-to-path helper for terminal CLIs like Claude Code. Screenshot with Win+Shift+S, paste the saved path with Ctrl+V GitHub - decisionbox-io/decisionbox-platform: DecisionBox connects to your data warehouse, runs autonomous AI agents that write and execute SQL, and surfaces validated insights and actionable recommendations — without you asking a single question. GitHub - scastiel/kado: Kadō — A privacy-first habit tracker for iPhone and iPad. GitHub - franzenzenhofer/tinyscreenshot: Token-frugal screenshots for AI agents. A default capture costs ~540 tokens instead of ~2100. GitHub - russellromney/honker: SQLite extension + bindings for Postgres NOTIFY/LISTEN semantics with durable queues, streams, pub/sub, and scheduler Manex Hub App - App Store Release 0.5 · greymattergames/unbug GitHub - mljar/features_goldmine: Features Engineering Made Easy GitHub - chojs23/lazyagent: TUI for watching all your AI coding agents(claude, codex and opencode) Running Is The Hardest Endurance Sport? Not so Fast Show HN: Canopy – A2UI experiment in Go for macOS/AppKit Show HN: Share browser recordings on Cloudflare Pages from the command line GitHub - aiptimizer/TurboOCR: Fast GPU OCR server. 270 img/s on FUNSD. TensorRT FP16, PP-OCRv5, HTTP + gRPC. Show HN: We built a way for Claude Code to join meetings like a real teammate Show HN: Razorpay-universal – A framework-agnostic Razorpay SDK
GitHub - mmccalla/model-due-diligence: model-due-diligence is not a model safety verifier. It is a static evidence-gathering control for AI model supply-chain review. It supports provenance, artefact integrity, unsafe serialisation detection, secret exposure checks, suspicious code review, dependency risk detection, and audit reporting before first model execution.
djhope99 · 2026-06-13 · via Hacker News: Show HN

model-due-diligence static supply-chain security infographic

model-due-diligence is a Python command-line tool for performing static supply-chain due diligence on local AI model files and cloned model repositories before they are imported into runtimes such as Ollama, llama.cpp, LM Studio or Transformers.

It is designed to help answer one practical question:

“Is there obvious static evidence that this model artefact or repository should not be trusted, loaded or run without further review?”

It reduces practical risk from unsafe serialisation, suspicious repository content, weak provenance, exposed secrets, unexpected binaries, unsafe dependency files and malformed model metadata.

It does not prove that a model is safe.

A clean report means only that this tool did not identify the specific static artefact risks it is designed to detect. It must not be treated as proof that model weights, repository content, runtime behaviour or downstream use are benign.


Contents

  • What the tool does
  • What the tool does not do
  • Architecture
  • Scanner coverage
  • Risk scoring
  • Install
  • Quick start
  • CLI reference
  • Example workflows
  • Reports and outputs
  • Recommended operating model
  • Development workflow
  • Testing and quality gates
  • Repository structure
  • Security posture
  • Standards alignment
  • Limitations
  • Roadmap
  • Contributing
  • Licence

What the tool does

model-due-diligence statically inspects a local path and generates reviewable evidence.

It checks:

  • file inventory, SHA-256 hashes, permissions and symlinks;
  • high-risk serialisation formats such as pickle, .pt, .pth, .bin, .joblib and H5;
  • lower-risk model formats such as .gguf, .safetensors and .onnx;
  • GGUF magic bytes and version metadata;
  • safetensors header metadata;
  • suspicious text and binary strings;
  • Python AST indicators such as eval, exec, compile, pickle.loads, os.system and subprocess;
  • trust_remote_code=True usage in Python and text files;
  • risky pickle-like byte markers in high-risk serialisation formats;
  • high-entropy non-model files;
  • Git provenance, origin remote, current commit, dirty worktree and Git LFS listing where available;
  • external scanner output from ModelScan, Semgrep, Bandit, pip-audit and detect-secrets;
  • optional quality self-checks using Ruff, Pyright and mypy.

The tool produces:

  • a human-readable Markdown report;
  • a deterministic JSON report for automation;
  • an optional SARIF report for code-scanning workflows;
  • raw external scanner outputs where external tools are run.

What the tool does not do

The tool is intentionally static. During normal scanning it does not:

  • load model weights;
  • import untrusted repository code;
  • execute model-specific scripts;
  • run model inference;
  • send artefacts to external services;
  • require network access for local scanning;
  • decide automatically that a model is safe.

Static scanning cannot reliably detect:

  • malicious behaviour encoded directly into model weights;
  • sleeper-agent or trigger-based backdoors;
  • training-data poisoning;
  • benchmark-specific manipulation;
  • malicious behaviour that appears only after fine-tuning;
  • malicious behaviour that appears only after tools are connected;
  • prompt-injection obedience in downstream RAG or agent workflows;
  • data exfiltration behaviour that only appears at runtime;
  • vulnerabilities in local model runtimes;
  • all unsafe deserialisation evasions.

Use it as a risk-reduction gate, not as a trust oracle.


Architecture

The project uses a modular monolith architecture. This keeps installation and local execution simple while maintaining clear internal boundaries between CLI, orchestration, scanners, risk scoring and reports.

flowchart LR
    user[User / CI] --> cli[CLI]
    cli --> app[Application Orchestrator]
    app --> inventory[File Inventory]
    app --> native[Native Static Scanners]
    app --> external[External Scanner Adapters]
    app --> risk[Risk Scorer]
    risk --> report_model[Audit Report Model]
    app --> report_model
    report_model --> markdown[Markdown Report]
    report_model --> json[JSON Report]
    report_model --> sarif[SARIF Report]

    native --> text[Text Patterns]
    native --> ast[Python AST]
    native --> binary[Binary Strings]
    native --> entropy[Entropy]
    native --> metadata[Model Metadata]
    native --> pickle[Pickle Heuristics]
    native --> git[Git Provenance]

    external --> modelscan[ModelScan]
    external --> semgrep[Semgrep]
    external --> bandit[Bandit]
    external --> pipaudit[pip-audit]
    external --> secrets[detect-secrets]
    external --> quality[Quality Self-Checks]
Loading

Runtime flow

sequenceDiagram
    participant U as User / CI
    participant C as CLI
    participant A as App
    participant I as Inventory
    participant N as Native Scanners
    participant E as External Scanners
    participant R as Risk Scorer
    participant W as Report Writers

    U->>C: mdd <target> --out <dir>
    C->>C: Parse arguments and build ScanContext
    C->>A: Run scan
    A->>I: Build file inventory and hashes
    I-->>A: FileRecord[] + Finding[]
    A->>N: Run static native scanners
    N-->>A: Finding[] + ModelMetadata[]
    A->>E: Run optional external scanners
    E-->>A: CommandResult[] + Finding[]
    A->>R: Score findings and tool outcomes
    R-->>A: Risk score + risk level
    A-->>C: AuditReport
    C->>W: Write Markdown / JSON / SARIF
    C-->>U: Print risk score, risk level and report paths
Loading

Internal dependency direction

Dependencies should flow in one direction:

cli -> app -> domain
app -> inventory
app -> scanners
app -> external
app -> reporting
scanners -> domain/config/utils
external -> domain/config/command_runner
reporting -> domain/config

Rules:

  • scanners must not import app;
  • reporters must not run scanners;
  • external adapters must not write final project reports directly;
  • domain models must not depend on filesystem, subprocess or reporting modules;
  • native scanners must not execute model artefacts or repository code.

Scanner coverage

Coverage area Native support External support Status
File inventory, hashes and permissions Yes No Covered
Symlink detection Yes No Covered
Executable/script detection Yes Semgrep / Bandit Covered
High-risk serialisation detection Yes ModelScan Covered
Pickle heuristic indicators Yes ModelScan Covered
GGUF header inspection Yes No Basic coverage
Safetensors header inspection Yes No Basic coverage
Suspicious text/code patterns Yes Semgrep / Bandit Covered
Python AST dangerous-call detection Yes Bandit / CodeQL Covered
Binary string indicators Yes No Basic coverage
High-entropy anomaly detection Yes No Basic coverage
Secrets detection Yes detect-secrets Covered
Dependency vulnerability checks No pip-audit / Dependabot Covered for requirements.txt
Git provenance checks Yes No Basic coverage
Project code quality No Ruff / Pyright / mypy / pytest Covered
Repository semantic security analysis No CodeQL Covered in GitHub Actions
SARIF output Yes CodeQL native SARIF Partial
SBOM generation No No Planned
Sigstore / SLSA provenance No No Planned
Licence compatibility checks No No Planned
Model-card quality checks No No Planned
Weight-level backdoor detection No No Not reliably detectable
Runtime behavioural testing No No Planned separately

Risk scoring

Findings are normalised into severities and converted into a bounded score from 0 to 100.

Severity Current score contribution
INFO 0
LOW 3
MEDIUM 10
HIGH 30
CRITICAL 60

External scanner non-zero exits can also contribute to the score when the tool was available and produced reviewable signals.

Risk level Score range Meaning Recommended action
LOW 0-29 No obvious supported static artefact risks were found Acceptable for sandboxed first run
MEDIUM 30-69 Reviewable findings exist Do not import until findings are understood
HIGH 70-89 Material risk indicators exist Do not load unless every finding is justified
CRITICAL 90-100 Severe or multiple high-risk indicators exist Treat as unsafe by default

The score is intentionally conservative. It is a decision aid, not an automated trust verdict.


Install

Prerequisites

  • Python 3.11 or later;
  • Git;
  • a Unix-like shell for the provided scripts;
  • optional external scanner CLIs if you want full coverage.

Recommended local setup

python3 -m venv .venv
source .venv/bin/activate
python -m pip install --upgrade pip
python -m pip install -e ".[dev,scanners]"

Or use the setup script:

For a lighter install without optional scanner integrations:

./scripts/dev-setup.sh --no-scanners

Verify installation

mdd --help
mdd-ollama --help
model-due-diligence --help
python -m model_due_diligence --help

Quick start

Scan a cloned model repository:

mdd ./downloaded-model --out ./audit

Scan a local GGUF file:

mdd ~/models/qwen.gguf --out ./audit-qwen

Scan an installed Ollama model by name:

mdd-ollama qwen3:4b --out ./audit-qwen3-ollama

Fail the command when the risk level is medium or above:

mdd ./downloaded-model --out ./audit --fail-on medium

Run a fast smoke scan without optional external tools:

mdd tests/fixtures/safe_repo \
  --out ./audit-smoke \
  --fail-on critical \
  --skip-external

Generate only JSON output:

mdd ./downloaded-model \
  --out ./audit-json \
  --format json

CLI reference

usage: model-due-diligence [-h] [--out OUT] [--timeout TIMEOUT]
                           [--format FORMATS] [--skip-external]
                           [--skip-modelscan] [--skip-semgrep]
                           [--skip-bandit] [--skip-pip-audit]
                           [--skip-detect-secrets]
                           [--skip-quality-self-check]
                           [--quality-self-check]
                           [--fail-on {low,medium,high,critical}]
                           [--version]
                           target
Argument Description
target Path to a model file or model directory
--out Output report directory
--timeout Per-tool timeout in seconds
--format Comma-separated report formats: markdown,json,sarif
--skip-external Skip all optional external scanner tools
--skip-modelscan Skip ModelScan only
--skip-semgrep Skip Semgrep only
--skip-bandit Skip Bandit only
--skip-pip-audit Skip pip-audit only
--skip-detect-secrets Skip detect-secrets only
--quality-self-check Run Ruff, Pyright and mypy against this project as optional self-checks
--skip-quality-self-check Skip quality self-checks
--fail-on Return non-zero when risk is at or above the selected level
--version Print package version

mdd-ollama

mdd-ollama resolves an installed Ollama model from the local OLLAMA_MODELS store, stages scan-friendly filenames in a temporary directory, and then runs the normal static due-diligence flow on that staged directory.

It does not require the Ollama server to be running as long as the local manifest and blob store is present.

usage: mdd-ollama [-h] [--ollama-models-dir OLLAMA_MODELS_DIR] [--out OUT]
                  [--timeout TIMEOUT] [--format FORMATS] [--skip-external]
                  [--skip-modelscan] [--skip-semgrep] [--skip-bandit]
                  [--skip-pip-audit] [--skip-detect-secrets]
                  [--skip-quality-self-check] [--quality-self-check]
                  [--fail-on {low,medium,high,critical}] [--keep-staged]
                  model

Typical usage:

mdd-ollama llama3:8b --out ./audit-llama3

For an uninstalled checkout, run it with:

PYTHONPATH=src python3 -m model_due_diligence.ollama_cli llama3:8b --out ./audit-llama3

Example workflows

Audit a Hugging Face clone

Use the helper script:

./examples/audit-huggingface-clone.sh \
  https://huggingface.co/Qwen/Qwen3-8B-GGUF \
  ./audit-qwen3

The script clones into a temporary directory, runs the scanner, writes reports to the output directory and removes the temporary clone afterwards.

Audit a local GGUF file

./examples/audit-local-gguf.sh \
  ~/models/qwen3-8b-q4_k_m.gguf \
  ./audit-qwen3-gguf

Audit an installed Ollama model

./examples/audit-installed-ollama.sh \
  qwen3:4b \
  ./audit-qwen3-ollama

Use in CI

A conservative CI smoke gate can run without optional external scanners:

mdd tests/fixtures/safe_repo \
  --out ./audit-smoke \
  --fail-on critical \
  --skip-external

A fuller CI gate can install scanner extras and run:

mdd ./downloaded-model \
  --out ./audit \
  --fail-on high

Reports and outputs

A normal run can produce:

audit/
├── model_due_diligence_report.md
├── model_due_diligence_report.json
├── model_due_diligence_report.sarif
├── modelscan.json
├── semgrep.json
├── bandit.json
├── pip-audit-<hash>.json
└── detect-secrets.json
File Purpose
model_due_diligence_report.md Human-readable review report
model_due_diligence_report.json Machine-readable deterministic report
model_due_diligence_report.sarif Static-analysis output suitable for code-scanning workflows
modelscan.json Raw ModelScan output
semgrep.json Raw Semgrep output
bandit.json Raw Bandit output
pip-audit-<hash>.json Raw pip-audit output per requirements file
detect-secrets.json Raw detect-secrets output

Generated audit outputs may contain local paths, hashes, snippets and scanner evidence. Do not commit them unless you have reviewed them for sensitive content.


Recommended operating model

Use model-due-diligence as one control in a broader model supply-chain process:

Official or reputable source
+ pinned commit or hash
+ static due-diligence scan
+ first run in a no-network sandbox
+ no credentials mounted
+ restricted filesystem access
+ adversarial behavioural test suite
+ runtime monitoring
+ human review
= reasonable practical risk reduction

Recommended practice:

  1. Prefer official publisher repositories or reputable quantisers.
  2. Avoid floating tags such as latest for operational use.
  3. Pin exact Git revisions and record SHA-256 hashes.
  4. Run model-due-diligence before importing or loading artefacts.
  5. Review all HIGH and CRITICAL findings manually.
  6. Run first inference in a network-disabled container or VM.
  7. Do not mount API keys, SSH keys, cloud credentials or client data.
  8. Test prompt-injection and tool-use behaviour before RAG or agent deployment.
  9. Keep reports and accepted hashes for reproducibility.

Development workflow

Set up the environment:

./scripts/dev-setup.sh
source .venv/bin/activate

Run quality gates:

Run tests:

Build the package:

./scripts/build-package.sh

Build without running local checks first:

./scripts/build-package.sh --skip-checks

Testing and quality gates

The expected local quality gates are:

ruff format --check src tests
ruff check src tests
pyright
mypy src tests
pytest
mdd tests/fixtures/safe_repo --out ./audit-smoke --fail-on critical --skip-external

The helper script runs the same pattern:

Use fix mode for Ruff formatting and safe lint fixes:

./scripts/run-quality.sh --fix

Run unit tests only:

./scripts/run-tests.sh --unit

Run integration tests only:

./scripts/run-tests.sh --integration

Run with coverage:

./scripts/run-tests.sh --coverage

Repository structure

model-due-diligence/
├── .github/
│   ├── workflows/
│   │   ├── ci.yml
│   │   ├── codeql.yml
│   │   └── release.yml
│   ├── dependabot.yml
│   └── pull_request_template.md
├── docs/
│   ├── architecture.md
│   ├── contribution-guide.md
│   ├── limitations.md
│   ├── scanner-coverage.md
│   ├── standards-alignment.md
│   └── threat-model.md
├── examples/
│   ├── audit-installed-ollama.sh
│   ├── audit-huggingface-clone.sh
│   ├── audit-local-gguf.sh
│   └── sample-report.md
├── scripts/
│   ├── build-package.sh
│   ├── dev-setup.sh
│   ├── run-quality.sh
│   └── run-tests.sh
├── src/model_due_diligence/
│   ├── cli.py
│   ├── app.py
│   ├── config/
│   ├── domain/
│   ├── external/
│   ├── inventory/
│   ├── ollama.py
│   ├── ollama_cli.py
│   ├── reporting/
│   ├── scanners/
│   └── utils.py
├── tests/
│   ├── fixtures/
│   ├── integration/
│   └── unit/
├── .env.example
├── .gitattributes
├── .gitignore
├── .python-version
├── LICENSE
├── pyproject.toml
└── README.md

Security posture

The project follows these design rules:

  • static by default;
  • no model execution during scanning;
  • no untrusted repository code execution during scanning;
  • no shell invocation for external scanner commands;
  • external tool failures are visible in reports;
  • findings include severity, category, file, message, evidence where available and recommendation;
  • missing scanners are reported rather than silently ignored;
  • generated reports are ignored by Git by default;
  • real model artefacts are ignored by Git by default;
  • dependency updates are managed through Dependabot;
  • CodeQL runs through GitHub Actions;
  • releases build source and wheel distributions and validate metadata before publishing.

Standards alignment

An explicit control mapping for relevant NIST, MITRE, and OWASP guidance is in docs/standards-alignment.md.


Limitations

A clean report does not mean a model is safe.

Static checks cannot reliably detect:

  • subtle weight-level backdoors;
  • sleeper-agent behaviour;
  • poisoned training data;
  • malicious behaviour activated by rare prompts;
  • malicious behaviour activated only through tool use;
  • all deserialisation evasions;
  • all obfuscated payloads;
  • prompt-injection obedience in downstream RAG or agent workflows;
  • runtime exfiltration behaviour;
  • vulnerabilities in Ollama, llama.cpp, LM Studio, Transformers or other runtimes.

This tool should not be the sole approval mechanism for regulated production deployment, client-data processing, internet-connected agentic systems, autonomous coding agents with write access, or systems with access to secrets or privileged infrastructure.


Roadmap

Planned or candidate improvements:

  • fuller GGUF metadata validation;
  • safetensors tensor offset and shape validation;
  • Hugging Face metadata retrieval using pinned revisions;
  • SBOM generation;
  • Sigstore or SLSA provenance checks;
  • licence compatibility checks;
  • model-card quality scoring;
  • SARIF upload workflow;
  • sandboxed behavioural test harness for local inference;
  • prompt-injection and tool-use behavioural tests for RAG and agent workloads.

Contributing

See docs/contribution-guide.md.

Before opening a pull request, run:

Contributions should preserve the project’s core boundary: scanning must remain static by default and must not execute untrusted model artefacts or repository code.


Licence

Licensed under the Apache License, Version 2.0. See LICENSE.