# Mini-Mythos

A (shoddy) OSS recreation of Anthropic's Mythos Preview cybersecurity harness* that locates and verifies memory-safety vulnerabilities in C/C++ codebases.

*AGI not included, results may vary, side effects may include the end of all software, ludicrous API bills, and/or Anthropic account bans (probably not, but I wouldn't say never ;-;)

## Anthropic's Design is Stupidly Simple

- Rank every file 1-5.
- Spin up a Docker container with an ASan-instrumented build.
- Prompt Claude Code with that file to 'find an exploit bro' and report back a defect with a reproduction script.
- Have a Judge critique the finding for BS.
- Repeat for EVERY FILE.

That's it. (A minimal sketch of this loop follows.)
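Stripped to its bones, the loop looks roughly like the sketch below. This is NOT the real orchestrator.py: the helper names (`score_file`, `audit_file`, `judge`), the prompts, and the file list are made up for illustration. What is grounded in the design above: `claude -p` is the Claude Code CLI's non-interactive print mode, and the audit agent is dispatched into the target container via `docker exec` (see runner.py in the layout below).

```python
"""A minimal sketch of the loop above, NOT the real orchestrator.py.

Hypothetical bits: helper names, prompt wording, the file list.
Assumes a built, running target container with the Claude Code CLI
installed inside it (see Setup).
"""
import subprocess

CONTAINER = "minimythos_myproject"  # matches [docker].container_name in target.toml


def claude(prompt: str, in_container: bool = False) -> str:
    """One-shot Claude Code call; the audit agent runs inside the target container."""
    cmd = ["claude", "-p", prompt]
    if in_container:
        cmd = ["docker", "exec", CONTAINER, *cmd]
    return subprocess.run(cmd, capture_output=True, text=True, check=True).stdout


def score_file(path: str) -> int:
    """Rank the file 1-5 for memory-safety risk (host-side, like scorer.py)."""
    reply = claude(f"Rate {path} from 1-5 for memory-safety risk. Reply with one digit.")
    return int(reply.strip()[0])


def audit_file(path: str) -> str:
    """'Find an exploit bro' plus a reproduction script, from inside the container."""
    return claude(f"Find a memory-safety defect in {path} and write a repro script.",
                  in_container=True)


def judge(finding: str) -> bool:
    """Gate B: an independent judge critiques the finding for BS."""
    verdict = claude(f"Critique this finding. Reply REAL or BS only.\n\n{finding}")
    return verdict.strip().upper().startswith("REAL")


if __name__ == "__main__":
    files = ["miniupnpd.c", "upnphttp.c"]  # in reality: EVERY file
    for path in sorted(files, key=score_file, reverse=True):
        finding = audit_file(path)
        print(f"{path}: {'confirmed' if judge(finding) else 'rejected'}")
```

The real harness also runs Gate A (verifier.py): actually executing the trigger and checking the sanitizer output, which this sketch skips.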
Obviously, I do not have access to Claude Mythos. This project is an experiment in 'baking a cake without flour'. The hypothesis is that, with a reasonable harness, you don't need it. It doesn't take a genius to realize 'rate every file 1-5' is likely NOT the best way to automate zero-day discovery, and specialized tools + scaffolding might hold the key to better performance. Besides, long term, big compute + historic CVEs + OSS git checkpoints make a perfect RL sandbox for tuning agentic cyber-sec tools.

As for whether it works: early results are positive. See the Current Progress section below for details & yapping.

## Requirements

- Docker
- Python 3.12+ (`pip install -r requirements.txt`)
- Claude Code CLI, authenticated (`claude /login`)

## Setup

Create `targets/<name>/target.toml`:

```toml
[project]
name = "myproject"
description = "a short description"

[docker]
container_name = "minimythos_myproject"
image = "minimythos_myproject:latest"
workdir = "/opt/myproject"

[build]
repo_url = "https://github.com/example/myproject.git"
repo_revision = "abc123"
build_dir = "subdir"  # optional: subdirectory within the repo to audit
```

Copy `docker/Dockerfile.example` to your target directory, fill in the build commands (autotools and CMake examples are included), then:

```bash
docker build -t minimythos_myproject:latest targets/myproject/
docker run -d --name minimythos_myproject minimythos_myproject:latest
```

The docker build step already generated `reachable_symbols.json` inside the container (via `nm` over every compiled `.o` file). Just copy it out:

```bash
mkdir -p runs/targets/myproject
docker cp minimythos_myproject:/opt/myproject/reachable_symbols.json \
  runs/targets/myproject/reachable_symbols.json
```

This enables dead-code filtering. If you skip it, the harness scores all files anyway.

## Running

```bash
cd harness
python3 -u orchestrator.py --target myproject
```

Flags:

| Flag | Purpose |
|---|---|
| `--max-runs N` | Stop after N audit runs |
| `--dry-run` | Score files and print the queue; skips audit runs |
| `--skip-docker` | Skip Gate A trigger execution |
| `--budget USD` | Hard cap in USD (default: $50) |
| `--model MODEL` | Audit model (default: `claude-opus-4-6`) |

## Monitoring

```bash
# Live color log (run in a second terminal)
python3 watch_run.py --tail

# List completed runs
python3 show_run.py

# View a specific run transcript (any prefix of run_id works)
python3 show_run.py <run_id>

# View the judge transcript for a run
python3 show_run.py <run_id> --judge
```

Note: re-running the same command resumes from the next unresolved file. To reset fully:

```bash
rm runs/targets/myproject/audit.jsonl
rm runs/targets/myproject/scores.json
```

## Layout

```
harness/
  orchestrator.py   main loop
  preprocessor.py   dead-code filter (tree-sitter + symbol table)
  scorer.py         LLM file scoring (host-side Claude)
  runner.py         audit agent dispatch (docker exec claude)
  validator.py      Gate B: independent judge agent
  verifier.py       Gate A: trigger execution + sanitizer check
  budget.py         hard spend cap enforcement
  config.py         all settings (models, timeouts, budgets)
  prompts/
    score.txt       file scoring prompt
    audit.txt       audit agent prompt
    judge.txt       judge agent prompt
```

## Current Progress

So far, I've worked with miniupnpd as a test repo and managed to recreate historic CVEs on older checkouts. For novel work, the $20 Opus plan has unearthed a global buffer overflow in miniupnpd.c, remotely triggered if running with a non-default config option.

Notes:

- Anthropic models are surprisingly willing to just FIND vulnerabilities and write triggers in this setting. Anecdotally, there is a very low refusal rate when prompting them to find and write trigger scripts inside this automated harness.
- Wrapping the Claude Code CLI directly is a massive shortcut. It might be too heavy and warrant changes later, but it's a SOTA agent scaffolding for a reason and mirrors what Anthropic reported in their tests.

Experimenting with better harnesses to test current model capabilities seems promising, with Opus [already finding live Firefox vulnerabilities](https://www.anthropic.com/news/mozilla-firefox-security).

## Planned improvements / experiments

- Semantic taint analysis before the main agent, to focus the search space
- Joern tools for call-graph analysis and reachability checks
- AST-aware context trimming
- Patch churn targeting
- Adding wrappers for Codex and OpenCode to benchmark performance against Claude (see the adapter sketch at the end of this README)

Obviously, PRs and issues welcome. See Contributing.
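For the Codex/OpenCode roadmap item above, a hypothetical adapter shape might look like the sketch below. None of these names exist in the repo; the `claude -p` entry mirrors the existing runner, while the `codex exec` and `opencode run` invocations are assumptions about those CLIs' non-interactive modes and would need checking against their docs.

```python
"""Hypothetical adapter sketch for benchmarking other agent CLIs
as the audit agent. Nothing here is implemented in the repo yet."""
from dataclasses import dataclass
import subprocess


@dataclass
class AgentCLI:
    """One entry per CLI we might benchmark as the audit agent."""
    name: str
    argv: list[str]  # command template; the prompt is appended last

    def audit(self, prompt: str) -> str:
        """Dispatch one audit prompt and return the raw transcript."""
        out = subprocess.run(self.argv + [prompt], capture_output=True, text=True)
        return out.stdout


AGENTS = [
    AgentCLI("claude", ["claude", "-p"]),       # mirrors runner.py today
    AgentCLI("codex", ["codex", "exec"]),       # assumed flag, verify
    AgentCLI("opencode", ["opencode", "run"]),  # assumed flag, verify
]

if __name__ == "__main__":
    prompt = "Find a memory-safety defect in miniupnpd.c and write a repro script."
    for agent in AGENTS:
        transcript = agent.audit(prompt)
        print(f"{agent.name}: {len(transcript)} chars of transcript")
```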