Show HN: Self-healing browser harness via direct CDP
hackernews
|
|
📦 오픈소스
#ai 모델
#claude
원문 출처: hackernews · Genesis Park에서 요약 및 분석
요약
LLM이 웹 브라우저 작업을 완전히 자율적으로 수행할 수 있는 가장 단순하고 얇은 자가 치유 하네스가 공개되었습니다. 이 도구는 크롬과 직접 웹소켓으로 연결되며, 작업 중 필요한 기능이 빠져 있으면 에이전트가 직접 코드를 수정하여 기능을 추가하는 특징이 있습니다. 사용자는 설치 후 깃허브 저장소를 열어 상호작용이 작동하는지 별 표시 같은 간단한 데모를 통해 확인할 수 있습니다.
본문
The simplest, thinnest, self-healing harness that gives LLM complete freedom to complete any browser task. Built directly on CDP. The agent writes what's missing, mid-task. No framework, no recipes, no rails. One websocket to Chrome, nothing between. ● agent: wants to upload a file │ ● helpers.py → upload_file() missing │ ● agent edits the harness and writes it helpers.py 192 → 199 lines │ + upload_file() ✓ file uploaded You will never use the browser again. Paste into Claude Code or Codex: Set up https://github.com/browser-use/browser-harness for me. Read `install.md` first to install and connect this repo to my real browser. Then read `SKILL.md` for normal usage. Always read `helpers.py` because that is where the functions are. When you open a setup or verification tab, activate it so I can see the active browser tab. After it is installed, open this repository in my browser and, if I am logged in to GitHub, ask me whether you should star it for me as a quick demo that the interaction works — only click the star if I say yes. If I am not logged in, just go to browser-use.com. When this page appears, tick the checkbox so the agent can connect to your browser: See domain-skills/ for example tasks. Useful for sub-agents or deployment. Free tier: 3 concurrent browsers, no card required. - Grab a key at cloud.browser-use.com/new-api-key - Or let the agent sign up itself via docs.browser-use.com/llms.txt (setup flow + challenge context included). install.md — first-time install and browser bootstrapSKILL.md — day-to-day usagerun.py (~36 lines) — runs plain Python with helpers preloadedhelpers.py (~195 lines) — starting tool calls; the agent edits theseadmin.py +daemon.py (~361 lines) — daemon bootstrap plus the CDP websocket and socket bridge PRs and improvements welcome. The best way to help: contribute a new domain skill under domain-skills/ for a site or task you use often (LinkedIn outreach, ordering on Amazon, filing expenses, etc.). Each skill teaches the agent the selectors, flows, and edge cases it would otherwise have to rediscover. - Skills are written by the harness, not by you. Just run your task with the agent — when it figures something non-obvious out, it files the skill itself (see SKILL.md). Please don't hand-author skill files; agent-generated ones reflect what actually works in the browser. - Open a PR with the generated domain-skills// folder — small and focused is great. - Bug fixes, docs tweaks, and helper improvements are equally welcome. - Browse existing skills ( github/ ,linkedin/ ,amazon/ , ...) to see the shape. If you're not sure where to start, open an issue and we'll point you somewhere useful.
Genesis Park 편집팀이 AI를 활용하여 작성한 분석입니다. 원문은 출처 링크를 통해 확인할 수 있습니다.
공유