Invincat – A terminal AI coding agent with layered, auditable long-term memory
Source: hackernews · Summarized and analyzed by Genesis Park
Summary
Invincat, a Python-based terminal AI programming assistant, supports real engineering work such as reading and writing files in a local repository and executing commands. Its terminal-based workflow and guard-railed execution let you work without switching between an IDE and a browser. It also offers a plan-review step so teams can vet plans before execution, and an extensible architecture that preserves context across sessions.
Full Text
A Python-based terminal AI programming assistant — collaborate with AI directly in your project directory: read/write files, execute commands, browse the web, and maintain memory across sessions. Invincat is designed for real engineering work in local repositories, not demo-only chat.

- Terminal-native workflow: stay in your project directory and use AI without switching IDEs or browser tabs.
- Execution with guardrails: shell/file/network actions are approval-gated by default, with optional auto-approve for trusted flows.
- Plan-first delivery mode: `/plan` lets teams review and approve checklists before execution, reducing risky one-shot edits.
- Long-context durability: micro compression + offload keep long sessions usable without losing operational history.
- Practical memory model: user/project memory stores persist conventions across sessions and are inspectable via `/memory`.
- Extensible architecture: MCP tools, skills, and subagents allow adapting the assistant to team-specific workflows.

Invincat uses a multi-agent runtime with clear role boundaries:

- User input enters the session router. If `/plan` mode is active, input is routed to the Planner Agent; otherwise to the Main Agent.
- The Main Agent executes tools (file/shell/web/MCP) under approval and middleware guardrails.
- After a non-trivial turn completes, the Memory Agent runs asynchronously to extract durable user/project memory updates.
- When needed, the Main Agent delegates bounded subtasks to local or async subagents.
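The routing rules above can be sketched as a small dispatcher. This is a minimal illustration under assumed names (`Session`, `route_turn`, and `plan_mode` are hypothetical), not Invincat's actual implementation:

```python
from dataclasses import dataclass

@dataclass
class Session:
    """Hypothetical session state; the real runtime differs."""
    plan_mode: bool = False

def route_turn(session: Session, user_input: str) -> str:
    """Route one user turn: planner while /plan mode is active, else main agent."""
    text = user_input.strip()
    if text == "/plan":
        session.plan_mode = True
        return "planner"
    if text == "/exit-plan":
        session.plan_mode = False
        return "main"
    return "planner" if session.plan_mode else "main"

s = Session()
assert route_turn(s, "refactor utils.py") == "main"
route_turn(s, "/plan")
assert route_turn(s, "migrate the database") == "planner"
route_turn(s, "/exit-plan")
assert route_turn(s, "continue") == "main"
```

The memory agent is deliberately absent from this router: per the description above, it runs asynchronously after a turn completes rather than owning any turn itself.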
| Agent | Primary Responsibility | Allowed/Expected Behavior | Hard Boundary |
|---|---|---|---|
| Main Agent | Execute user tasks end-to-end | Read/write files, run commands, use MCP/tools, coordinate subtasks | Must not directly read/write `memory_user.json` or `memory_project.json` |
| Planner Agent (`/plan`) | Produce and refine executable plans | Read-only context gathering, `write_todos`, `approve_plan`, optional clarification via `ask_user` | No implementation actions (no file edits, no command execution) |
| Memory Agent | Curate durable memory after each completed turn | Score and apply memory ops (`create`/`update`/`rescore`/`retier`/`archive`/`delete`/`noop`) to user/project stores | Conservative extraction; skips low-confidence or ephemeral facts |
| Local Subagents | Parallelize bounded in-process subtasks | Handle scoped tasks delegated by the main agent with explicit instructions | Operate only within the delegated scope; the main agent remains the final integrator |
| Async Subagents | Offload long/remote tasks | Launch/update/cancel remote subagent jobs via async tools | Treated as delegated workers, not the primary conversation owner |

- Planner mode uses both visible-tool filtering and runtime allow-list enforcement.
- Memory store files are protected by middleware and updated only through the memory pipeline.
- Memory extraction runs in post-turn async middleware (`aafter_agent`) so it does not block user-visible responses.

Documentation:

- Chinese guide: doc/README_CN.md
- Memory design (Chinese): doc/MEMORY_DESIGN.md
- Memory design (English): doc/MEMORY_DESIGN_EN.md

Requirements: Python 3.11+

```shell
# Install from PyPI
pip install invincat-cli
```

Or install from source:

```shell
git clone https://github.com/dog-qiuqiu/invincat.git
cd invincat
pip install -e .
```

```shell
# Start in your project directory
cd ~/my-project
invincat-cli
```

After the first launch, run `/model` to configure the model and API Key, then you can start the conversation directly.
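The planner's "runtime allow-list enforcement" mentioned above amounts to rejecting any tool call outside a small read-only/planning set. A toy sketch of that check (the `PLANNER_ALLOWED` set and `enforce_planner_call` helper are illustrative, not the project's API):

```python
# Hypothetical allow-list; real planner tooling may differ.
PLANNER_ALLOWED = {"read_file", "grep", "write_todos", "approve_plan", "ask_user"}

def enforce_planner_call(tool_name: str) -> None:
    """Reject implementation actions while planner mode is active."""
    if tool_name not in PLANNER_ALLOWED:
        raise PermissionError(f"planner mode forbids tool: {tool_name}")

enforce_planner_call("write_todos")    # planning action: allowed
try:
    enforce_planner_call("edit_file")  # implementation action: blocked
except PermissionError as e:
    print(e)
```

Pairing this runtime check with visible-tool filtering means a planner that never even sees `edit_file` still cannot call it if the model hallucinates the name.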
Run the `/model` command to open the model management interface:

- Press Ctrl+N to register a new model
- Fill in the provider, model name, and API Key
- Select from the list and press Enter to activate

| Provider | Example Models |
|---|---|
| anthropic | claude-sonnet-4-6, claude-opus-4-7 |
| openai | gpt-4o, o3 |
| google_genai | gemini-2.0-flash, gemini-2.5-pro |
| openrouter | Supports all models on OpenRouter |

For OpenAI-compatible interfaces (DeepSeek, Zhipu, local Ollama, etc.), simply set the `base_url` to connect.

| Variable | Description |
|---|---|
| ANTHROPIC_API_KEY | Anthropic API Key |
| OPENAI_API_KEY | OpenAI API Key |
| GOOGLE_API_KEY | Google API Key |
| OPENROUTER_API_KEY | OpenRouter API Key |
| TAVILY_API_KEY | Tavily web search Key (optional) |

Type your question or task directly in the input box and press Enter to send. The AI will automatically select the appropriate tools to complete the task:

```
Search for the latest usage of LangGraph interrupt
```

Press Tab to autocomplete available commands such as `/clear`, `/threads`, and `/model`. See Slash Commands for the complete list.

Use planner mode when you want to discuss and approve a plan before execution:

```
/plan
```

Then describe your task in chat. The planner agent will:

- analyze requirements with read-only tools
- write a todo list (`write_todos`)
- ask for explicit approval (`approve_plan`)

After approval, planner mode exits and keeps the approved checklist visible; the checklist is then handed to the main agent for execution. If you reject the plan, the planner stays in planning mode so you can refine requirements and regenerate the checklist.

Exit planner mode anytime with `/exit-plan`. It also cancels an in-flight planner turn and drops queued planner handoff actions, so no stale plan execution continues after exit.

Use `@` in your message to reference files, and the AI will read and understand their content:

```
@src/main.py Are there any potential performance issues in this file?
```
When the AI performs operations like file writing, shell commands, or network requests, it pauses by default for confirmation.

Auto-approve mode: press Shift+Tab to toggle. When enabled, all tool calls are automatically approved, suitable for trusted task scenarios. The status bar displays an AUTO indicator.

⚠️ It's recommended to enable auto-approve only after you're familiar with the task content.

Press Ctrl+J in the input box to insert a line break, suitable for entering longer code or paragraphs.

A lightweight compression runs automatically before each model call, with no LLM involved. The post-turn memory pipeline flows as follows:

```mermaid
flowchart TD
    B[Main Agent Response] --> C[MemoryAgentMiddleware aafter_agent]
    C --> D{Non-trivial + completed + throttle passed?}
    D -- No --> E[Skip extraction]
    D -- Yes --> F[Incremental slice by cursor + anchor]
    F --> G[Collect tool evidence]
    G --> H[Structured ops JSON]
    H --> I[Validate + guardrails]
    I --> J[Atomic write memory_user.json / memory_project.json]
    J --> K[Next turn RefreshableMemoryMiddleware injects active memory]
```

| Type | Path | Scope |
|---|---|---|
| Global Memory Store | `~/.invincat/{assistant_id}/memory_user.json` (default: `~/.invincat/agent/memory_user.json`) | Universal for all projects (coding style, personal preferences) |
| Project Memory Store | `{project root}/.invincat/memory_project.json` (fallback: `{cwd}/.invincat/memory_project.json` when no project root is detected) | Current project context (repository conventions, architecture, stack) |

AGENTS.md is deprecated for runtime memory injection. The runtime memory pipeline now uses `memory_*.json` as the single source of truth.
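The "atomic write" step in the memory pipeline above is typically implemented as write-to-temp-then-rename, so a crash mid-write never leaves a half-written store. A minimal sketch under that assumption (not Invincat's actual code; the helper name is hypothetical):

```python
import json
import os
import tempfile

def atomic_write_json(path: str, data: dict) -> None:
    """Write JSON to a temp file in the same directory, then atomically
    replace the target, so readers never see a partially written store."""
    directory = os.path.dirname(path) or "."
    fd, tmp = tempfile.mkstemp(dir=directory, suffix=".tmp")
    try:
        with os.fdopen(fd, "w", encoding="utf-8") as f:
            json.dump(data, f, ensure_ascii=False, indent=2)
        os.replace(tmp, path)  # atomic rename on POSIX and Windows
    except BaseException:
        os.unlink(tmp)  # clean up the temp file on failure
        raise

atomic_write_json("memory_demo.json", {"items": []})
assert json.load(open("memory_demo.json"))["items"] == []
os.remove("memory_demo.json")
```

Writing the temp file in the same directory matters: `os.replace` is only atomic within a single filesystem.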
Memory updates are triggered after non-trivial completed turns, with:

- incremental extraction: consume only messages added since the previous memory extraction in the same thread
- cursor invalidation fallback: if history is rewritten (for example, compaction/checkpoint replay), fall back to one full-history pass
- turn-interval throttling
- keyword-based early triggers (preferences/rules/conventions)
- time/file cooldown guards

Tune behavior via environment variables:

```shell
INVINCAT_MEMORY_CONTEXT_MESSAGES=0
INVINCAT_MEMORY_MIN_TURN_INTERVAL=1
INVINCAT_MEMORY_MIN_SECONDS_BETWEEN_RUNS=0
INVINCAT_MEMORY_FILE_COOLDOWN_SECONDS=0
```

`INVINCAT_MEMORY_CONTEXT_MESSAGES=0` means no cap on the incremental delta since the last memory extraction; set a positive integer to cap the delta to the most recent N messages. By default the memory agent runs after every non-trivial turn (`MIN_TURN_INTERVAL=1`, no wall-clock or file cooldown) so memory stays in sync with the latest signal. Raise the values to re-enable throttling if extraction cost becomes a concern.

For production tuning (cost-sensitive setups), a practical starting point is:

```shell
INVINCAT_MEMORY_MIN_TURN_INTERVAL=2
INVINCAT_MEMORY_MIN_SECONDS_BETWEEN_RUNS=8
INVINCAT_MEMORY_FILE_COOLDOWN_SECONDS=5
```

If project memory updates appear rare, check in this order:

- Is the turn non-trivial and completed? Very short confirmations (`ok`, `thanks`, `继续`) are skipped.
- Did evidence come from supported tools? Project evidence extraction prioritizes `read_file`, `edit_file`, `write_file`, `execute`, `bash`, `shell`.
- Is the evidence durable and convention-like? Temporary logs or one-off statuses are intentionally ignored.
- Is throttling active? `MIN_TURN_INTERVAL`, the wall-clock cooldown, or the file cooldown can suppress runs.
- Was history rewritten? A cursor mismatch triggers fallback behavior; check whether compaction/replay happened.
- Did writes fail guardrails? Invalid or conflicting operations are dropped by schema and safety validation.
Quick verification path:

- Run one concrete, non-trivial turn that states a stable project rule.
- Ensure at least one supporting read/execute tool result exists in that turn.
- Open `/memory` and check the project tab for new or updated active items.

`/memory` opens the full-screen memory manager for live inspection of memory stores:

- separate pages for user and project scope (1/2, or Tab to switch)
- highlights key fields (status, id, section, content) for each item
- supports r (refresh), a (show/hide archived), Esc (close)

Skills are predefined workflow templates for reusing complex task steps:

```
/skill:web-research Search for LangGraph best practices
/skill:code-review Check code quality in the src/ directory
```

| Location | Path | Description |
|---|---|---|
| Built-in Skills | Installed with the package | skill-creator |
| Global Custom | ~/.invincat/agent/skills/ | Available across projects |
| Project-level | .invincat/skills/ | Only available in the current project |

`/skill-creator` starts an interactive wizard that guides you through creating and saving new skills.

`/threads` opens the session browser, displaying all historical conversations (time, message count, branch, etc.).

`/clear` clears the current conversation and starts a new session (old sessions are still saved and can be retrieved via `/threads`).

Type `/` in the input box and press Tab to view and autocomplete all commands.
| Command | Description |
|---|---|
| /clear | Clear the current conversation, start a new session |
| /threads | Browse and restore historical sessions |
| /plan | Enter planner mode; the approved checklist is handed to the main agent |
| /exit-plan | Exit planner mode, cancel the running planner turn and queued handoff |
| /quit, /q | Exit the program |

| Command | Description |
|---|---|
| /model | Switch or manage model configurations |
| /theme | Switch color theme |
| /language | Switch interface language (Chinese / English) |
| /tokens | View token usage details |

| Command | Description |
|---|---|
| /offload, /compact | Manually compress context, free tokens |
| /memory | Open the full-screen memory manager (live user/project view) |

| Command | Description |
|---|---|
| /mcp | View connected MCP servers and tools |
| /editor | Edit the current input in an external editor |
| /skill-creator | Interactive wizard for creating new skills |
| /changelog | Open release notes/changelog |
| /feedback | Show feedback channel information |
| /docs | Open the project documentation entry |

| Command | Description |
|---|---|
| /help | Display help information |
| /version | Display the version number |
| /reload | Reload configuration files |
| /trace | Open the current conversation in LangSmith (requires configuration) |

Q: No response on first launch?
You need to configure the model first. Run `/model` → press Ctrl+N to register a model → fill in the API Key.

Q: How do I interrupt a running task?
Press Esc to interrupt the current AI response; if the AI is waiting for tool approval, Esc acts as a rejection.

Q: Context too long, causing slow responses?
Run `/offload` to manually compress history, or wait for automatic compression (it triggers when usage exceeds 80%).

Q: How do I make the AI remember my coding preferences?
Just tell the AI directly, for example "Remember: my project uses 4-space indentation, no semicolons", and it will automatically save this to the memory files at the appropriate time.

Q: How do I share skills across different projects?
Place skill files in the `~/.invincat/agent/skills/` directory for global availability; place them in `.invincat/skills/` to limit them to the current project.
This analysis was produced by the Genesis Park editorial team with the help of AI. The original article is available via the source link.