Ollama를 통해 Claude Code를 라우팅하는 데 소요되는 비용 계산(~90% 절감)
hackernews
|
|
📰 뉴스
#anthropic
#claude
#llama
#오픈소스
원문 출처: hackernews · Genesis Park에서 요약 및 분석
요약
클로드 데스크탑과 터미널의 클로드 코드를 연동하여 전략적 작업은 유료 모델로, 무거운 작업은 오픈소스 모델로 분산 처리하는 방법을 소개합니다. 이 방식은 Ollama를 통해 리팩토링이나 파일 배치 작업 등 컨텍스트가 많이 필요한 작업을 무료 모델로 수행하게 하여, 비용을 약 90% 절감할 수 있습니다. 제공되는 21장의 슬라이드와 복사-붙여넣기 프롬프트를 사용하면 macOS, Windows, Linux 등 다양한 OS에서 자동으로 설정을 완료할 수 있습니다.
본문
Pair Claude Desktop on Anthropic with Claude Code routed through Ollama in your terminal. Strategy stays on Pro. Heavy footwork runs on a free open-source model. Cut your Claude Code bill ~90%. A 21-slide visual walkthrough that pairs your Anthropic Claude Desktop app with an Ollama-backed Claude Code terminal session — so the strategic work stays on Pro while the heavy lifting runs on a free open-source model (Gemma, Qwen, DeepSeek, your pick). - Auto-detects your OS — macOS, Windows + WSL2, Linux - Copy-paste prompt — drop one block into Claude and it does ~98% of the setup for you - Verifies both engines side-by-side at the end so you know it actually worked - Self-contained HTML — no build step, opens in any browser Claude Pro on the Desktop app is great for thinking, planning, and architecture. Claude Code in the terminal eats quota fast — context-heavy tasks like lints, refactors, file batch ops, and grep-and-replace can burn through your monthly limit in days. The fix: route Claude Code through Ollama (local or cloud-hosted free model). Two engines, same UX, one bill cut by an order of magnitude. The hosted version has the full visual deck, OS-aware steps, and the copy-paste prompt: If you want to skip the visuals and let Claude do it all for you: - Open prompts/copy-paste-prompt.md - Copy the entire file contents - Paste into a fresh Claude Desktop / Claude.ai conversation - Follow along — it auto-detects your OS, installs everything, configures the router, and verifies both engines git clone https://github.com/Coherence-Daddy/use-ollama-to-enhance-claude.git cd use-ollama-to-enhance-claude/presentation open index.html # macOS # or just drag index.html into a browser | Engine | Where it runs | What it's for | |---|---|---| | Claude Desktop (Anthropic) | Native app | Strategy, architecture, code review, tricky bugs | | Claude Code → Ollama | Your terminal | Lints, refactors, repetitive edits, file batch ops | Two side-by-side panes. Same UX. One of them is free. . ├── README.md ← this file ├── LICENSE ← MIT ├── prompts/ │ └── copy-paste-prompt.md ← the canonical setup prompt └── presentation/ ├── index.html ← 21-slide visual walkthrough ├── cd-face-coral.png ← brand asset └── copy-paste-prompt.md ← (mirror of /prompts version, kept beside the deck) The presentation is the same exact HTML served at coherencedaddy.com/tutorials/use-ollama-to-enhance-claude. Open it locally, host it yourself, or fork it for your own walkthroughs. claude-code ollama gemma llm-tools cost-optimization agentic-coding anthropic open-source-llm MIT — see LICENSE . Use it, fork it, ship a YouTube walkthrough of it. No attribution required, but if you do credit it, link to coherencedaddy.com. Coherence Daddy is a 508(c)(1)(A) faith-driven technology organization on a mission to help humanity be more coherent. We build private, secure self-help tools — and the occasional tutorial that saves you a few hundred bucks on AI bills. - Website: coherencedaddy.com - Free tools (523+): coherencedaddy.com/tools - Tutorials: coherencedaddy.com/tutorials - X / Twitter: @coherencedaddy If this saved you a real chunk of money, the kindest thing you can do is ⭐ this repo and share the tutorial link.
Genesis Park 편집팀이 AI를 활용하여 작성한 분석입니다. 원문은 출처 링크를 통해 확인할 수 있습니다.
공유