매월 $100의 Claude 코드 지출을 Zed 및 OpenRouter에 재할당

hackernews | 2026년 4월 9일 18:27 | 🔬 연구

#ai 딜 #ai비용 #anthropic #claude #gemini #mistral #openai #openrouter #zed #할당전환

원문 출처: hackernews · Genesis Park에서 요약 및 분석

요약

최근 클로드(Claude) 모델의 사용량 제한이 더 빠르게 도달하는 문제가 불거지면서, 한 사용자가 기존에 월 100달러를 지불하던 구독 비용을 재분배하는 방안을 소개했다. 속도가 빠르고 가벼운 코드 에디터인 젯(Zed)을 활용하고, 오픈라우터(OpenRouter)를 통해 클로드, 제미나이 등 다양한 모델을 작업 복잡도에 맞춰 유연하게 선택할 수 있게 구성했다. 특히 오픈라우터를 사용하면 크레딧을 1년 동안 이월할 수 있는 장점이 있으며, 데이터 유출을 방지하기 위해 '제로 데이터 보존(ZDR)' 엔드포인트만 사용하는 보안 설정도 적용했다. 또한 커서(Cursor) 3.0의 완전한 에이전트 중심 재설계에 주목하며, 특정 파일 경로에만 AI 규칙이 적용되는 세밀한 컨텍스트 제어 기능을 높이 평가했다.

본문

Reallocating $100/Month Claude Code spend to Zed and OpenRouter I’ve been disappointed to feel that I’m hitting Claude limits faster than before. For context, I use both Claude Code and the Claude desktop app for work and pay $100/month for the privilege of hitting limits. I’m not the only one (this was AMD’s senior director of AI) with numerous other reports found over Reddit and Twitter. This article is how I’m reallocating that spend to other tools and models while getting more flexibility at the same time. I like options and while Opus is undoubtedly the market leader for agentic coding, there are other models that I like to use to balance cost and speed depending on the complexity of the task in hand. I’m looking at how I can use different models with an Agent Harness. An “agent harness” coordinates sending and receiving messages from LLMs, injecting tool defintions and calling the tools and orchestrating all of this into workflows (including retrying failing tasks). Claude Code is an example of such a system. It takes the user message, coordinates reading/writing files - among other things - and makes calls to the LLM. Plans: $10 / month - pricing page You don’t realise how slow/laggy VSCode and the all of the forks are until you try out Zed. The builtin agent harness is basic but nice with the ability to follow the agent around as it modifies files and to add new profiles to customise the agent behaviour. Like Cursor it shows the context usage and the rules that are being applied to the current session. If you continue to use Claude Code or other tools like Mistral Vibe, Zed integrates them directly into the editor using the Agent Client Protocol (ACP) - see supported agents. The biggest disadvantage is definitely the lack of extensions compared to VSCode but there are enough to cover common languages and common tasks. Zed do offer usage based pricing once you have used up the credits they provide however their token prices are higher than going directly to the API themselves. This is why I prefer to use the OpenRouter integration into Zed instead. A nice side benefit is you get the more native context window sizes. For some reason Zed limits the Gemini 3.1 context to 200k tokens in their native integration however with OpenRouter you can make use of the full 1M. Their docs say this may be changed in the future. The largest option of models and providers that I know of is OpenRouter and it’s easy enough to sign up, pre-pay some credits and get an API key. I don’t like that I have a set window of Anthropic credits. If I use it I have to wait for it to reset (or pay). But when I’m not using it I’m missing out on that window of opportunity. Instead I can top up my OpenRouter credits which expire after 365 days if unused. Then I can use the credits when I’m working and save them/roll-over when I’m not. To minimise data exposure risk, I have chosen not to consent to OpenRouter being able to use inputs/outputs “to improve the product” (though you get a 1% discount if you do), and I have enabled the “Zero Data Retention (ZDR) Endpoints Only” in my Workspace Guardrail settings. You do lose out on some models here - for example, qwen/qwen3.6-plus which is only hosted on Alibaba Cloud - however that’s a small price I’m willing to pay. Plans: $20 | $60 | $200 / month - pricing page I originally switched from VSCode & Copilot to Cursor in 2025 after experiencing the magic of the Cursor “Tab” jumping around the editor preempting my next move. As it moved from autocomplete-on-steroids to more agentic coding, I was thankful to have access to multiple models to experiment with (this is now also available in Copilot but in the beginning they were OpenAI only). I mostly ignored Cursor 2.0 as they put more emphasis on the chat interface however with Cursor 3.0 as a complete rewrite (in Rust like Zed) and focused on Agent orchestration, I am curious to try it out. Cursor was (or still semi-is) my preferred editor. As a VSCode fork, all extensions are available. They were an early adopter of the plan mode -> agent mode workflow and now support a new debug mode which is a more advanced print style debug that the agent can also interact with. Cursor also supports different types of rule applications, something I personally love and am surprised that other agent harnesses haven’t adopted. Most agent harnesses take an “apply intelligently” approach, trying to let the AI make decisions on when to include a rule based on the description. But Cursor also offers the ability to only apply to specific files. I know I have rules that only apply to *.py files, or even **/models.py etc. I am able to make the most of my context window by explicitly setting those rules to be added only to certain filepath regexs. It guarantees their usage Choosing Cursor you get API rate pricing above the included use in your plan (and you can limit this so your total spend is limited to $100) but you are still paying minimum $20/month which does not r

원문 보기 (hackernews)

Genesis Park 편집팀이 AI를 활용하여 작성한 분석입니다. 원문은 출처 링크를 통해 확인할 수 있습니다.

요약

본문

관련 저널 읽기