HN 표시: OpenJet – 메모리가 제한된 엣지 하드웨어를 위한 오프라인 에이전트 하네스

hackernews | 2026년 3월 16일 01:15 | 📦 오픈소스

#ai 모델 #ai 코딩 #claude #llama #openjet #로컬 llm #엣지 하드웨어 #오프라인 에이전트

원문 출처: hackernews · Genesis Park에서 요약 및 분석

요약

OpenJet은 제트손(Jetson) 및 엣지 리눅스 환경의 엄격한 메모리 제약을 고려하여 설계된 오프라인 우선 에이전트 런타임입니다. 기본적으로 llama.cpp의 서버를 활용하지만 SGLang이나 TensorRT-LLM으로도 설정이 가능하며, 제한된 RAM과 세션 단절 문제를 극복하기 위해 자동 문맥 압축 및 상태 복구 기능을 제공합니다. 또한 Python SDK 및 OpenTelemetry 계측을 지원하여 하드웨어 인식형 배포와 안정적인 오퍼레이터 워크플로우 구현을 돕습니다.

본문

An AI coding agent that runs entirely on your machine. This is Claude Code for local LLMs. OpenJet handles the model, the runtime, and the setup without having to manually wrangle complex confirgurations. You get a coding agent in your terminal that reads your files, edits your code, runs commands, and stays out of the cloud. git clone https://github.com/l-forster/open-jet.git cd open-jet ./install.sh open-jet --setup That's it. Setup detects your hardware, picks a model that fits your RAM, downloads it, and gets everything running. Already have a .gguf ? It finds that too. Then just: open-jet An agent in your terminal that can actually do things: - Read and edit your code — search files, apply edits, write new ones - Run shell commands — with explicit approval before anything executes - Resume sessions — close the terminal, come back later, pick up where you left off - Work on constrained hardware — automatic context condensing, model unload/reload around heavy tasks - Device access — cameras, microphones, GPIO for edge and embedded work - Python SDK — automate the same agent from scripts Cloud coding agents need API keys, send your code to someone else's server, and cost money per token. Local chat tools give you a chat window but not an agent — no file access, no shell, no session recovery. OpenJet closes that gap. It's built for local models on real hardware, where memory is tight, context windows are short, and sessions get interrupted. Everything runs on your machine, nothing leaves it. - Quickstart - Installation - Configuration - Runtime: llama.cpp - Python SDK - Usage: CLI - Usage: Slash commands - Usage: Device sources - Usage: Workflow harness - Usage: Session state and logging - Examples - Deployment: Jetson - Deployment: Linux x86 + NVIDIA - Deployment: CPU-only AGPL-3.0-only , with commercial licensing available under separate terms.

원문 보기 (hackernews)

Genesis Park 편집팀이 AI를 활용하여 작성한 분석입니다. 원문은 출처 링크를 통해 확인할 수 있습니다.

요약

본문

관련 저널 읽기