Show HN: DeltaMemory – Persistent Cognitive Memory for Production AI Agents

Source: hackernews · Summarized and analyzed by Genesis Park

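Summary

DeltaMemory is a cognitive memory layer built to solve the loss of data between AI agent sessions, giving agents persistent memory and temporal reasoning. It scored 89% accuracy on the LoCoMo benchmark and claims a retrieval latency of 50 ms and a 97% cost reduction compared with re-processing tokens. It is offered as an open-source SDK, is compatible with any LLM stack, and is currently in early access.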

Body

Your AI agents forget everything. We fix that.

DeltaMemory is the cognitive memory layer for production AI agents. Persistent recall, automatic fact extraction, and contextual intelligence that compounds over time.

Works with your stack

Add memory to any agent in minutes

A single SDK call gives your agents persistent memory with automatic fact extraction, knowledge graphs, and temporal reasoning.

3,714x Token Compression

Raw conversations are compressed into structured facts and a knowledge graph. 26M tokens become 7K. Your agents recall what matters without re-processing history.

Three Lines to Integrate

Install the SDK, connect to your DeltaMemory instance, and call ingest/recall. No schema design, no embedding pipelines, no infrastructure to manage.

Framework Native

First-class integrations with Vercel AI SDK, LangChain, CrewAI, and n8n. Drop DeltaMemory into your existing agent stack without rewriting your application.

Built-in Observability

Every memory operation is traced. See what facts were extracted, which memories were recalled, and how salience scores change over time. Debug agent behavior with full visibility.

Built for teams that ship to production

DeltaMemory meets the security, compliance, and deployment requirements of enterprise AI teams. Run it your way, with full control over your data.

Security and Compliance

SOC 2 and HIPAA readiness built into the architecture. Cryptographic ownership of memory graphs with fine-grained consent controls. Your data stays yours.

Deployment Flexibility

Run DeltaMemory as a managed cloud service or deploy on-premise in your own VPC. Multi-tenant isolation with per-user session management and concurrent access controls.

Full Traceability

Every memory operation produces an audit trail. Track what was ingested, what facts were extracted, which memories influenced a response, and when. Complete provenance for regulated industries.
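To make the ingest/recall flow from "Three Lines to Integrate" more concrete, here is a minimal Python sketch. It is not the DeltaMemory SDK, whose actual API the page does not show. `ToyMemory`, `Fact`, `ingest`, `recall`, and `compression_ratio` are hypothetical names invented for illustration, and sentence splitting with word-overlap ranking is a deliberately naive stand-in for the fact extraction and knowledge-graph recall the page describes. The last helper only checks the arithmetic behind the quoted "26M tokens become 7K" figure.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone


@dataclass
class Fact:
    """One stored statement, tagged with its session and ingestion time."""
    text: str
    session_id: str
    recorded_at: datetime


@dataclass
class ToyMemory:
    """Hypothetical in-memory stand-in; NOT the DeltaMemory SDK."""
    facts: list[Fact] = field(default_factory=list)

    def ingest(self, session_id: str, message: str) -> int:
        """Naive 'fact extraction': store each sentence as a separate fact."""
        sentences = [s.strip() for s in message.split(".") if s.strip()]
        now = datetime.now(timezone.utc)
        for sentence in sentences:
            self.facts.append(Fact(sentence, session_id, now))
        return len(sentences)

    def recall(self, query: str, limit: int = 3) -> list[str]:
        """Rank stored facts by how many words they share with the query."""
        query_words = set(query.lower().split())
        scored = []
        for fact in self.facts:
            overlap = len(query_words & set(fact.text.lower().split()))
            if overlap:
                scored.append((overlap, fact.text))
        # Stable sort: ties keep their ingestion order.
        scored.sort(key=lambda pair: pair[0], reverse=True)
        return [text for _, text in scored[:limit]]


def compression_ratio(raw_tokens: int, stored_tokens: int) -> float:
    """Ratio of raw conversation tokens to tokens kept after compression."""
    return raw_tokens / stored_tokens


# Facts ingested in one session remain available in a later one.
memory = ToyMemory()
memory.ingest("session-1", "The customer prefers sustainable brands. The customer wears size M.")
memory.ingest("session-2", "The customer asked about winter jackets.")
result = memory.recall("which size does the customer wear")
print(result[0])

# The page's figures: 26M raw tokens reduced to 7K stored tokens.
ratio = compression_ratio(26_000_000, 7_000)
print(round(ratio))
```

In this toy version the size fact ranks first because it shares the most words with the query. The ratio 26,000,000 / 7,000 rounds to 3,714, which matches the compression figure quoted above.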
Memory for every industry

Wherever AI agents interact with people repeatedly, DeltaMemory turns those interactions into compounding intelligence.

Patient context that persists

Medical AI assistants that remember patient history, medication interactions, and care preferences across sessions. HIPAA-ready architecture keeps data compliant. A therapy chatbot recalls that a patient mentioned anxiety triggers three sessions ago, without the patient repeating themselves.

Tutors that know each student

AI tutors that track learning progress, identify knowledge gaps, and adapt teaching style based on accumulated understanding of each student. An AI tutor remembers a student struggles with quadratic equations and adjusts difficulty automatically in future sessions.

Personalization without cold starts

Shopping assistants that build preference profiles from every interaction. No more asking the same questions. Recommendations improve with every conversation. A shopping agent knows a customer prefers sustainable brands and size M, surfacing relevant products without being asked.

Agents that never ask twice

Support agents with full customer history. Every past ticket, preference, and resolution is available instantly. Escalations include complete context. A support agent resolves a billing issue in one interaction because it already knows the customer's plan, past disputes, and preferred resolution.

Deal intelligence that compounds

Sales AI that tracks prospect interactions, objections, and buying signals across touchpoints. Every follow-up is informed by the full relationship history. A sales agent recalls that a prospect mentioned budget approval in Q2 and follows up at the right time with the right context.

Built to outperform

Benchmarked against every major memory layer on the LoCoMo long-term conversation benchmark.
Highest score on the long-term conversation benchmark

16x faster than the next closest memory layer

Complex queries across multiple conversation sessions

Direct fact retrieval from long-term conversation memory

Build with us

Our SDKs and integrations are open source. Contribute to the TypeScript SDK, build framework plugins, or join the conversation on Discord.

Open Source SDKs

Contribute to the TypeScript SDK, report bugs, and submit pull requests.

Discord

Join the community to ask questions, share use cases, and get help from the team.

Documentation

API reference, integration guides, and architecture deep dives.

Design Partners

Work directly with our engineering team to shape the roadmap and get priority support.

This analysis was written by the Genesis Park editorial team with the help of AI. The original is available via the source link.
