Promptfoo: 프롬프트, 에이전트 및 RAG를 테스트합니다.

hackernews | 2026년 3월 16일 20:23 | 📰 뉴스

#anthropic #llama #llm #openai #promptfoo #rag #red teaming #review #테스트

원문 출처: hackernews · Genesis Park에서 요약 및 분석

요약

Promptfoo는 LLM 애플리케이션의 프롬프트와 모델을 평가하고 보안 취약점을 스캔할 수 있는 CLI 도구 및 라이브러리입니다. 개발자는 OpenAI와 Anthropic 등 다양한 모델을 비교하거나 CI/CD에 자동화된 검사를 통합하여 안전하고 신뢰할 수 있는 AI 앱을 배포할 수 있습니다. 이 도구는 모든 평가를 100% 로컬에서 수행하며, MIT 라이선스 오픈 소스로 제공되어 데이터 프라이버시를 보장하고 빠른 개발 사이클을 지원합니다.

본문

promptfoo is a CLI and library for evaluating and red-teaming LLM apps. Stop the trial-and-error approach - start shipping secure, reliable AI apps. Website · Getting Started · Red Teaming · Documentation · Discord Promptfoo is now part of OpenAI. Promptfoo remains open source and MIT licensed. Read the company update. npm install -g promptfoo promptfoo init --example getting-started Also available via brew install promptfoo and pip install promptfoo . You can also use npx promptfoo@latest to run any command without installing. Most LLM providers require an API key. Set yours as an environment variable: export OPENAI_API_KEY=sk-abc123 Once you're in the example directory, run an eval and view results: cd getting-started promptfoo eval promptfoo view See Getting Started (evals) or Red Teaming (vulnerability scanning) for more. - Test your prompts and models with automated evaluations - Secure your LLM apps with red teaming and vulnerability scanning - Compare models side-by-side (OpenAI, Anthropic, Azure, Bedrock, Ollama, and more) - Automate checks in CI/CD - Review pull requests for LLM-related security and compliance issues with code scanning - Share results with your team Here's what it looks like in action: It works on the command line too: It also can generate security vulnerability reports: - Developer-first: Fast, with features like live reload and caching - Private: LLM evals run 100% locally - your prompts never leave your machine - Flexible: Works with any LLM API or programming language - Battle-tested: Powers LLM apps serving 10M+ users in production - Data-driven: Make decisions based on metrics, not gut feel - Open source: MIT licensed, with an active community - Getting Started - Full Documentation - Red Teaming Guide - CLI Usage - Node.js Package - Supported Models - Code Scanning Guide We welcome contributions! Check out our contributing guide to get started. Join our Discord community for help and discussion.

원문 보기 (hackernews)

Genesis Park 편집팀이 AI를 활용하여 작성한 분석입니다. 원문은 출처 링크를 통해 확인할 수 있습니다.

요약

본문

관련 저널 읽기