Gemini API 파일 검색은 이제 다중 모드입니다.
hackernews
|
|
📰 뉴스
#gemini
#google
#구글
#멀티모달
#임베딩
#ai 모델
원문 출처: hackernews · Genesis Park에서 요약 및 분석
요약
Gemini API의 File Search 기능이 텍스트뿐만 아니라 이미지를 함께 처리하는 멀티모달로 확장되었습니다. 이제 Gemini Embedding 2 모델을 활용해 키워드 대신 자연어 설명으로 시각적 자산을 검색할 수 있게 되었습니다. 또한, 근거 인용 기능이 도입되어 RAG 시스템의 정확성과 투명성이 강화되었습니다.
본문
Gemini API File Search is now multimodal: build efficient, verifiable RAG Today, we are expanding the Gemini API’s File Search tool. You can now build retrieval-augmented generation (RAG) systems with multimodal data and custom metadata. We’re also introducing page citations to improve grounding and transparency. Whether you are prototyping a weekend project or scaling a production application for thousands of users, your RAG systems can now natively process and better organize your text and visual data. Give your apps a photographic memory File Search now processes images and text together. Powered by the Gemini Embedding 2 model, the tool understands native image data, providing your agents contextual awareness. Think of a creative agency trying to dig up a specific visual asset. Instead of relying on keywords or filenames, your app can search an entire archive for an image matching a specific emotional tone or visual style described in a natural language brief. See how developers are already using it: Filter the noise with custom metadata Dumping files into a database is easy. Finding the right one at scale is the real challenge. Custom metadata allows you to attach key-value labels to your unstructured data — things like department: Legal or status: Final . By applying metadata filters at query time, your application can scope requests to the data slice required. This significantly reduces noise from irrelevant documents, increasing both the speed and accuracy of your RAG workflows. Show your work with page citations When your application pulls an answer from a massive PDF, users need to verify exactly where that answer came from. File Search now ties the model’s response directly to the original source. It captures the page number for every piece of indexed information. This level of granularity allows you to point users directly to the right spot, which helps build trust and makes your tool immediately useful for rigorous fact-checking. Get started with File Search We want to make it as easy as possible to store and retrieve the data that makes your ideas work. The File Search tool handles the heavy infrastructure so you can focus on building the product. Uploading files and searching across them is simple:
Genesis Park 편집팀이 AI를 활용하여 작성한 분석입니다. 원문은 출처 링크를 통해 확인할 수 있습니다.
공유