영국 정부의 Mythos AI 테스트는 사이버 보안 위협과 과대 광고를 분리하는 데 도움이 됩니다.

Ars Technica | 2026년 4월 15일 04:11 | 🔬 연구

#ai 모델 #anthropic #claude #사이버보안 #취약점 #ai보안 #mythos #취약점/보안

원문 출처: Ars Technica · Genesis Park에서 요약 및 분석

요약

영국 정부 AI 보안 연구소는 안스로픽의 신개념 AI 모델 '미토스 프리뷰'의 사이버 공격 능력을 평가한 결과를 발표했습니다. 단일 보안 과제에서는 타 최신 모델과 큰 차이가 없었으나, 이 모델은 여러 과제를 연결해 시스템에 침투하는 다단계 공격 수행 능력에서 돌보였습니다. 또한 과거 GPT-3.5 터보가 어려움을 겪었던 하위 난이도의 캡처 더 플래그 과제 중 약 85% 이상을 해결하는 성과를 보였습니다.

본문

Last week, Anthropic announced it was restricting the initial release of its Mythos Preview model to "a limited group of critical industry partners," giving them time to prepare for a model that it said is "strikingly capable at computer security tasks." Now, the UK government's AI Security Institute (AISI) has published an initial evaluation of the model's cyber-attack capabilities that adds some independent public verification to those Anthropic reports. AISI's findings show that Mythos isn't significantly different from other recent frontier models when it comes to tests of individual cyber-security related tasks. But Mythos could set itself apart from previous models through its ability to effectively chain these tasks together into the multi-step series of attacks necessary to fully infiltrate some systems. "The Last Ones" finally falls AISI has been putting various AI models through specially designed Capture the Flag challenges since early 2023, when GPT-3.5 Turbo struggled to complete any of the group's relatively low-level "Apprentice" tasks. Since then, performance of subsequent models has risen steadily, to the point where Mythos Preview can complete north of 85 percent of those same Apprentice-level CTF tasks.Read full article Comments

원문 보기 (Ars Technica)

Genesis Park 편집팀이 AI를 활용하여 작성한 분석입니다. 원문은 출처 링크를 통해 확인할 수 있습니다.

요약

본문

관련 저널 읽기