AI 에이전트를 위한 새로운 무어의 법칙

hackernews | | 🤖 AI 모델
#ai 에이전트 #chatgpt #review #성능 향상 #코딩
원문 출처: hackernews · Genesis Park에서 요약 및 분석

요약

연구원 METR의 분석에 따르면, AI 에이전트가 수행할 수 있는 코딩 작업의 시간 길이가 약 7개월마다 두 배로 증가하는 등 급격한 성장세를 보이고 있습니다. 2019년부터 2026년까지의 모델을 230개 이상의 과제로 테스트한 결과, AI 성공률과 작업 난이도 간에 강한 상관관계가 발견되었습니다. 이러한 추세는 현재까지 정체 기미 없이 지속되며, AI가 자율적으로 해결 가능한 작업의 범위가 지수적으로 확장되고 있음을 시사합니다.

본문

Last updated March 2026 A new Moore's Law for AI agents When ChatGPT came out in 2022, it could do 30 second coding tasks. Today, AI agents can autonomously do coding tasks that take humans over fourteen hours. The length of coding tasks frontier systems can complete is growing exponentially – doubling every 7 months. This trend was discovered by researchers at METR. They took the most capable agents from 2019 to 2026, and tested them on about 230 tasks: mostly coding tasks, with some on general reasoning. Then, they compared the agent's success rate to the length of each task – how long it takes human professionals to complete, ranging from under 30 seconds to over 8 hours. Across all models tested, two clear patterns emerged: - Task length is highly correlated with agent success rate (R² = 0.83) - The length of tasks that agents succeed at 50% of the time – the time horizon – is growing exponentially What comes next? This exponential trend seems robust, and there's no evidence of plateauing. Exponentials grow fast. Extrapolating out, this trend predicts: - 2027: 1 work day (8 hours) - 2028: 1 work week (40 hours) - 2029: 1 work month (167 hours) Recently, the trend has accelerated. In 2024-2025, time horizons doubled every 4 months, down from every 7 months over 2019-2025. If the faster trend continues, agents might reach month-long tasks in 2027. However, looking at just one year's data gives a less robust estimate. The rate of progress might slow down. It might also speed up. Given that the trend has already sped up, it could be on a growth trajectory that's faster than exponential. This fits intuitively: there might be a bigger gap in required skills between 1 and 2 week tasks, than 1 and 2 year tasks. Additionally, as AIs improve they'll be increasingly useful for developing yet more capable AIs. This could also lead to superexponential growth in AIs' time horizons. Increasingly capable AI systems could trigger a flywheel of acceleration – agents speeding up the creation of more capable agents, which speed up the creation of more capable agents. From here, agent capabilities might skyrocket beyond any human's abilities in AI research – and across many or all other domains. The effects would be transformative. If automating AI research leads to progress this fast, the rapidly increasing time horizon of AI systems might end up being one of the most important trends in human history.

Genesis Park 편집팀이 AI를 활용하여 작성한 분석입니다. 원문은 출처 링크를 통해 확인할 수 있습니다.

공유

관련 저널 읽기

전체 보기 →