Ask HN: Is ongoing AI research actually improving the performance of large language models (LLMs)?

Original source: hackernews · Summary and analysis by Genesis Park

Summary

A hobbyist reports that, when comparing models such as Opus or Codex with Chinese and open-source models, reasoning ability feels about the same and the difference shows up mainly in coding.

Full text

I'm just a hobbyist who has run LLMs locally and follows a lot of content about them. I hope we have a few AI researchers here on HN who can clarify this.

When using Opus or Codex vs. a Chinese or open-source model, it feels like their reasoning capabilities are basically the same.

The difference is typically in coding. It looks like OpenAI and Anthropic invest a lot in pre-training (paying Mercor and the like).

They also invest a lot in creating synthetic data, which I believe involves more AI research and techniques.

This ends up creating the impression that the model is smart: after all, it has been trained on what you want to do, so it can do that for you.

But overall, is there really much AI research being done at those companies, or are the AI researchers mostly fine-tuning small aspects of the model, akin to what Google engineers used to do for Google search?

Of course, there's also the RLHF loop from developers using Anthropic/OpenAI products, which probably provides very good data.

I ask because it all looks like somebody with money could throw money at the problem and end up with a better model, provided they do what I outlined above better, with AI research not really being that important.

It still often feels like talking with ChatGPT 4, just with better data.

Even the big upgrade of Claude Code being able to work autonomously looks to be mainly due to it knowing how to grab context and make tool calls (not saying that this is easy), rather than the model's raw performance being better.

Or am I wrong, and is there something extremely good in those models that AI researchers discovered and that the others don't have?

This analysis was written by the Genesis Park editorial team with the help of AI. The original post is available via the source link.
