Why using AI to modernize legacy mainframe hairballs is harder than you think

hackernews | 🔬 Research
#ai #genai #review #legacy #mainframe #code-modernization
Original source: hackernews · Summarized and analyzed by Genesis Park

Summary

AI-driven legacy code modernization is harder than it looks because of technical limitations. Existing tools fail to maintain context when processing large codebases, lack semantic understanding of business logic, and struggle to rearchitect monolithic structures into cloud-native designs. Sage AI Technologies, by contrast, claims to ingest the entire codebase, generate a system-level architectural model from it, and use that model to train tailored LLM agents, maintaining accuracy even in large-scale modernization work.

Full text

Why using AI to modernize legacy mainframe hairballs is harder than you think
Jason Bloomberg, Managing Director, Intellyx BV

Generative AI (genAI) is a legacy code modernization magic wand. Put that mainframe hairball in one end and well-architected modern code pops out the other, ready for deployment in the cloud of your choice, right?

Not so fast. It's true that there are now several genAI-based legacy code modernization tools on the market that can do a reasonable job of translating, say, COBOL or PL/I code into Java or Python. As a rule, however, these tools cannot scale. Feed them large codebases – the 'hairballs' that most mainframe-dependent organizations struggle with on a daily basis – and they fall over.

Sage AI Technologies, however, is the exception. Why does genAI fall short when modernizing such codebases? And how does Sage solve this hairball of a problem?

Challenge #1: Maintaining context at scale

Every genAI large language model (LLM) has a limit on how much information it can accept in a prompt. This context window constrains the level of understanding any modernization tool can bring to a legacy codebase. Furthermore, the more code fed into the tool, the more tokens it consumes, and the more expensive it becomes to run. The result: more money for poorer results.

Many legacy mainframe codebases run to millions of lines of code across thousands of programs, as well as numerous copybooks, JCL jobs, and data definitions. As a result, most AI-based modernization tools have a global context problem that limits whole-system understanding of legacy codebases.

In addition, legacy mainframe code written in languages like COBOL, PL/I, and REXX typically ties business logic to core operations. GenAI struggles to interpret such logic, especially at scale, as it often misses critical edge cases or other functionality.
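The context-window arithmetic here can be made concrete with a rough back-of-the-envelope sketch. The tokens-per-line ratio and window size below are illustrative assumptions, not figures from the article:

```python
# Rough sketch: why a mainframe-scale codebase cannot fit in one LLM prompt.
# Both constants are illustrative assumptions, not figures from the article.

TOKENS_PER_LINE = 12       # assumed average tokens per line of COBOL-style source
CONTEXT_WINDOW = 128_000   # assumed context window of a large model

def prompts_needed(lines_of_code: int) -> int:
    """How many maximally packed prompts a codebase would require."""
    total_tokens = lines_of_code * TOKENS_PER_LINE
    # Ceiling division: even a partial final chunk needs its own prompt.
    return -(-total_tokens // CONTEXT_WINDOW)

# A 5-million-line legacy system under these assumptions:
print(prompts_needed(5_000_000))  # 469 separate prompts, none of which sees the whole system
```

Whatever the exact numbers, the shape of the problem is the same: the codebase must be sliced into hundreds of fragments, and no single fragment carries the global context.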
LLMs may do a good job of understanding code locally at the program or file level but fail at making sense of code at the global, end-to-end business flow level.

Challenge #2: Understanding the semantics of business logic

LLMs generally understand syntax patterns, giving modernization tools at least the appearance of code understanding. Where genAI struggles, however, is in understanding the semantics of mainframe logic.

After all, the original programmers of mainframe applications had no idea that decades later, AI would need to make sense of their code. They had no qualms, therefore, about incorporating numeric flags, implicit sequencing assumptions, data-driven control flow, or 'magic' values like hardcoded maximum values or error codes – implicit semantic signifiers that present booby traps for any LLM.

Sometimes these semantic signifiers represent logical edge cases – edge cases that the AI may miss entirely. In other situations, the modernization tool may generate financial logic errors: errors that mainframe-centric organizations simply cannot tolerate.

Without an understanding of such code modernization traps, AI-based modernization tools struggle to uncover the business intent in the code they are modernizing. They end up generating results that may look correct but include subtle logic errors that testers find difficult to uncover.

Challenge #3: Modernizing architecture vs. modernizing code

Many legacy modernization tools purport to offer some level of architectural modernization. For example, it's common for such tools to identify subroutines in the mainframe code and translate them into shared objects or microservices in Java. Modernizing legacy procedural hairballs into efficient cloud native applications, however, is far more difficult.
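The 'magic value' booby trap described under Challenge #2 is easy to illustrate with a small sketch. The status codes below are invented for illustration; the point is that one numeric field carries several meanings, and a translation that misses one of them produces exactly the kind of subtle, hard-to-test logic error the article warns about:

```python
# Hypothetical sketch of a COBOL-style "magic value" trap.
# In legacy code, a single numeric status field often encodes several meanings:
#   0 = ok, 99 = record not found, 88 = end-of-file sentinel.
# These particular codes are invented for illustration.

LEGACY_NOT_FOUND = 99
LEGACY_EOF = 88

def legacy_lookup_status(code: int) -> str:
    """Faithful translation: every magic value is handled explicitly."""
    if code == 0:
        return "ok"
    if code == LEGACY_NOT_FOUND:
        return "not-found"
    if code == LEGACY_EOF:
        return "eof"  # the edge case a naive translation is likely to drop
    return "error"

def naive_lookup_status(code: int) -> str:
    """A translation that treats any nonzero code as an error: it looks
    correct on common inputs but silently misclassifies the EOF sentinel."""
    return "ok" if code == 0 else "error"

print(legacy_lookup_status(88))  # eof
print(naive_lookup_status(88))   # error  <- subtle logic error, hard to catch in testing
```

Both versions agree on the common paths, which is precisely why testers find such errors difficult to uncover.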
Simply creating microservices isn't enough: a modernization tool must also understand the bounded contexts that properly separate microservice functionality so that it can deliver the scale and flexibility that cloud native architectures promise. Instead, legacy modernization tools are more likely to preserve the monolithic code structure by wrapping legacy logic rather than rearchitecting it to follow modern architectural best practices.

A better approach to AI-based modernization: Sage AI Technologies

The modernization tooling from Sage represents a next-generation approach to applying genAI to the challenge of legacy codebase modernization at scale. Rather than limiting input code to a particular context window, Sage ingests the entire codebase. As a result, it is able to produce high-level architectural models and insights that give engineers a system-wide map of components, data flows, and dependencies. In other words, Sage anchors AI outputs in the structure of the entire application, rather than in separate parts of it, thus maintaining context at scale.

Sage then leverages the architectural models it produces, plus auto-generated documentation, to train domain-aware LLM agents, essentially tailoring them for each specific codebase. As a result, Sage's reasoning leverages context grounded in each particular codebase.
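The article does not describe Sage's internals, but the general idea of ingesting an entire codebase to produce a system-wide map of components and dependencies can be sketched as a call-graph build. The regex and sample programs below are toy assumptions; a real tool would use a full COBOL parser rather than pattern matching:

```python
import re
from collections import defaultdict

# Toy sketch of whole-codebase ingestion producing a system-wide dependency
# map, in the spirit of the approach the article attributes to Sage.
# The CALL-statement regex and sample sources are illustrative only.

CALL_RE = re.compile(r"CALL\s+'([A-Z0-9-]+)'")

def build_dependency_graph(sources: dict[str, str]) -> dict[str, set[str]]:
    """Map each program name to the set of programs it CALLs."""
    graph = defaultdict(set)
    for program, text in sources.items():
        for callee in CALL_RE.findall(text):
            graph[program].add(callee)
    return dict(graph)

sources = {
    "PAYROLL": "PROCEDURE DIVISION. CALL 'TAXCALC'. CALL 'PRINTRPT'.",
    "TAXCALC": "PROCEDURE DIVISION. CALL 'RATETBL'.",
}
graph = build_dependency_graph(sources)
print(sorted(graph["PAYROLL"]))  # ['PRINTRPT', 'TAXCALC']
```

Because the graph is built over every program at once rather than over one prompt-sized fragment at a time, it can anchor downstream analysis in the structure of the whole application, which is the 'context at scale' property the article emphasizes.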

This analysis was produced by the Genesis Park editorial team with the help of AI. The original article is available via the source link.
