OpenAI spills technical details about how its AI coding agent works

A Glimpse into the Inner Workings of Codex: OpenAI's AI Coding Agent

On Friday, OpenAI engineer Michael Bolin published a detailed technical breakdown of how the company's Codex CLI coding agent works internally, offering developers insight into AI coding tools that can write code, run tests, and fix bugs with human supervision. This technical post complements our previous article on how AI agents work, filling in the technical details on how OpenAI implements its "agentic loop."

The Rise of AI Coding Agents

AI coding agents are having a "ChatGPT moment," where Claude Code with Opus 4.5 and Codex with GPT-5.2 have reached a new level of usefulness for rapidly coding up prototypes, interfaces, and churning out boilerplate code. The timing of OpenAI's post details the design philosophy behind Codex just as AI agents are becoming more practical tools for everyday work.

The Controversy Surrounding AI Coding Agents

These tools aren't perfect and remain controversial for some software developers. While OpenAI has previously told Ars Technica that it uses Codex as a coding tool to help develop the Codex product itself, we also discovered, through hands-on experience, that these tools can be astonishingly fast at simple tasks but remain brittle beyond their training data and require human oversight for production work. The rough framework of a project tends to come fast and feels magical, but filling in the details involves tedious debugging and workarounds for limitations the agent cannot overcome on its own.

The Technical Challenges of Codex

Bolin's post doesn't shy away from these engineering challenges. He discusses the inefficiency of quadratic prompt growth, performance issues caused by cache misses, and bugs the team discovered (like MCP tools being enumerated inconsistently) that they had to fix. The level of technical detail is somewhat unusual for OpenAI, which has not published similar breakdowns of how other products like ChatGPT work internally.

The Agent Loop: A Technical Breakdown

Bolin's post focuses on what he calls "the agent loop," which is the core logic that orchestrates interactions between the user, the AI model, and the software tools the model invokes to perform coding work. At the center of every AI agent is a repeating cycle. The agent takes input from the user and prepares a textual prompt for the model. The model then generates a response, which either produces a final answer for the user or requests a tool call (such as running a shell command or reading a file). If the model requests a tool call, the agent executes it, appends the output to the original prompt, and queries the model again. This process repeats until the model stops requesting tools and instead produces an assistant message for the user.

Constructing the Initial Prompt

The looping process has to start somewhere, and Bolin's post reveals how Codex constructs the initial prompt sent to OpenAI's Responses API, which handles model inference. The prompt is built from several components, each with an assigned role that determines its priority: system, developer, user, or assistant. The instructions field comes from either a user-specified configuration file or base instructions bundled with the CLI. The tools field defines what functions the model can call, including shell commands, planning tools, web search capabilities, and any custom tools provided through Model Context Protocol (MCP) servers. The input field contains a series of items that describe the sandbox permissions, optional developer instructions, environment context like the current working directory, and finally the user's actual message.

The Quadratic Growth of Prompts

As conversations continue, each new turn includes the complete history of previous messages and tool calls. This means the prompt grows with every interaction, which has performance implications. According to the post, because Codex does not use an optional "previous_response_id" parameter that would allow the API to reference stored conversation state, every request is fully stateless (that is, it sends the entire conversation history with each API call rather than the server retrieving it from memory). Bolin says this design choice simplifies things for API providers and makes it easier to support customers who opt into "Zero Data Retention," where OpenAI does not store user data.

The Inefficiency of Quadratic Prompt Growth

The quadratic growth of prompts over a conversation is inefficient, but Bolin explains that prompt caching mitigates this issue somewhat. Cache hits only work for exact prefix matches within a prompt, which means Codex must carefully avoid operations that could cause cache misses. Changing the available tools, switching models, or modifying the sandbox configuration mid-conversation can all invalidate the cache and hurt performance.

Compacting Conversations

The ever-growing prompt length is directly related to the context window, which limits how much text the AI model can process in a single inference call. Bolin writes that Codex automatically compacts conversations when token counts exceed a threshold, just as Claude Code does. Earlier versions of Codex required manual compaction via a slash command, but the current system uses a specialized API endpoint that compresses context while preserving summarized portions of the model's "understanding" of what happened through an encrypted content item.

Future Developments

Bolin says that future posts in his series will cover the CLI's architecture, tool implementation details, and Codex's sandboxing model. This technical post provides a valuable insight into the inner workings of Codex, highlighting the challenges and complexities of building an AI coding agent. As AI agents continue to evolve and improve, it will be interesting to see how OpenAI and other companies address these technical challenges and push the boundaries of what is possible with AI-powered coding tools.

Conclusion

The technical post by Michael Bolin provides a comprehensive overview of how Codex works, highlighting the agent loop, prompt construction, and the challenges of quadratic prompt growth. This post is a valuable resource for developers and researchers interested in AI-powered coding tools, and it provides a unique insight into the inner workings of Codex. As AI agents continue to evolve and improve, it will be interesting to see how OpenAI and other companies address these technical challenges and push the boundaries of what is possible with AI-powered coding tools.

Source: https://arstechnica.com/ai/2026/01/openai-spills-technical-details-about-how-its-ai-coding-agent-works/

OpenAI spills technical details about how its AI coding agent works

OpenAI spills technical details about how its AI coding agent works

A Glimpse into the Inner Workings of Codex: OpenAI's AI Coding Agent

The Rise of AI Coding Agents

The Controversy Surrounding AI Coding Agents

The Technical Challenges of Codex

The Agent Loop: A Technical Breakdown

Constructing the Initial Prompt

The Quadratic Growth of Prompts

The Inefficiency of Quadratic Prompt Growth

Compacting Conversations

Future Developments

Conclusion

About the Author

Share this article

Related Posts

The latest AI news we announced in May 2026

The Download: AI hacking beyond Mythos, and chatbots' impact on our brains

The Meta hack shows there’s more to AI security than Mythos