Akshay 🚀(@akshay_pachaar): Claude Code's architecture, mapped.

Claude Code is one of the most powerful agent harnesses out there. it's a lot more than "a CLI that calls claude." the actual system has six layers, and the model is just one node inside the loop.

the diagram breaks down every component:

𝗜𝗻𝗽𝘂𝘁 𝗟𝗮𝘆𝗲𝗿 handles session management, permission gating, and YAML-based trust tiers before anything reaches the model.

𝗞𝗻𝗼𝘄𝗹𝗲𝗱𝗴𝗲 𝗟𝗮𝘆𝗲𝗿 holds the skill registry, context compressor (3-layer, 92% threshold), task graph, and cross-session memory store. this is where harness intelligence lives outside the weights.

𝗘𝘅𝗲𝗰𝘂𝘁𝗶𝗼𝗻 𝗟𝗮𝘆𝗲𝗿 runs tool dispatch through a typed registry with one handler per tool: bash, read, write, grep, glob, revert. a streaming runtime handles parallel execution. the prompt cache reuses stable prefixes at 10% of the normal input cost.

𝗜𝗻𝘁𝗲𝗴𝗿𝗮𝘁𝗶𝗼𝗻 𝗟𝗮𝘆𝗲𝗿 connects the MCP runtime to external servers: filesystem, git, custom. tools register inward, memory writes outward to agent_memory.md.

𝗠𝘂𝗹𝘁𝗶-𝗔𝗴𝗲𝗻𝘁 𝗟𝗮𝘆𝗲𝗿 is the most underappreciated piece: subagent spawner, teammate mailboxes over redis pub/sub, FSM protocol (IDLE→REQUEST→WAIT→RESPOND), autonomous board with atomic locks, and worktree isolation with per-task branches and conflict detection on merge.

𝗢𝗯𝘀𝗲𝗿𝘃𝗮𝗯𝗶𝗹𝗶𝘁𝘆 𝗟𝗮𝘆𝗲𝗿 wraps everything: an event bus with lifecycle hooks, and a background executor running non-blocking daemon threads.

the master agent loop sits at the center: perception → action → observation. it's deliberately simple, a "dumb loop" where the model reasons and the harness mediates.

this is the architecture behind what feels like magic when you use claude code. it's not magic. it's harness engineering.

the article below is a deep-dive covering how Anthropic, OpenAI, LangChain, and others build this pattern from the ground up.
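to make the "dumb loop" and the typed tool registry concrete, here is a minimal sketch in Python. it is not Claude Code's actual implementation: `ToolRegistry`, `run_loop`, `scripted_model`, and the in-memory `FAKE_FS` are all hypothetical names made up for illustration, and the model is replaced by a scripted stub so the example runs offline.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class ToolCall:
    name: str
    args: Dict[str, str]


@dataclass
class ModelTurn:
    text: str
    tool_call: Optional[ToolCall] = None  # None means the model is finished


Handler = Callable[[Dict[str, str]], str]
Model = Callable[[List[str]], ModelTurn]


class ToolRegistry:
    """Hypothetical registry: exactly one handler per tool name."""

    def __init__(self) -> None:
        self._handlers: Dict[str, Handler] = {}

    def register(self, name: str, handler: Handler) -> None:
        if name in self._handlers:
            raise ValueError(f"tool {name!r} already registered")
        self._handlers[name] = handler

    def dispatch(self, call: ToolCall) -> str:
        handler = self._handlers.get(call.name)
        if handler is None:
            return f"error: unknown tool {call.name!r}"
        return handler(call.args)


def run_loop(model: Model, registry: ToolRegistry, task: str,
             max_steps: int = 10) -> List[str]:
    """Perception -> action -> observation, until the model stops calling tools."""
    transcript = [f"user: {task}"]
    for _ in range(max_steps):
        turn = model(transcript)                         # perception: model sees the transcript
        transcript.append(f"model: {turn.text}")
        if turn.tool_call is None:
            break
        observation = registry.dispatch(turn.tool_call)  # action: harness runs the tool
        transcript.append(f"tool[{turn.tool_call.name}]: {observation}")  # observation
    return transcript


def scripted_model(transcript: List[str]) -> ModelTurn:
    """Stand-in for a real model call (assumption): read once, then stop."""
    if not any(line.startswith("tool[") for line in transcript):
        return ModelTurn("I'll read the file.", ToolCall("read", {"path": "notes.txt"}))
    return ModelTurn("done")


FAKE_FS = {"notes.txt": "hello"}  # in-memory stand-in for the filesystem

registry = ToolRegistry()
registry.register("read", lambda args: FAKE_FS.get(args["path"], "error: no such file"))

transcript = run_loop(scripted_model, registry, "summarize notes.txt")
for line in transcript:
    print(line)
```

the loop itself makes no decisions. the model picks the next tool call, and the harness only dispatches it and feeds the result back, which is the split the post describes.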

2026.05.10 14:22

Akshay 🚀@akshay_pachaar

2026.04.06 13:31

