Papers on Agent Architectures
A categorized index of 227 research papers on agent architectures, execution loops, multi-agent coordination, communication protocols, and deployment patterns. Last updated March 2026.
Agent Architecture Surveys
17 papers
Unified taxonomy: policy core, memory, planners, tool routers, critics. "Agent transformer" abstraction.
Taxonomy: Perception, Brain, Planning, Action, Tool Use, Collaboration.
Dual paradigm: Symbolic/Classical vs Neural/Generative. PRISMA-based review of 90 studies.
Single vs multi-agent. Vertical vs horizontal. Planning/execution/reflection.
Protocol-focused: MCP, A2A, ACP, ANP, Agora.
Agent Execution Loops & Reasoning
19 papers
The canonical pattern. 34% improvement on ALFWorld.
Self-reflection as verbal reinforcement.
Tree-based search over reasoning paths.
MCTS for LLM agents.
Iterative self-improvement within single generation.
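As a rough illustration of the "tree-based search over reasoning paths" entry above, the sketch below runs a beam-limited breadth-first search over partial paths. It is a toy under stated assumptions, not the method of any listed paper: `propose`, `score`, and `is_goal` are hypothetical stand-ins for what would normally be model calls, and the arithmetic puzzle exists only to make the example deterministic.

```python
from typing import Callable, List, Optional, Tuple

Path = Tuple[int, ...]  # a partial reasoning path; here, a sequence of numbers


def tree_search(
    root: Path,
    propose: Callable[[Path], List[Path]],  # hypothetical: expands a path into children
    score: Callable[[Path], float],         # hypothetical: higher means more promising
    is_goal: Callable[[Path], bool],        # hypothetical: True when the path is a solution
    beam_width: int = 3,
    max_depth: int = 6,
) -> Optional[Path]:
    """Expand paths level by level, keeping only the best `beam_width` per level."""
    if is_goal(root):
        return root
    frontier = [root]
    for _ in range(max_depth):
        candidates = [child for path in frontier for child in propose(path)]
        if not candidates:
            return None
        for child in candidates:
            if is_goal(child):
                return child
        # Prune: keep only the highest-scoring partial paths.
        candidates.sort(key=score, reverse=True)
        frontier = candidates[:beam_width]
    return None


# Toy problem (assumption for illustration): reach 11 from 1 using "+3" or "*2".
TARGET = 11


def propose(path: Path) -> List[Path]:
    last = path[-1]
    return [path + (last + 3,), path + (last * 2,)]


def score(path: Path) -> float:
    return -abs(TARGET - path[-1])


def is_goal(path: Path) -> bool:
    return path[-1] == TARGET


result = tree_search((1,), propose, score, is_goal)
print(result)  # (1, 4, 8, 11)
```

In an agent setting, `propose` would sample candidate next thoughts and `score` would be a learned or prompted evaluator. The beam width trades search cost against the risk of pruning the path that would have succeeded.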
Multi-Agent Coordination & Orchestration
18 papers
Unified framework: planning, policy, state, quality. MCP + A2A.
Centralized puppeteer orchestrator. Dynamic agent selection.
Quantitative scaling. Independent/decentralized/centralized/hybrid compared.
Topology selection as function of task dependency. References Claude Code Agent Teams.
Central planning agent + specialized sub-agents.
Agent Communication Protocols
7 papers
Agent-to-tool standard. Client-host-server. JSON-RPC 2.0.
Peer coordination, negotiation, delegation. Agent cards.
Classification of MCP, A2A, ACP, ANP.
MCP vs ACP vs A2A vs ANP.
Meta-coordination layer. Protocol Documents for protocol selection.
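Several entries above mention JSON-RPC 2.0 as the wire format for agent-to-tool calls. The sketch below shows the generic JSON-RPC 2.0 request/response envelope and a tiny dispatcher. The envelope fields and error codes follow the JSON-RPC 2.0 specification; the method name `call_tool`, the `name`/`arguments` parameter layout, and the `add` tool are hypothetical and are not taken from any protocol listed here.

```python
import json
from itertools import count
from typing import Any, Callable, Dict

_ids = count(1)  # monotonically increasing request ids


def make_request(method: str, params: Dict[str, Any]) -> Dict[str, Any]:
    """Build a JSON-RPC 2.0 request object."""
    return {"jsonrpc": "2.0", "id": next(_ids), "method": method, "params": params}


def make_result(request_id: Any, result: Any) -> Dict[str, Any]:
    """Build a success response; `id` echoes the request id."""
    return {"jsonrpc": "2.0", "id": request_id, "result": result}


def make_error(request_id: Any, code: int, message: str) -> Dict[str, Any]:
    """Build an error response using a JSON-RPC 2.0 error object."""
    return {"jsonrpc": "2.0", "id": request_id, "error": {"code": code, "message": message}}


def handle(request: Dict[str, Any], tools: Dict[str, Callable[..., Any]]) -> Dict[str, Any]:
    """Dispatch a hypothetical `call_tool` request to a registered tool."""
    request_id = request.get("id")
    if request.get("jsonrpc") != "2.0" or "method" not in request:
        return make_error(request_id, -32600, "Invalid Request")
    if request["method"] != "call_tool":  # hypothetical method name
        return make_error(request_id, -32601, "Method not found")
    params = request.get("params", {})
    tool = tools.get(params.get("name"))
    if tool is None:
        return make_error(request_id, -32602, "Invalid params")
    return make_result(request_id, tool(**params.get("arguments", {})))


# Round-trip through JSON to mimic a client and server exchanging text.
tools = {"add": lambda a, b: a + b}
req = make_request("call_tool", {"name": "add", "arguments": {"a": 2, "b": 3}})
resp = handle(json.loads(json.dumps(req)), tools)
print(json.dumps(resp))
```

A real deployment would add a transport, capability negotiation, and schema validation of tool arguments, all of which this sketch leaves out.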
Self-Improving & Self-Evolving Agents
14 papers
Offline Self-Distillation -> Online Interaction.
Sequential Rollout. Skill-integrated Reward. 8.9% improvement.
Extrinsic vs intrinsic metacognition.
Four principles: exploration, memory, skill transfer, planning.
Comprehensive taxonomy. Intra vs inter-test-time learning.
Agent Memory Systems
21 papers
Four networks, three operations. 91.4% on LongMemEval.
Semantic, temporal, causal, entity graphs. Policy-guided traversal.
Shared vs distributed. Three-layer hierarchy. Two protocol gaps.
Comprehensive taxonomy. Evaluation limitations.
Critic Router. Reward Prediction Error gating. SKIP/CONSTRUCT/EVOLVE.
Agent Tool Use
30 papers
Comprehensive tool-augmented LLM survey.
Large-scale tool use benchmark.
Updated tool learning survey.
MCP-based tool use benchmark.
AST-analysis for function calling validation.
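To make the "AST-analysis for function calling validation" entry concrete, here is a minimal sketch of the general idea using Python's standard `ast` module: parse a model-emitted call string without executing it, then check the function name and keyword arguments against a schema. The `SCHEMA` layout and the `get_weather` function are hypothetical, and this is not presented as the listed paper's actual procedure.

```python
import ast
from typing import Dict, List, Set

# Hypothetical schema: function name -> required and optional keyword arguments.
SCHEMA: Dict[str, Dict[str, Set[str]]] = {
    "get_weather": {"required": {"city"}, "optional": {"days"}},
}


def validate_call(source: str, schema: Dict[str, Dict[str, Set[str]]]) -> List[str]:
    """Return a list of problems found in a single function-call string."""
    try:
        tree = ast.parse(source.strip(), mode="eval")
    except SyntaxError as exc:
        return [f"syntax error: {exc.msg}"]

    call = tree.body
    if not isinstance(call, ast.Call) or not isinstance(call.func, ast.Name):
        return ["expected a plain function call"]

    spec = schema.get(call.func.id)
    if spec is None:
        return [f"unknown function: {call.func.id}"]

    errors: List[str] = []
    if call.args:
        errors.append("positional arguments are not allowed")

    allowed = spec["required"] | spec["optional"]
    seen: Set[str] = set()
    for kw in call.keywords:
        if kw.arg is None:  # `**mapping` unpacking has no argument name
            errors.append("**kwargs unpacking is not allowed")
            continue
        seen.add(kw.arg)
        if kw.arg not in allowed:
            errors.append(f"unexpected argument: {kw.arg}")
        try:
            ast.literal_eval(kw.value)  # accept only literal values, never expressions
        except (ValueError, TypeError):
            errors.append(f"argument {kw.arg} is not a literal")

    for name in sorted(spec["required"] - seen):
        errors.append(f"missing required argument: {name}")
    return errors


print(validate_call('get_weather(city="Paris", days=3)', SCHEMA))  # []
print(validate_call("get_weather(days=3)", SCHEMA))
```

Because the string is only parsed and never evaluated, a malformed or malicious call is rejected before it reaches any tool. Type checking of argument values against the schema would be a natural next step.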
Agent Evaluation & Benchmarks
24 papers
Taxonomy: what to evaluate + how to evaluate.
Comprehensive evaluation methods survey.
Four pillars: LLM, Memory, Tools, Environment.
Enterprise-focused. Orchestration + memory + tool interaction.
Software engineering benchmark.
Agent Safety & Security
27 papers
110 harmful behaviors, 11 categories, 440 tasks.
349 environments, 2000 test cases, 8 risk categories. None >60% safe.
Five-generation taxonomy.
Memory/knowledge base poisoning attacks.
Blockchain for coordination trust.
Enterprise & Production
14 papers
Four-level maturity: Prompt -> Context -> Intent -> Specification.
End-to-end guide. MCP, orchestration, observability.
TDD/BDD-inspired continuous evaluation.
Enterprise multi-agent coordination switchboard.
Cross-organizational governance.
Domain-Specific Agents
22 papers
Five pillars: planning, tools, memory, collaboration, evolution.
End-to-end scientific research agent.
Chemistry agent with real lab integration.
SE-specific agent design.
Financial services agent crews.
Context Management & Compaction
14 papers
54% context reduction without quality loss.
Condenses reasoning steps into compact form.
Prompt compression preserving semantics.
36,000+ messages from real agentic sessions.
Four-level maturity model. Context as engineered system.
Know a paper, tool, or repo that should be listed here? We want this index to be exhaustive.