References

156 references cited. Research, benchmarks, and real production data.

  1. [1] GitHub. "Research: Quantifying GitHub Copilot's impact on developer productivity." GitHub Blog. 2022-2023. github.blog
  2. [2] Software.com. "Developer time allocation: median developer codes 52 minutes per day." Referenced in Dubach, P. philippdubach.com, March 2026. philippdubach.com
  3. [3] Becker, J. et al. "Measuring the Impact of Early-2025 AI on Experienced Open-Source Developer Productivity." METR, July 2025. metr.org
  4. [4] METR. "We are Changing our Developer Productivity Experiment Design." Feb 2026. metr.org
  5. [5] Cui, Z. et al. "The Effects of Generative AI on High-Skilled Work: Evidence from Three Field Experiments with Software Developers." MIT, Princeton, Wharton, Microsoft. Accepted in Management Science 2026. www.microsoft.com
  6. [6] Multiple sources. Claude Code ARR estimates, Codex WAU. Idlen March 2026, Codegen Blog March 2026.
  7. [7] Cognition Labs. Devin $4B valuation. Goldman Sachs pilot. Digital Applied Dec 2025; IBM Think Nov 2025.
  8. [8] xAI. "Grok Build." Jan 2026. grokai.build
  9. [9] AdwaitX. "Grok Build: 8 Parallel AI Agents." March 2026. www.adwaitx.com
  10. [10] xAI. "Grok Code Fast 1." Aug 2025. x.ai
  11. [11] bradAGI. "Awesome CLI Coding Agents." GitHub. March 2026. github.com
  12. [12] AI Engineering (Medium). "Pi-mono: The Minimalist AI Coding Assistant Behind OpenClaw." Feb 2026. ai-engineering-trend.medium.com
  13. [13] Replit. "2025: Replit in Review." Jan 2026. blog.replit.com
  14. [14] Replit. "Introducing Replit Agent 4." March 2026. $400M raise, $9B valuation, $1B ARR target. blog.replit.com
  15. [15] Nocode.mba. "Bolt vs Lovable 2026." www.nocode.mba
  16. [16] Banani. "Base44 vs Bolt vs Lovable." March 2026. www.banani.co
  17. [17] DigitalOcean. "What is OpenClaw?" 60K+ stars in 72hrs. www.digitalocean.com
  18. [18] Emanuel, J. "The Agentic Coding Flywheel: Complete Guide." agent-flywheel.com
  19. [19] Faros AI. "DORA Report 2025 Key Takeaways." July 2025. 10,000+ developers, 1,255 teams. www.faros.ai
  20. [20] Bain & Company. "2025 Technology Report." Coding = 25-35% of idea-to-launch time. www.bain.com
  21. [21] Factory AI. $50M Series B. factory.ai
  22. [22] NEA. "Factory: The Platform for Agent-Native Development." Sep 2025. www.nea.com
  23. [23] ClickUp. "ClickUp + Codegen." Dec 2025. clickup.com
  24. [24] Qodo. "Best AI Code Review Tools 2026." Feb 2026. www.qodo.ai
  25. [25] Aikido. "Top 7 CodeRabbit Alternatives." Dec 2025. www.aikido.dev
  26. [26] IBM Think. "Meet Devin the AI Software Engineer." Nov 2025. Goldman Sachs pilot. www.ibm.com
  27. [27] StrongDM. "The StrongDM Software Factory." Feb 2026. factory.strongdm.ai
  28. [28] Willison, S. "How StrongDM's AI team build serious software." Feb 2026. simonwillison.net
  29. [29] Stanford Law CodeX. "Built by Agents, Tested by Agents, Trusted by Whom?" Feb 2026. law.stanford.edu
  30. [30] Stack Overflow. "2025 Developer Survey." AI adoption and trust data. survey.stackoverflow.co
  31. [31] Google DORA. "Accelerate State of DevOps 2024." 39,000+ professionals. dora.dev
  32. [32] Google DORA. "State of AI-assisted Software Development 2025." Sep 2025. dora.dev
  33. [33] Trickle. "Devin AI Review." 2025. Nubank case study. trickle.so
  34. [34] Codegen Blog. "Best AI Coding Agents in 2026." March 2026. codegen.com
  35. [35] Tan, G. "Trust Levels in AI." X/Twitter, March 17, 2026.
  36. [36] Shapiro, D. Five-level taxonomy. Jan 2026. Referenced in AIResourcePro Feb 2026.
  37. [37] InfoQ. "xAI Releases Grok Code Fast 1." Sep 2025. www.infoq.com
  38. [38] Dubach, P. "93% of Developers Use AI Coding Tools. Productivity Hasn't Moved." March 2026. philippdubach.com
  39. [39] Denicola, D. "My Participation in the METR Study." July 2025. domenic.me
  40. [40] The Pragmatic CTO. "The Software Factory." Feb 2026. www.thepragmaticcto.com
  41. [41] Ry Walker. "Grok Build." March 2026. rywalker.com
  42. [42] Augment Code. "Cursor vs Windsurf." www.augmentcode.com
  43. [43] CodeRabbit. "AI dev tool tech stack." 2025. www.coderabbit.ai
  44. [44] Calmops. "AI Coding Agents and Devin 2026." March 2026. calmops.com
  45. [45] Nader, N. "How to Build a Custom Agent Framework with PI." Feb 2026. nader.substack.com
  46. [46] 36Kr. "Security Company Stops Human Code Interaction." Feb 2026. StrongDM coverage. eu.36kr.com
  47. [47] Osmani, A. "The reality of AI-Assisted software engineering productivity." Aug 2025. addyo.substack.com
  48. [48] InfoQ. "Kimi K2.5 Agent Swarm." Feb 2026. www.infoq.com
  49. [49] VentureBeat. "Moonshot AI Kimi K2.5." Jan 2026. venturebeat.com
  50. [50] Moonshot AI. "Kimi K2.5." Jan 2026. Pricing and technical specifications. www.kimi.com
  51. [51] DeepSeek. "DeepSeek-V3.2." Hugging Face model card. huggingface.co
  52. [52] DeepSeek. "V3.2 Release Notes." Dec 2025. api-docs.deepseek.com
  53. [53] Raschka, S. "Technical Tour of DeepSeek V3 to V3.2." Dec 2025. magazine.sebastianraschka.com
  54. [54] MiniMax. "M2.5: Built for Real-World Productivity." March 2026. www.minimax.io
  55. [55] MiniMax. "Forge: Scalable Agent RL Framework and Algorithm." March 2026. www.minimax.io
  56. [56] MiniMax. "M2.1: Multilingual and Multi-Task Coding with Strong General Capabilities." March 2026. www.minimax.io
  57. [57] MiniMax. "MiniMax-M1." arXiv, June 2025. arxiv.org
  58. [58] VentureBeat. "MiniMax-M1 is a new open-source model with 1 million token context." Dec 2025. Training cost: $534,700. venturebeat.com
  59. [59] Codecademy. "Kimi K2.5 Guide." 2026. www.codecademy.com
  60. [60] WaveSpeedAI. "Kimi K2.5." Feb 2026. wavespeed.ai
  61. [61] DeepSeek. "V3.2 Technical Paper." arxiv.org
  62. [62] InfoQ. "xAI Grok Code Fast 1." Sep 2025. www.infoq.com
  63. [63] LangChain. "Open SWE." GitHub, March 2026. Architectural patterns from Stripe, Ramp, Coinbase. github.com
  64. [64] Awesome Agents. "Stripe Minions: 1,300 PRs/Week." Feb 2026. awesomeagents.ai
  65. [65] Anup.io. "Stripe's coding agents: the walls matter more than the model." Feb 2026. www.anup.io
  66. [66] Coinbase. "Building enterprise AI agents at Coinbase." 2026. www.coinbase.com
  67. [67] Finextra. "Stripe puts AI coding minions to work." Feb 2026. www.finextra.com
  68. [68] Modal. "How Ramp built a full context background coding agent on Modal." 2026. modal.com
  69. [69] InfoQ. "Ramp Internal Coding Agent Platform." Jan 2026. www.infoq.com
  70. [70] Coinbase. "Making Smarter Decisions, Faster with AI at Coinbase." 2026. www.coinbase.com
  71. [71] Coinbase. "Improving Product Quality at Coinbase with AI agents." 2026. www.coinbase.com
  72. [72] CoinDesk. "AI agents and stablecoins." March 2026. x402 protocol. www.coindesk.com
  73. [73] Stripe Dot Dev Blog. "Minions: Stripe's one-shot end-to-end coding agents." Jan-Feb 2026. stripe.dev
  74. [74] Ramp Builders. "Why We Built Our Own Background Agent." 2026. builders.ramp.com
  75. [75] Ry Walker. "Ramp Inspect." Feb 2026. rywalker.com
  76. [76] Ry Walker. "Background Agents (Open-Inspect)." Feb 2026. rywalker.com
  77. [77] AIbase. "ByteDance's Trae AI Programming Tool." 2025. www.aibase.com
  78. [78] AIbase. "Trae MAUs Exceed 1.6 Million." 2025 annual report. news.aibase.com
  79. [79] AWS. Kiro IDE. Spec-driven development with Claude Sonnet. Autonomous agent announced re:Invent Dec 2025. kiro.dev
  80. [80] Kilo Code. 1.5M+ users, 25T+ tokens. GitLab co-founder Sid Sijbrandij. $8M seed. github.com
  81. [81] Zed Industries. Zed AI editor. ACP co-developed with JetBrains Jan 2026. $32M Sequoia. zed.dev
  82. [82] Google. "Antigravity." Nov 2025. $2.4B Windsurf team acqui-hire. Manager View, Artifacts system. codeconductor.ai
  83. [83] Double.bot. YC W23. VS Code extension. docs.double.bot
  84. [84] GitHub. "Quantifying GitHub Copilot's Impact in the Enterprise with Accenture." +8.7% PRs, +84% successful builds. github.blog
  85. [85] Microsoft. "New Future of Work Report 2024." Dec 2025. Developers spend 14% of time writing new code. www.microsoft.com
  86. [86] Sweep AI. JetBrains-focused AI assistant. 40K+ installs. sweep.dev
  87. [87] Blackbox AI. Claims 30M+ developers. 300+ models. From ~$3/month. www.blackbox.ai
  88. [88] Cui, Z. et al. "The Effects of Generative AI on High-Skilled Work." SSRN. MS/MIT/Accenture. 4,867 developers. papers.ssrn.com
  89. [89] Pieces for Developers. Long-term memory across IDEs. MCP integration. www.producthunt.com
  90. [90] Anthropic. AI coding assistants and skill formation. RCT: 52 engineers, 17% lower quiz scores. www.sovereignmagazine.com
  91. [91] NAV IT (Norway). 26,317 commits over 2 years. No significant productivity change. Selection effect. arxiv.org
  92. [92] Humlum, A. & Vestergaard, E. "Large Language Models, Small Labor Market Effects." University of Chicago / NBER. 2025. www.nber.org
  93. [93] Google. Jules autonomous coding agent. GA August 2025. 140,000+ code improvements during beta. CLI Oct 2025. Free to $124.99/month. blog.google
  94. [94] Sourcegraph. "Amp." Replaced Cody for individual users. No token constraints. Credit-based. sourcegraph.com
  95. [95] Warp. "2025 in Review." 3.2B lines edited. 120K codebases indexed. TIME Best Invention 2025. www.warp.dev
  96. [96] Block. "Codename Goose." 29,400+ GitHub stars. ~5,000 weekly users. block.xyz
  97. [97] Linux Foundation. "Agentic AI Foundation (AAIF)." MCP, Goose, AGENTS.md. www.linuxfoundation.org
  98. [98] Cosine. "Genie 2." 50+ languages including COBOL, Fortran. FINRA/HIPAA/ITAR. 72% SWE-Lancer. cosine.sh
  99. [99] Poolside AI. ~$626M Series B. $1B Nvidia investment at $12B valuation. Jason Warner (ex-GitHub CTO). en.wikipedia.org
  100. [100] Sacra. "Poolside valuation and analysis." 350M+ repos, 50T+ tokens, RLCEF training. sacra.com
  101. [101] Magic AI. $515M raised. 100M-token context windows. Custom CUDA stack. magic.dev
  102. [102] Communeify. "Magic 100M Context Window analysis." LTM-2-Mini. www.communeify.com
  103. [103] NVIDIA. "NemoClaw for OpenClaw." GTC March 2026. OpenShell runtime, Nemotron models. Partners: Adobe, Salesforce, SAP, CrowdStrike, ServiceNow, Dell. nvidianews.nvidia.com
  104. [104] CodeRabbit. "2025 was the year of AI speed. 2026 will be the year of AI quality." AI PRs: ~1.7x more issues. www.coderabbit.ai
  105. [105] GitClear. Code quality metrics. ~10% more durable code but "sharp declines in several measures of code quality." Referenced in Dubach [38].
  106. [106] The Next Web. "Nvidia turns OpenClaw into enterprise platform with NemoClaw." March 2026. YAML policies, privacy router. thenextweb.com
  107. [107] Winbuzzer. "Nvidia Launches NemoClaw." March 2026. Nemotron Coalition, OpenAI acqui-hire. winbuzzer.com
  108. [108] Graphite. AI code review. <3% unhelpful rate. 55% code change rate. 24hrs to 90min merge. $40/user/month. dev.to
  109. [109] Cursor BugBot. 2M+ PRs monthly reviewed. 8 parallel review passes. July 2025. Referenced in [108].
  110. [110] Snyk. AI Security Platform. 48% of AI code has vulnerabilities. Gartner Leader AST. Agent Fix, Evo, Studio. snyk.io
  111. [111] Atlassian. "Rovo Dev." GA late 2025. 75% time savings on simpler tasks. Statsig: 45% PR cycle reduction. $20/user/month. www.atlassian.com
  112. [112] GitLab Duo. Agent Platform GA January 2026. 64.5% SWE-bench Verified. Gartner Leader. cloudfresh.com
  113. [113] Harness AI. DevOps agent. Claude Opus 4.5. United Airlines, Citibank. 75% faster releases. www.harness.io
  114. [114] Moderne. OpenRewrite. 5,000+ deterministic refactoring recipes. JS/TS/Python expansion late 2025. sdtimes.com
  115. [115] Trunk.io. CI reliability platform. Merge queues batching 100 PRs. AI flaky test management. trunk.io
  116. [116] Diffblue Cover. RL-based autonomous JUnit test generation. 250x faster than manual. www.diffblue.com
  117. [117] The Hans India. "AI Now Writes Most Code at Uber." March 2026. ~70% machine-generated, 95% monthly usage. www.thehansindia.com
  118. [118] TMCnet. "How Uber Built AI Agents That Saved 21,000 Developer Hours." blog.tmcnet.com
  119. [119] Google DeepMind. "AlphaEvolve: A Gemini-powered coding agent for designing advanced algorithms." May 2025. deepmind.google
  120. [120] VentureBeat. "Meet AlphaEvolve." venturebeat.com
  121. [121] Live-SWE-agent. University of Illinois, Nov 2025. Self-evolving coding agent. 45.8% SWE-bench Pro. live-swe-agent.github.io
  122. [122] MCP. 97M+ monthly SDK downloads. 10,000+ servers. en.wikipedia.org
  123. [123] Anthropic. "Donating MCP and establishing the Agentic AI Foundation." www.anthropic.com
  124. [124] Google. "Agent2Agent Protocol." April 2025. 150+ organisations. platformengineering.com
  125. [125] Linux Foundation. "A2A Protocol Project." www.linuxfoundation.org
  126. [126] LangGraph. 34.5M monthly PyPI downloads. 600-800 companies in production. latenode.com
  127. [127] CrewAI. 45,900+ GitHub stars. 100K+ certified developers. 1.4B automations. github.com
  128. [128] Microsoft Agent Framework. AutoGen + Semantic Kernel convergence. Public preview Oct 2025. cloudsummit.eu
  129. [129] OpenAI Agents SDK. Evolved from Swarm. March 2025. mem0.ai
  130. [130] Amazon Bedrock AgentCore. GA mid-2025. 100K+ organisations on Bedrock. aws.amazon.com
  131. [131] Google Vertex AI Agent Builder. ADK 7M+ downloads. cloud.google.com
  132. [132] MetaGPT. 48K+ stars. Simulates software company. 85.9% HumanEval. MGX Feb 2025. github.com
  133. [133] ChatDev. Tsinghua/OpenBMB. $0.30/project in ~7 minutes. www.ibm.com
  134. [134] E2B. Firecracker microVMs. 150ms cold starts. 88% of Fortune 100. $21M Series A July 2025. github.com
  135. [135] Composio. 1,000+ managed integrations. OAuth handling. $29M July 2025. docs.composio.dev
  136. [136] IT Pro. "Google CEO: 25%+ of code AI-generated." www.itpro.com
  137. [137] Shopify. CEO Tobi Lutke memo April 2025. "Reflexive AI usage is now a baseline expectation." AI in performance reviews. Referenced in multiple sources.
  138. [138] Microsoft. CoreAI Platform and Tools division January 2025. 20-30% AI-generated code. Referenced in multiple sources.
  139. [139] Netflix. Productivity Assistant team. 150K tickets/year. 416 hrs/employee recovered. Hiring AI PMs at $240K-$700K. newsletter.nextool.ai
  140. [140] Meta. CodeCompose. 6.7B parameter model fine-tuned on internal code. 20%+ acceptance rate. Referenced in academic literature.
  141. [141] Anthropic. "Estimating AI productivity gains from Claude conversations." 100K conversations. ~80% time reduction per task. www.anthropic.com
  142. [142] Anthropic. Internal study: 132 engineers, 59% of work with AI, +67% merged PRs/day. Referenced in [141]. www.anthropic.com
  143. [143] Self-healing code vulnerability study. 37.6% increase in critical vulnerabilities after 5 iterations. www.module.today
  144. [144] Stanford. Software developer employment aged 22-25 fell nearly 20% (2022-2025). Referenced in Dubach [38].
  145. [145] MIT Technology Review. "AI coding is everywhere. But not everyone is convinced." Dec 2025. www.technologyreview.com
  146. [146] "Developer Productivity With and Without GitHub Copilot." Longitudinal study. arXiv. arxiv.org
  147. [147] SWE-bench Verified leaderboard. Feb 2026. 76.8% top score (Claude Opus 4.5). simonwillison.net
  148. [148] SWE-smith. Princeton. NeurIPS 2025 Spotlight. 50,000+ task instances, 128 repos. Open-source models: 40.2%. arxiv.org
  149. [149] MIT CSAIL. "Can AI really code?" ICML 2025. SWE-bench Multilingual. news.mit.edu
  150. [150] NVIDIA GTC 2026 blog. NemoClaw, DGX Spark. blogs.nvidia.com
  151. [151] Stripe Dot Dev Blog. "Minions." Official technical blog post. stripe.dev
  152. [152] Ramp Builders. "Why We Built Our Own Background Agent." builders.ramp.com
  153. [153] Uber. "AI-Augmented Code Review at Scale." www.zenml.io
  154. [154] Panto. "AI in Coding - Key Statistics 2026." www.getpanto.ai
  155. [155] Fabro. "The Dark Software Factory." Qlty Software (Bryan Helmkamp / Code Climate founder). Open-source, MIT. Rust single binary. Graphviz DOT workflow graphs, CSS-like model stylesheets, cloud sandboxes, Git checkpointing, automatic retrospectives. fabro.sh
  156. [156] Fabro GitHub repository. 37 stars, v0.174.0, 1,310 commits. 83.6% Rust, 12.6% TypeScript. github.com