Amazing stuff!
"Cursor tested hundreds of concurrent AI agents working on complex software projects for weeks at a time, generating over 1 million lines of code. The company found that a hierarchical structure with specialized planner and worker agents outperformed flat coordination models.
Planners continuously explore codebases and create tasks, while workers focus solely on completing assigned work without coordinating with each other.
The system built a web browser from scratch in one week with 1,000 files, migrated Cursor’s own codebase from Solid to React over three weeks with 266,000 additions and 193,000 deletions, and optimized video rendering code that shipped to production.
GPT-5.2 models proved more effective than GPT-5.1-Codex for extended autonomous work, maintaining focus and avoiding drift better than Opus 4.5, which tends to take shortcuts.
The company says prompt engineering matters more than infrastructure, and the optimal coordination structure falls between completely flat and rigidly hierarchical systems. (Cursor)"
No comments:
Post a Comment