Cole Murray: Stop 'Overindexing' on Basic AI Agent 'Computer Use'
Founders, your AI agents aren't just clicking buttons. Cole Murray reveals why true AI testing demands complex orchestration and deep codebase problem-solving.
This episode covers the rapid evolution of AI coding agents, highlighted by Devin's increasing autonomy and efficiency. Guests Walden Yan and Cole Murray discuss the architectural decisions behind these 'background agents,' the challenges of setting up developer environments, and the shift in focus from simple 'computer use' to complex 'testing' for AI. They also explore the ongoing debate between single-agent and multi-agent systems, the importance of memory management, and diverse real-world use cases.
AI agents are delivering $1K-$5K ROI per engineer today. Learn how they're tackling auto-triage, security, and even letting PMs prompt code via Slack.
Building AI agents? Walden Yan and Cole Murray explain the 'Harness In-Box vs. Out-of-Box' architecture. Learn why separating the agent's 'brain' from its 'hands' is critical for security and how Devin uses this approach.
AI agents struggle with 'repo setup.' Walden Yan and Cole Murray reveal that a clean human dev environment directly improves agent performance. Here's how to fix it.