Conductor Podcast: AI Orchestration and Human‑In‑The‑Loop

 16 min video

 2 min read

YouTube video ID: fQmlML9Lay4

Source: YouTube video by Y CombinatorWatch original video

PDF

Conductor treats coding agents like a musical ensemble, letting CEOs of projects orchestrate AI‑generated work through prompts and digestible reports. Code has become “sawdust,” a byproduct rather than the primary focus, while the platform’s goal is to give users the feeling of running a company that directs agents. Human‑led design remains essential; AI should not dictate UI choices because the result can feel uncrafted. The software is envisioned as “malleable,” allowing users to mod workflows much like video‑game skins, keeping the core structure stable while customizing the experience.

Workflow & Tooling

Agents operate inside workspaces where each task produces a pull request (PR). The PR sits in a “in‑review” stage until a human reviews, comments, or merges it. If AI‑generated changes are insufficient, users switch to “caveman mode,” manually editing files before the agent resumes. Certain parts of the codebase are marked as “slot‑free zones,” mandating human oversight to prevent AI from developing harmful patterns. The app tracks progress through a state machine—“in‑progress,” “in‑review,” and “done”—and presents a dashboard view of all active agent work.

Technical Implementation

Conductor is built as a Tauri desktop app with a Rust backend and a TypeScript frontend, while authentication and web services run on Phoenix/Elixir. Engineering practices and constraints are encoded in “skills files” and a cloud.md document that guide the agents. The team runs local models such as Parakeet on high‑end hardware equipped with 128 GB RAM, using a “fast mode” to maximize token efficiency. Permissions are set to “dangerously accept all,” granting agents broad autonomy within the defined workflow.

Future Vision

The roadmap emphasizes software that is both malleable and decoupled from infrastructure. Users will be able to mod their development environment—changing prompt speeds, UI skins, or agent collaboration patterns—without altering the underlying codebase. Multi‑agent collaboration is expected to expand, allowing several specialized AIs to work together while the human remains the final arbiter of quality and direction.

  Takeaways

  • Conductor treats coding agents like a musical ensemble, letting CEOs of projects orchestrate AI‑generated work through prompts and digestible reports.
  • The platform enforces a human‑in‑the‑loop workflow by requiring every AI‑produced change to appear as a pull request that a person must review, comment on, or edit in “caveman mode.”
  • Its UI philosophy favors visual, spatial interfaces over terminal‑style interactions, arguing that AI‑chosen UI elements often feel uncrafted.
  • Built with Tauri (Rust backend, TypeScript frontend) and a Phoenix/Elixir web service, Conductor runs local models like Parakeet on high‑end hardware (128 GB RAM) and uses “fast mode” to keep token usage efficient.
  • Looking ahead, Conductor envisions software as a malleable, mod‑friendly layer where users can customize workflows like game skins while the underlying code and infrastructure stay decoupled.

Frequently Asked Questions

What is “caveman mode” in Conductor’s workflow?

Caveman mode allows users to bypass AI‑generated changes and edit files manually. When an agent’s suggestion is insufficient, the developer opens the affected files, makes direct edits, and then saves the changes, after which the agent can continue its work. This fallback keeps human control when AI output falls short.

How does Conductor ensure AI cannot commit directly to the main branch?

Conductor enforces a human‑AI boundary by routing every AI task through isolated workspaces that generate pull requests. The system’s state machine moves each task from “in‑progress” to “in‑review” and finally “done,” but the PR must be reviewed and merged by a human before any code reaches the main branch, preventing autonomous commits.

Who is Y Combinator on YouTube?

Y Combinator is a YouTube channel that publishes videos on a range of topics. Browse more summaries from this channel below.

Does this page include the full transcript of the video?

Yes, the full transcript for this video is available on this page. Click 'Show transcript' in the sidebar to read it.

Helpful resources related to this video

If you want to practice or explore the concepts discussed in the video, these commonly used tools may help.

Links may be affiliate links. We only include resources that are genuinely relevant to the topic.

PDF