ClaudeClip
Video editing where AI & you are co-directors
Created on 15th February 2026
The problem ClaudeClip solves
A video editor in the spirit of Claude Code, where AI and humans are both first-class citizens.
Claude Code has taken the programming world by storm — not just because of model capabilities, but because of its human-AI collaboration paradigm. One frontier that remains underexplored is video editing.
The Landscape Today
| Approach | Strengths | Weaknesses |
|---|---|---|
| Traditional editors (Premiere, DaVinci) | Maximum control | Famously tedious, human-only |
| AI video generation (Sora, Kling) | Great for individual clips | Not controllable or composable |
| Agentic editors (existing tools) | AI-assisted workflows | Locked into built-in AI subscriptions; if the app fails to keep up with the latest AI developments, you're stuck |
ClaudeClip gives you full human control and powerful AI orchestration, with the freedom to bring your own models.
- UI — A well-crafted, "normal" video editor you'd actually want to use
- MCP-native orchestration layer — A meticulously engineered tool surface that Claude (or any AI) can command. Endless possibilities for custom workflows using Agent Skills.
- Shared state file — Both sides stay synced through a single, semaphore-like project state
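The shared-state idea above can be sketched in a few lines. This is a hypothetical illustration only: the field names (`revision`, `timeline`) and on-disk layout are assumptions, not ClaudeClip's actual format. It shows how a single file with an optimistic, semaphore-like revision check can keep the human UI and the agent from clobbering each other.

```python
import json
from pathlib import Path

def read_state(path: Path) -> dict:
    """Either side (UI or agent) reads the same project state file."""
    if not path.exists():
        return {"revision": 0, "timeline": []}
    return json.loads(path.read_text())

def write_state(path: Path, state: dict, expected_revision: int) -> bool:
    """Semaphore-like write: refuse to overwrite a newer revision,
    so the agent can't silently clobber a manual edit (and vice versa)."""
    current = read_state(path)
    if current["revision"] != expected_revision:
        return False  # the other side wrote first; re-read and retry
    state = dict(state, revision=expected_revision + 1)
    path.write_text(json.dumps(state, indent=2))
    return True
```

A writer that loses the race gets `False` back and re-reads the file, rather than overwriting the other editor's work.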
ClaudeClip is well suited to one-shotting quick clips — tech demos, vlogs, explainer videos — but with human intervention, the possibilities are endless.
How It Works
- Start with a folder of videos you want to compose together. If you don't have one, you can start from scratch, generating videos, photos, and motion graphics as you go.
- Prompt Claude — "hey, I want to create a quick explainer about the principle of gravity"
- Claude uses the MCP to access the semantic content of your existing videos, then trims and organises them into a cohesive end-to-end narrative. It generates text transitions, images, and motion graphics, composing everything on the timeline.
- You're 80% there. Go in, tweak it to perfection manually!
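The workflow above can be sketched as a tiny agent-side loop. Everything here is illustrative: the stub media library, the `describe_clip` stand-in for a semantic video-understanding call, and the keyword relevance check are assumptions, not ClaudeClip's real MCP tool surface.

```python
# Stub library mapping filenames to semantic descriptions; in the real
# system these descriptions would come from a video-understanding API.
MEDIA_LIBRARY = {
    "whiteboard.mp4": "lecturer explains the principle of gravity",
    "bloopers.mp4": "outtakes from last week's shoot",
}

def describe_clip(name: str) -> str:
    """Stand-in for a semantic video-understanding tool call."""
    return MEDIA_LIBRARY[name]

def compose_timeline(topic: str) -> list[str]:
    """Pick relevant clips for the prompt, then append a generated title card."""
    timeline = [n for n in MEDIA_LIBRARY if topic in describe_clip(n)]
    timeline.append(f"title:{topic}")
    return timeline
```

The real agent loop would of course trim clips and place them spatially rather than just listing names, but the shape — query semantics, filter, compose, generate fillers — is the same.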
| User | Use Case |
|---|---|
| Researchers / PMs / Engineers | Quickly compose demo videos and leadership presentations from screen recordings and slides |
| Content creators | One-shot vlogs and explainers from raw footage + a prompt |
| Enterprise teams | Automate repetitive internal content creation, plugging into existing pipelines |
| Developers | A reference for how to build MCP-native apps that let users bring their own agent, a paradigm we're coining 'BYOA' (Bring Your Own Agent) |
Challenges we ran into
The core challenge was making AI a true peer to the human editor.
- AI can't "see" the canvas. Unlike a human who glances at the timeline and intuitively understands what's there, the agent is blind. We had to build meticulous state-file maintenance and dedicated seek-and-capture tools so the AI could query what's on the timeline (supported by semantic video and image understanding APIs that we built out too), understand spatial layout, and reason about the project as effectively as a human looking at the screen.
- MCP tool-call chaining is fragile. A single video edit (e.g., "add a clip, trim it, overlay a title, fade in") involves a chain of dependent tool calls in an agent loop. Getting sequencing, error recovery, and intermediate state consistency right, so the agent doesn't hallucinate a clip ID or clobber a previous edit, required careful tool design and validation at every step.
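The per-call validation described above can be sketched as follows. The class and method names are hypothetical; the point is that every mutating tool call checks its clip ID against known state first, so a hallucinated ID fails loudly instead of corrupting the timeline.

```python
class UnknownClipError(KeyError):
    """Raised when a tool call references a clip ID that doesn't exist."""

class Timeline:
    def __init__(self) -> None:
        self._clips: dict[str, dict] = {}  # clip_id -> {"start": s, "end": e}
        self._next_id = 1

    def add_clip(self, start: float, end: float) -> str:
        """Tool calls get back a server-issued ID, never one they invented."""
        clip_id = f"clip-{self._next_id}"
        self._next_id += 1
        self._clips[clip_id] = {"start": start, "end": end}
        return clip_id

    def trim(self, clip_id: str, new_end: float) -> None:
        """Validate the ID before mutating anything."""
        if clip_id not in self._clips:
            raise UnknownClipError(clip_id)
        self._clips[clip_id]["end"] = new_end
```

Surfacing `UnknownClipError` back through the MCP result lets the agent loop recover (re-query the timeline, retry) instead of silently continuing on bad state.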