open source · windows-first · zero GPU

The edit is a JSON file.
Everything else is an editor automated.

Drop raw takes in a folder. Whisper transcribes them, Claude decides the cut, ffmpeg executes it, Remotion draws the graphics. Out come branded reels in 16:9, 9:16 and 1:1 — captioned, beat-cut, loudness-normalized. No timeline was scrubbed.

{
  "title": "skyloka demo",
  "clips": [
    { "take": "take1.mp4",
      "in": 0.5, "out": 2.5,
      "note": "hook" },
    { "take": "take2.mp4",
      "in": 1.0, "out": 3.0,
      "transition": "crossfade" },
    { "take": "take3.mp4",
      "in": 2.0, "out": 4.0,
      "note": "cta" } ]
}

video

hook

feature

cta

graphics · captions

title + callouts

logo sting

→ reel_16x9.mp4 · reel_9x16.mp4 · reel_1x1.mp4

Five stages, one command

Each stage writes its artifact to disk. If anything fails, --resume picks up exactly where it stopped.

whisper.cpp

Transcribe

Every take becomes a timestamped transcript. Small model, pure CPU, cached forever.

claude api

Decide

Reads all transcripts plus your brief. Outputs a validated Edit Decision List — the strongest delivery of each idea, cut tight.

ffmpeg

Cut

Trims, concats, crossfades, EBU R128 loudness. Beat-grid snapping lands cuts on the music.

remotion

Draw

Title cards, lower thirds, callouts, counters, logo stings — React components rendered to an alpha overlay on CPU.

ffmpeg

Ship

Composite, burn captions, mix music with speech ducking, export every aspect ratio. 4K when you want it.

Install

Windows-first. One PowerShell script fetches whisper.cpp and the model; ffmpeg and Node you likely have.

# one-time setup PS> powershell -ExecutionPolicy Bypass -File .\scripts\setup-windows.ps1 PS> cd remotion; npm install; cd .. PS> copy .env.example .env # add your ANTHROPIC_API_KEY PS> node bin\lokacut.js doctor # make a reel PS> node bin\lokacut.js watch # drop takes → transcripts appear PS> node bin\lokacut.js render --brief brief.md --quality final --formats 16x9,9x16,1x1

Brief templates

A brief is a markdown file describing what must land. Start from one of five.

45s · 16:9

Product demo

Hook in 2s, three features shown not told, one honest number, logo sting.

30s · 9:16

Client promo

Local business, owner lower-third, offer callout, phone number big.

25s · 16:9

Feature launch

Name it in the first second, show it in one continuous flow.

40s · 1:1

Testimonial

Exact words kept, filler cut, meaning never re-ordered.

12s · 9:16

Social teaser

Boldest frame first, one line at a time, hard cut to sting.

What LokaCut is not: a video generator. No diffusion, no synthetic footage, no invented b-roll. The footage is always your real screen recordings and real clips — that's the credibility, not the limitation.