Video Guide Maker

1. Upload

Study guide title

Video file

Drop a video here or click to browse
MP4, MOV, MKV, WebM, M4V

Transcript

Drop a VTT or SRT file or click to browse
Cue timestamps drive segment alignment
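As a rough illustration of what "cue timestamps" means here, the sketch below pulls start and end times out of a VTT or SRT transcript. It is not the tool's actual parser; the names `CUE_RE` and `cue_times` are hypothetical, and it only assumes the standard timing-line shape (`HH:MM:SS.mmm --> HH:MM:SS.mmm`, with a comma before the milliseconds in SRT).

```python
import re

# Matches WebVTT ("00:01:02.500") and SRT ("00:01:02,500") timing lines.
# WebVTT also allows the hours field to be omitted ("01:02.500").
CUE_RE = re.compile(
    r"(?:(\d+):)?(\d{2}):(\d{2})[.,](\d{3})\s*-->\s*"
    r"(?:(\d+):)?(\d{2}):(\d{2})[.,](\d{3})"
)


def _seconds(hours, minutes, seconds, millis):
    """Convert the captured timestamp fields to seconds as a float."""
    return int(hours or 0) * 3600 + int(minutes) * 60 + int(seconds) + int(millis) / 1000


def cue_times(text):
    """Return (start, end) in seconds for every cue timing line in the transcript."""
    cues = []
    for line in text.splitlines():
        match = CUE_RE.search(line)
        if match:
            fields = match.groups()
            cues.append((_seconds(*fields[:4]), _seconds(*fields[4:])))
    return cues
```

Each (start, end) pair could then be matched against scene boundaries to decide which cues belong to which segment.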

2. AI assist (optional)

Paste an Anthropic API key to let Claude generate the fields below. Without a key the pipeline still runs — you'll just get the OCR-derived scaffold with placeholders. Toggle off any feature you don't want to pay for.

Anthropic API key: Used once for this run. Sent over HTTPS, held in memory only, and never written to disk or logs.

Section titles: "Combining AI models" instead of "Segment 3 — 4:25"

Alt-text drafts: WCAG-purpose descriptions, not OCR transcripts

Key terms: Term + concise definition per segment

Math equations: LaTeX from on-screen formulas, with screen-reader labels

On-screen text (Claude): Replaces Tesseract OCR for primary frames; better on coloured callouts and decorative fonts

3. Media extras

Local-only enrichments — no API key needed.

Per-segment audio: Slice the lecture's narration into one clip per segment so learners can re-listen. Adds ffmpeg processing time and ~500 KB per clip.
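One way per-segment slicing could be done with ffmpeg is sketched below. The page does not say which ffmpeg options or audio codec the tool uses, so the function name `audio_clip_command`, the AAC codec choice, and the file names are assumptions for illustration only. The function just builds the argument list and does not run ffmpeg.

```python
def audio_clip_command(video_path, start, end, out_path):
    """Build an ffmpeg argv that extracts the audio between start and end (seconds)."""
    if end <= start:
        raise ValueError("segment end must be after its start")
    duration = end - start
    return [
        "ffmpeg", "-y",            # overwrite the output file if it exists
        "-i", video_path,
        "-ss", f"{start:.3f}",     # seek to the segment start
        "-t", f"{duration:.3f}",   # keep only the segment's duration
        "-vn",                     # drop the video stream
        "-c:a", "aac",             # assumed codec; the tool's choice is not stated
        out_path,
    ]
```

The resulting list can be passed to `subprocess.run(cmd, check=True)` once per segment, which is where the extra processing time comes from.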

4. Output

Review & edit: Open the in-browser editor to refine before publishing.

Single HTML: Self-contained file with all images inlined.

Zip bundle: HTML plus a static folder, ready to host.

Advanced settings

Scene-change sensitivity 27

More scenes | Fewer scenes

How visually different two frames must be before they count as a new scene. Lower (5–15) catches every animation step; higher (35+) only major slide changes.
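To make the threshold idea concrete, here is a minimal sketch that marks a new scene whenever two consecutive frames differ by more than the sensitivity value. The difference metric (mean absolute pixel difference on flat greyscale frames) and the names `mean_abs_diff` and `scene_starts` are assumptions; the page does not say which detector or metric the tool actually uses.

```python
def mean_abs_diff(frame_a, frame_b):
    """Average absolute difference between two equally sized greyscale frames (0-255)."""
    if len(frame_a) != len(frame_b):
        raise ValueError("frames must have the same number of pixels")
    return sum(abs(a - b) for a, b in zip(frame_a, frame_b)) / len(frame_a)


def scene_starts(frames, threshold=27.0):
    """Indices of frames that differ from their predecessor by more than threshold."""
    starts = [0] if frames else []
    for i in range(1, len(frames)):
        if mean_abs_diff(frames[i - 1], frames[i]) > threshold:
            starts.append(i)
    return starts
```

With this metric, lowering the threshold makes small changes such as a single animation step count as a new scene, while raising it leaves only large jumps such as a full slide change.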

Minimum gap between scenes off

Off | 60 s

Drop new scenes that arrive less than this many seconds after the previous kept one. Useful for slides that build up gradually — try ~5 s to keep just the final state.
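The rule described above can be sketched as a simple filter over scene start times. The function name `apply_min_gap` is hypothetical, and treating a gap of 0 as "off" is an assumption about how the slider's lowest position behaves.

```python
def apply_min_gap(scene_times, min_gap):
    """Keep a scene only if it starts at least min_gap seconds after the last kept one."""
    if min_gap <= 0:  # assumed meaning of the "off" position
        return sorted(scene_times)
    kept = []
    for t in sorted(scene_times):
        if not kept or t - kept[-1] >= min_gap:
            kept.append(t)
    return kept
```

Note that the comparison is always against the previous kept scene, not the previous detected one, so a rapid run of small changes collapses into a single entry.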

Max frames unlimited

Cap total frames

1200

Hard limit on the number of frames in the guide. Useful for very long lectures or to bound LLM extraction cost. Frames are evenly distributed across the video.
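A possible reading of "evenly distributed" is shown below: when there are more frames than the cap allows, pick evenly spaced indices that always include the first and last frame. This is an index-based sketch with a hypothetical name `cap_frames`; the tool may instead space frames by timestamp.

```python
def cap_frames(frames, max_frames):
    """Return at most max_frames items, evenly spaced by index across the input."""
    if max_frames is None or len(frames) <= max_frames:
        return list(frames)  # "unlimited", or already under the cap
    if max_frames <= 0:
        return []
    if max_frames == 1:
        return [frames[0]]
    last = len(frames) - 1
    # Integer arithmetic keeps the picks exact; the spacing is always more than
    # one index here, so no frame is selected twice.
    picks = [(i * last) // (max_frames - 1) for i in range(max_frames)]
    return [frames[i] for i in picks]
```

For example, capping ten frames at four keeps the frames at indices 0, 3, 6 and 9.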

Instructor-frame face threshold 0.12

Strict (drop more) | Lenient (keep more)

Fraction of the frame a detected face must occupy before we drop it as instructor-only. Lower = more aggressive; raise it if your slides have an inset webcam you want to keep.
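The check described above comes down to comparing a face's bounding-box area with the frame area. The sketch below assumes a face detector that returns an (x, y, width, height) box; the function name `is_instructor_only` and the box format are hypothetical, and the tool may measure face size differently.

```python
def is_instructor_only(face_box, frame_size, threshold=0.12):
    """True if the face covers at least `threshold` of the frame area.

    face_box is (x, y, width, height); frame_size is (width, height).
    """
    _, _, face_w, face_h = face_box
    frame_w, frame_h = frame_size
    fraction = (face_w * face_h) / (frame_w * frame_h)
    return fraction >= threshold
```

On a 1920x1080 frame, a 600x500 face covers about 14% of the area and would be dropped at the default 0.12, while a 200x200 inset webcam covers about 2% and would be kept.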

Skip OCR: Faster, but on-screen text and the heuristic LaTeX wrapping in the guide will be empty. Doesn't affect LLM extraction (which reads the frame image directly).

Skip inverted OCR pass: Halves OCR time and avoids the inverted pass producing garbled near-duplicates of body text. Trade-off: white-on-coloured callout text won't be recovered. Leave this option off for slides with coloured callouts.

Document language en

BCP-47 code (en, en-US, fr…). Sets the lang attribute on the generated HTML and selects Tesseract language packs when available.
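Tesseract names its language packs with three-letter codes (for example `eng`, `fra`), so a BCP-47 tag has to be mapped before OCR. The sketch below shows one way to do that; the table `TESSERACT_PACKS` is a small illustrative subset and the function name `tesseract_pack` is hypothetical, not the tool's actual lookup.

```python
# Illustrative subset: BCP-47 primary subtag -> Tesseract language pack name.
TESSERACT_PACKS = {"en": "eng", "fr": "fra", "de": "deu", "es": "spa"}


def tesseract_pack(bcp47_tag):
    """Return the Tesseract pack for a BCP-47 tag, or None if none is known."""
    # Only the primary language subtag matters here: "en-US" -> "en".
    primary = bcp47_tag.replace("_", "-").split("-")[0].lower()
    return TESSERACT_PACKS.get(primary)
```

The full tag (such as `en-US`) would still be written unchanged to the HTML `lang` attribute; only the OCR lookup uses the shortened form.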

Turn lectures into study-ready guides.

Generate study guide
