Features · Transcription Search

Search every word,
spoken on camera

FrameQuery runs automatic speech-to-text with speaker diarization on every video you index. Search by speaker, find any quote, and jump to the exact frame with word-level timestamps.

Download

How It Works

From raw audio to searchable transcript

FrameQuery transcribes your footage during indexing with full speech-to-text, word-level timestamps, and automatic speaker diarization. Find a quote from any interview, filter by speaker, and search by meaning with semantic embeddings - not just exact words.

2m 14s - 2m 22sSarah

We spent three weeks scouting locations before we found the right valley. The light had to be perfect for what we were trying to capture.

2m 22s - 2m 31sDavid

And the permits took even longer than that. You can't just show up with a crew and start filming on protected land.

2m 31s - 2m 38sSarah

Right. But once we had everything sorted, the first two days of shooting were incredible. Golden hour lasted almost forty minutes up there.

2m 38s - 2m 47sAlex

I remember the sound team had issues with the wind though. We ended up using the backup lavs for most of the exterior dialogue.

2m 47s - 2m 53sSarah

That's true. The boom was basically unusable above the treeline. Lesson learned for next time.

2m 53s - 2m 59sDavid

We should budget for a dedicated wind rig on the next shoot. It would have saved us two days of ADR.

2m 59s - 3m 08sSarah

Agreed. And the drone footage from day three more than made up for it. Those wide aerials are probably the best shots in the whole film.

3m 08s - 3m 14sAlex

The client loved those. That single tracking shot over the ridge ended up in every version of the cut.

Click any line to jump to that moment in the video

Search Syntax

Precise control over what you find

Query reference

"quarterly goals"

Exact phrase match. Finds the precise moment someone said this.

-interview

Exclude a term. Combine with other queries to narrow results.

@Sarah

Filter by speaker. Shows only transcript segments from that person.

codec:prores res:4k

Metadata filters to narrow by format, resolution, camera, and more.

Semantic Search

Search by meaning, not just exact words

MiniLM word embeddings run alongside keyword search on every query. Type what you mean and FrameQuery finds transcript segments that express the same idea, even when the words are completely different.

discussing the budgetSemantic

Searching by meaning, not just exact words. None of these results contain “discussing” or “budget” together.

MEETING_Q3_REVIEW.mov14m 22s

92%Semantic

Sarah

We need to figure out the costs before we can commit to anything this quarter.

TEAM_STANDUP_0312.mov3m 08s

87%Semantic

David

The finance team flagged our spend on the infrastructure side. It's higher than projected.

CLIENT_CALL_FINAL.mov22m 41s

94%Keyword + Semantic

Alex

We talked about the budget allocation and they want a revised breakdown by Friday.

MiniLM word embeddings match the meaning of your query against transcript segments. “Discussing the budget” finds “figure out the costs” and “flagged our spend” because they mean the same thing, even though the words are completely different. Runs alongside keyword search on every query.

Navigate

Click any line, jump to the frame

Click any timestamp to jump directly to that moment in the video. The transcript highlights and scrolls in sync as playback continues, with speaker labels so you always know who is talking.

EXT_PARK_INTERVIEW.mov

32m 07s · 4.2 GB · 3840x2160

ProRes 4224K

Framerate23.976

Audio48kHz

Scenes47

CameraRED V-Raptor

SourceD:\Footage\Park

Indexed2 days ago

2m 31s - 2m 38sSarah

But once we had everything sorted, the first two days of shooting were incredible.

2m 38s - 2m 47sAlex

I remember the sound team had issues with the wind though. We ended up using the backup lavs.

2m 47s - 2m 53sSarah

The boom was basically unusable above the treeline. Lesson learned for next time.

2m 53s - 2m 59sDavid

We should budget for a dedicated wind rig on the next shoot.

2m 59s - 3m 08sSarah

Agreed. And the drone footage from day three more than made up for it.

Click any line to jump to that moment

Export

Send transcript segments to your NLE

Select transcript segments and export them as timeline-ready files.

FCPXML v1.11 - Final Cut Pro
FCP7 XML - Legacy FCP & Premiere Pro
CMX 3600 EDL - DaVinci Resolve & Avid
LosslessCut CSV - Lossless trimming, no re-encode

FAQ

Common questions

What languages are supported?

FrameQuery uses Whisper for transcription, supporting 90+ languages with automatic language detection. You don't need to specify the language in advance.

Can I search across multiple videos at once?

Yes. Your transcript search runs across every indexed video in your library. Results are ranked by relevance and show which video and timecode each match is from.

What's the difference between keyword and semantic search?

Keyword search finds exact word matches using BM25 scoring. Semantic search uses MiniLM word embeddings to match by meaning, so “discussing finances” can find “we need to talk about the budget”. Both run simultaneously.

Does transcription cost extra?

No. Transcription runs as part of the standard indexing pipeline on every plan, including Free.

More Features

Face Recognition

Find anyone by face or voice across your entire video library.

Visual Search

Search footage by describing what you see in plain language.

Search every word,spoken on camera