Semantic Search
Semantic Search uses AI embeddings to find similar transcriptions by meaning, not just keywords. It's available in the History window alongside text-based search.
How It Works
- Transcriptions are embedded in batches using Gemini's gemini-embedding-001 model
- Each transcription becomes a 768-dimensional vector stored locally in MongoDB
- When you search, your query is also embedded
- Results are ranked by cosine similarity — how closely the meaning matches
This means searching for "project deadline" can find a transcription that says "we need to finish by Friday" even though the words don't overlap.
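The ranking step above comes down to cosine similarity: two texts with similar meaning get embedding vectors pointing in similar directions. A minimal sketch in pure Python, using tiny 3-dimensional stand-in vectors rather than real 768-dimensional Gemini embeddings:

```python
import math

def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two vectors (1.0 = identical direction)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

# Toy stand-ins: a real system would use the 768-dimensional embeddings
query       = [0.2, 0.9, 0.1]    # e.g. "project deadline"
close_match = [0.25, 0.85, 0.15] # e.g. "we need to finish by Friday"
unrelated   = [0.9, -0.1, 0.4]   # e.g. "lunch order for the team"

print(cosine_similarity(query, close_match))  # high, near 1.0
print(cosine_similarity(query, unrelated))    # noticeably lower
```

Because only the direction of the vectors matters, transcriptions with overlapping meaning score highly even when they share no keywords.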
Using Semantic Search
- Open the History window (from the History tab or menu)
- Switch to the Semantic Search tab
- Type a search query describing what you're looking for
- Optionally filter by date range
- Results appear ranked by similarity score (0-100%)
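The steps above can be sketched as a single ranking function. This is an illustrative sketch, not the app's actual code: record fields (`text`, `vector`, `date`) and the `semantic_search` name are assumptions, and toy 3-dimensional vectors stand in for real query embeddings.

```python
import math
from datetime import date

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(x * x for x in b)))

def semantic_search(query_vec, records, date_from=None, date_to=None):
    """Filter records by optional date range, then rank by similarity (0-100%)."""
    hits = []
    for rec in records:
        if date_from and rec["date"] < date_from:
            continue
        if date_to and rec["date"] > date_to:
            continue
        hits.append({"text": rec["text"],
                     "score": round(100 * cosine(query_vec, rec["vector"]))})
    return sorted(hits, key=lambda h: h["score"], reverse=True)

records = [
    {"text": "we need to finish by Friday", "vector": [0.3, 0.9, 0.1], "date": date(2024, 5, 2)},
    {"text": "lunch order for the team",    "vector": [0.9, 0.1, 0.4], "date": date(2024, 5, 3)},
]
query_vec = [0.25, 0.95, 0.05]  # pretend embedding of "project deadline"
print(semantic_search(query_vec, records))
```

The date filter runs before scoring, so out-of-range transcriptions never appear in the ranked results.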
Similarity Scores
- 80-100%: Very strong match — the transcription closely relates to your query
- 60-80%: Good match — relevant content
- 40-60%: Moderate match — loosely related
- Below 40%: Weak match — may not be relevant
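The bands above map directly to score thresholds. A minimal sketch (the `match_label` name and exact labels are illustrative):

```python
def match_label(score_pct: float) -> str:
    """Map a 0-100% similarity score to the bands described above."""
    if score_pct >= 80:
        return "Very strong match"
    if score_pct >= 60:
        return "Good match"
    if score_pct >= 40:
        return "Moderate match"
    return "Weak match"

print(match_label(92))  # Very strong match
print(match_label(55))  # Moderate match
```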
Embedding Coverage
The Semantic Search tab shows embedding coverage, e.g. "Embeddings: 150 / 200 (75% coverage)".
Embeddings are generated in the background when you accumulate 100+ unembedded transcripts. This batch approach:
- Minimizes API calls
- Doesn't block the UI
- Processes oldest transcripts first
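The batching logic above can be sketched as a threshold check plus oldest-first ordering. A hypothetical sketch, assuming transcripts carry a `created_at` field; the `next_batch` name is illustrative:

```python
BATCH_SIZE = 100  # matches the embedding_batch_size default

def next_batch(unembedded, batch_size=BATCH_SIZE):
    """Return the next batch to embed, or None if too few have accumulated.

    Waiting for a full batch keeps API calls infrequent, and sorting by
    created_at means the oldest transcripts are embedded first.
    """
    if len(unembedded) < batch_size:
        return None
    return sorted(unembedded, key=lambda t: t["created_at"])[:batch_size]

pending = [{"created_at": i, "text": f"transcript {i}"} for i in range(150, 0, -1)]
batch = next_batch(pending)
print(len(batch), batch[0]["created_at"])  # full batch, starting at the oldest
```

Because the check returns None below the threshold, the background worker can poll cheaply without ever blocking the UI.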
Configuration
Configure in Settings:
| Setting | Default | Description |
|---|---|---|
| embedding_enabled | True | Enable/disable semantic search |
| embedding_model | gemini-embedding-001 | Embedding model |
| embedding_dimensions | 768 | Vector size |
| embedding_batch_size | 100 | Transcripts per batch |
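The defaults above can be captured in a small settings object. A sketch only: the `EmbeddingSettings` class is illustrative, though the field names and defaults come straight from the table.

```python
from dataclasses import dataclass

@dataclass
class EmbeddingSettings:
    """Defaults mirror the configuration table; override any field in Settings."""
    embedding_enabled: bool = True
    embedding_model: str = "gemini-embedding-001"
    embedding_dimensions: int = 768
    embedding_batch_size: int = 100

settings = EmbeddingSettings()
print(settings.embedding_model, settings.embedding_dimensions)
```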
Cost
Gemini's gemini-embedding-001 model is free to use (up to 1,500 requests per minute), so semantic search adds no cost to your usage.