Configuration
AI Transcription Notepad stores settings in ~/.config/voice-notepad-v3/.
API Key
You need an OpenRouter API key. OpenRouter is the sole provider, giving access to Gemini models with per-key cost tracking.
Set your key via environment variable:
OPENROUTER_API_KEY=your_keyOr configure it in Settings → API Keys within the app.
Get an API key at openrouter.ai/keys.
Models
The default model is google/gemini-3-flash-preview (Gemini 3 Flash). You can also select google/gemini-3-pro-preview (Gemini 3 Pro) for higher quality output. Change the model in Settings → Models or from the toolbar dropdown.
Behavior Settings
Voice Activity Detection removes silence from audio before sending to the API. This reduces file size and API costs. Uses TEN VAD, a lightweight native library bundled with the application. Enable in Settings → Behavior.
Automatic Gain Control normalizes audio levels for consistent transcription accuracy. Boosts quiet audio (up to +20dB) while leaving loud audio unchanged. Enable in Settings → Behavior.
Audio Archival saves recordings in Opus format (~24kbps) to ~/.config/voice-notepad-v3/audio-archive/. A one-minute recording uses about 180KB.
Cleanup Prompt
The cleanup prompt instructs the AI how to process your transcription. The default removes filler words, adds punctuation and paragraph spacing, follows verbal instructions in the recording, and returns markdown. Customize in Settings → Prompt.
Storage Locations
Settings and data are stored in ~/.config/voice-notepad-v3/:
config.json- API keys and preferencesmongita/- MongoDB-compatible transcript databaseusage/- Daily cost trackingaudio-archive/- Opus recordings (if enabled)