Your Voice.
Your AI.
Fully Local.
A premium AI voice assistant that runs entirely on your hardware.
No cloud. No latency. No compromise.
The Future is Local
Uncompromising performance meets absolute privacy.
"Hey Nova"
Just say the word and she's there. Custom trigger words, ambient listening, no buttons, no clicks. She only answers when you actually want her to.
Ultra-Low Latency
Near-instant response times because nothing has to bounce through the cloud first. Your voice gets processed here, on your machine, not in a rack three time zones away.
Voice Cloning
Clone any voice with just seconds of reference audio. Pick the voice you want. Everything stays on your disk.
Context Analysis
Analyze, debug, and explain code or any type of context via voice commands. Deep LLM integration with bridge overlay built right in.
Voice Dictation
High-accuracy speech-to-text for long-form content, emails, and code documentation. Talk naturally, get perfect text.
100% Local & Private
No data ever leaves your machine. No cloud, no telemetry, no compromises. Your conversations stay yours.
Smarter than you think
Three benchmarks people actually recognize: knowledge (MMLU), math (GSM8K), code (HumanEval). Nova's recommended local model is Qwen3.5-9B. On one GPU, about 16GB VRAM is enough to run it comfortably. We line that up next to a 2022-era cloud assistant and a 2025 frontier cloud model so you can see how far local hardware has come.
| Benchmark | GPT-3.5 Turbo (2022)Cloud · early ChatGPT era | Nova (Qwen3.5-9B)Recommended · local · ~16GB VRAM | GPT-5.2 (2025 top)Frontier cloud |
|---|---|---|---|
| MMLU (knowledge) | 70.0%llm-stats |
85.0%llmbase |
90.2%llm-stats |
| GSM8K (math) | 57.1%llm-stats |
92.1%all-ai |
97.3%llm-stats |
| HumanEval (coding) | 80.3%llm-stats |
82.7%xda |
94.5%llm-stats |
The pills point to the public model page or directory each number came from. Everyone measures a little differently, so verify on the source if you need precision. Start with Qwen3.5-9B (Nova's recommended local model), OpenAI · GPT-5.2, and whatever directory you trust for GPT-3.5 Turbo. MMLU is broad multiple-choice knowledge across 57 subjects. GSM8K is grade-school math word problems. HumanEval is Python from docstrings (pass@1 wording depends on the source).
See Nova in Action
Real demo. No cuts. No tricks.
Engineered for Performance
Custom Whisper STT
Industry-leading speech-to-text, runs fully on your hardware.
Wide LLM Compatibility
Ollama and OpenAI-compatible APIs, so you can swap models without rebuilding your workflow.
AllTalk XTTS v2
Natural voice synthesis and cloning, 100% local.
~16GB VRAM
Realistic floor for the recommended Qwen3.5-9B stack on one GPU, with room left for the rest of the pipeline.