Zero-overhead idle
The 800MB Parakeet model is evicted after five minutes of inactivity, returning memory to the OS automatically.
$5 early access price
Airakeet wraps NVIDIA's Parakeet ASR into a silky macOS menubar experience that unloads when idle, respects 8GB RAM ceilings, and keeps every word on-device.
800
MB model footprint, unloaded when idle
~45
× real-time factor on M2 MacBook Air
5
minute auto-off to return all RAM
Why it matters
Most dictation apps idle at 2–3GB. Airakeet uses aggressive unloads, streaming buffers, and CoreML tuning to stay invisible until you need it.
Inference runs exclusively on the Apple Neural Engine, keeping the CPU free and the laptop fanless during long sessions.
A translucent HUD shows a live waveform of your mic input, and its colors are fully customizable.
Dictation drops straight into the active text field via the clipboard and a simulated Cmd+V: no extra permissions, no network.
Supports standard shortcuts, Fn combos, and a dedicated Shift+Fn gesture for quick starts even on compact keyboards.
Replay exactly what the engine heard via a safety cache so you can validate inputs without uploading sensitive data.
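The idle eviction described on this page (drop the model after five minutes without use) could be sketched roughly as follows. This is an illustration in Python, not Airakeet's native code: `IdleEvictingModel`, `loader`, and the injectable `clock` are hypothetical names chosen for the example.

```python
import time
from typing import Callable, Optional


class IdleEvictingModel:
    """Holds a lazily loaded model and drops it after a period of inactivity."""

    def __init__(self, loader: Callable[[], object], idle_seconds: float = 300.0,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self._loader = loader            # hypothetical: returns the loaded model
        self._idle_seconds = idle_seconds
        self._clock = clock              # injectable so the logic can be tested
        self._model: Optional[object] = None
        self._last_used = clock()

    @property
    def is_loaded(self) -> bool:
        return self._model is not None

    def acquire(self) -> object:
        """Return the model, loading it on first use, and mark it as active."""
        if self._model is None:
            self._model = self._loader()
        self._last_used = self._clock()
        return self._model

    def evict_if_idle(self) -> bool:
        """Drop the model once it has been idle long enough. True on eviction."""
        if self._model is None:
            return False
        if self._clock() - self._last_used >= self._idle_seconds:
            self._model = None  # releasing the last reference lets memory be reclaimed
            return True
        return False
```

A periodic timer would call `evict_if_idle()`, and the next dictation request would call `acquire()`, paying the load cost only after a long pause.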
Engineering story
From CoreML conversion to macOS UX polish, Airakeet is a full-stack native build that spotlights low-level craftsmanship.
Converted NVIDIA Parakeet TDT 0.6B to CoreML with quantization and ANE-friendly ops.
Menubar-only surface, no analytics, and scoped macOS permissions keep the threat model tiny.
Custom event tap avoids global listeners to reduce CPU wakeups while preserving instant response.
Extract-and-clear buffers plus timed auto-unload keep RAM usage flat during long recordings.
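One way to read "extract-and-clear" is that the recorder hands its accumulated samples to the engine in a single step and starts over with an empty buffer, so memory does not grow with session length. A minimal sketch under that assumption (`ExtractAndClearBuffer` is a hypothetical name, and plain Python lists stand in for audio buffers):

```python
from typing import List


class ExtractAndClearBuffer:
    """Accumulates audio samples and hands them off in one step, leaving itself empty."""

    def __init__(self) -> None:
        self._samples: List[float] = []

    def append(self, chunk: List[float]) -> None:
        """Add a chunk of samples captured from the microphone."""
        self._samples.extend(chunk)

    def extract(self) -> List[float]:
        """Return everything buffered so far and reset the buffer."""
        taken, self._samples = self._samples, []
        return taken

    def __len__(self) -> int:
        return len(self._samples)
```

Because `extract()` swaps in a fresh list instead of copying, the buffer holds at most one segment's worth of samples at any time.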
Results
Transcribes five seconds of speech in 0.11s, injects text instantly, and stays invisible until summoned. Perfect companion for essays, code reviews, or meeting notes.
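For reference, the arithmetic behind that figure is shown below. The only inputs are the two numbers quoted above; the variable names are illustrative.

```python
audio_seconds = 5.0        # length of the spoken clip
processing_seconds = 0.11  # time taken to transcribe it

# Speed-up over real time: how many seconds of audio are handled per second.
speedup = audio_seconds / processing_seconds

# Real-time factor in the conventional sense: processing time per second of audio.
rtf = processing_seconds / audio_seconds

print(f"{speedup:.1f}x real time, RTF {rtf:.3f}")  # prints "45.5x real time, RTF 0.022"
```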
Under the hood
Parakeet is NVIDIA’s family of speech-recognition models; Airakeet runs a version converted to CoreML. Think of it as a musician trained on billions of sentences who performs directly on your Mac instead of on a cloud stage.
The 1.1B Parakeet-EOU build will unlock live dictation with punctuation, multilingual translation, and smarter “keep listening” behavior without adding cloud latency.
| Attribute | Parakeet | Local Whisper |
|---|---|---|
| Latency target | Optimized for low-latency partials so you see text mid-sentence. | Batch-first decoding introduces a pause before the first characters appear. |
| Hardware sweet spot | Runs comfortably on the Apple Neural Engine with 8GB RAM. | Prefers discrete GPU or 16GB+ unified memory to stay smooth. |
| Streaming feel | Designed for incremental injection with ANE offload. | Often buffers a full sentence before emitting, so text arrives in bursts. |
Forward-looking
Work on the higher-capacity model resumes once I upgrade to a 32GB MacBook Air, so I can validate the 1.1B model end-to-end.
Refactor `ASREngine` to load either 0.6B or 1.1B models on demand and prevent RAM collisions.
Streaming text, word-by-word insertion, and silence detection using Parakeet-EOU for instant feedback.
Benchmark M5 hardware, monitor thermals, and explore NVIDIA Canary for multilingual translation.
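The planned `ASREngine` refactor (load either model on demand without both sitting in RAM) might follow a pattern like the one below. This is a sketch under stated assumptions, not the app's implementation: the class body, the `use` method, and the variant keys are hypothetical, and it is written in Python for brevity.

```python
from typing import Callable, Dict, Optional


class ASREngine:
    """Keeps at most one model variant in memory at a time."""

    def __init__(self, loaders: Dict[str, Callable[[], object]]) -> None:
        self._loaders = loaders  # variant name -> hypothetical loader function
        self._active_name: Optional[str] = None
        self._active_model: Optional[object] = None

    @property
    def active_variant(self) -> Optional[str]:
        return self._active_name

    def use(self, name: str) -> object:
        """Return the requested variant, unloading the current one first if needed."""
        if name not in self._loaders:
            raise KeyError(f"unknown model variant: {name}")
        if self._active_name != name:
            # Drop the current model before loading the next so both never coexist.
            self._active_model = None
            self._active_name = None
            self._active_model = self._loaders[name]()
            self._active_name = name
        return self._active_model
```

Releasing the old reference before calling the new loader is the step that avoids the "RAM collision": peak memory stays at one model, at the cost of a reload when switching.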
Future: Streaming engine
Rolls out once I’m on a new 32GB MacBook Air so the 1.1B model fits comfortably—nothing required on your end.
With this update you’ll be able to choose between today’s ultra-efficient 0.6B engine and a higher-capacity Parakeet-EOU 1.1B build. The bigger model will enable true word-by-word streaming and end-of-utterance (EOU) timing, so Airakeet feels like it’s reading your mind.
Ready when you are
Airakeet keeps every syllable on your hardware and is available through a private early-access program. Secure the introductory $5 access (regular $10) while seats are open.