Yomi is a Windows AI buddy that sees your screen, hears your voice, and helps you move through laptop work without breaking flow.
How it works
Voice, type, or just press enter. Every interaction counts the same.
Voice + Screen
Press to start recording, speak your question, then press Enter to send. Yomi captures your screen and responds with text and voice.
Type + Screen
Type a question. Yomi captures your screen and returns a text response.
Just Screen
Open the text panel and press Enter with an empty input. Yomi analyzes your screen and tells you what's on it.
Built to disappear
Yomi captures context from whatever you're looking at. No copy-pasting, no describing. It just knows.
Push to talk or always-on VAD. Sub-2-second response on the fast path. Ask anything, anytime.
Screenshots are used only for your query and never stored by Yomi. No background recording, no silent capture.
Pricing
Start free. Upgrade when you outgrow it.
Explore
Free$0 / year
Start with screen-aware AI, voice, and memory basics. No card needed.
Pro
$144 / year
Daily screen, voice, memory, images, and useful foreground automation.
Max
Power users$384 / year
Heavy automation, long-context work, and high-volume creation.
* Fair usage protection applies. Explore is free with monthly limits.
Download
Windows is available now. macOS is coming soon.
Downloads come directly from GitHub Releases. Yomi is pre-release. sign up for early access