Kokoro 82M
Hexgrad202588 MBTTS
English text-to-speech with 28 voices (American + British accents)
Open-source LLMs running in your browser. Nothing leaves your device. Small models are rough and nothing like ChatGPT; larger ones are more capable but slow to download.
Downloads once, then caches in your browser. Nothing leaves this device.
Detecting device capabilities...
Hexgrad202588 MBTTS
English text-to-speech with 28 voices (American + British accents)
Hugging Face2024182 MBDecoder2K ctx
Tuned with DPO for on-device chat, text rewriting, and function calls
MBZUAI2023260 MBSeq2Seq1K ctx
Fine-tuned on 2.58M LaMini instructions for general instruction following
Liquid AI2026280 MBHybrid33K ctx
Designed for tool calling and structured extraction; covers 9 languages
Microsoft2024320 MBVision
Single model for captioning, object detection, OCR, and phrase grounding
Hugging Face2024388 MBDecoder2K ctx
Stronger at reasoning and instructions while staying fully on-device
Meta2022410 MBSeq2Seq1K ctx
Dedicated news summariser fine-tuned on CNN/DailyMail text-summary pairs
Alibaba2025571 MBDecoder33K ctx
Toggles thinking mode on or off; covers 100+ languages and agent tasks
Google2025763 MBDecoder33K ctx
Lightweight multilingual chat supporting 140+ languages
Meta20241.1 GBDecoder4K ctx
Distilled from larger models for fast on-device chat and tool use
Alibaba20241.3 GBDecoder33K ctx
Handles code generation, fixing, and reasoning across 40+ programming languages
IBM20261.8 GBASR
Speech recognition and translation across 6 languages, ranked #1 on OpenASR
Alibaba20241.8 GBDecoder33K ctx
Tuned for coding, math, and reliable structured output like JSON
Microsoft20242.3 GBDecoder131K ctx
Trained on textbook-quality data for strong reasoning and coding
Hugging Face20252.7 GBDecoder128K ctx
Supports togglable thinking mode and tool calling; fully open weights
DeepSeek20252.9 GBMultimodal
Generates images from text and answers questions about image content
Google20263.4 GBMultimodal128K ctx
Accepts text, images, and audio; reasons and generates text in 140+ languages
Everything on this page runs locally in your browser. These panels explain the technology behind it.
Thank you to the open-source ecosystem powering this page.
We gratefully acknowledge Hugging Face Hub, Transformers.js, ONNX Runtime Web, the ONNX Community, and model authors/publishers including Hexgrad, Hugging Face, MBZUAI, Liquid AI, Microsoft, Meta, Alibaba, Google. Please review each model card and licence before use.