Browser-Ready Models (ONNX / WASM / WebGPU)

Models that run directly in the browser — no server, no Python, no setup. These are the models our tools can use.

Transformers.js Models

Transformers.js runs Hugging Face models in the browser via ONNX Runtime Web. All models below have ONNX exports.

Text

| Model | Task | Size | Speed | HF Link |
| --- | --- | --- | --- | --- |
| distilbert-base-uncased | Sentiment/classification | 67 MB | Fast | HF |
| bert-base-uncased | Fill-mask, embeddings | 110 MB | Medium | HF |
| t5-small | Summarization, translation | 60 MB | Medium | HF |
| all-MiniLM-L6-v2 | Sentence embeddings | 23 MB | Fast | HF |
| Xenova/multilingual-e5-small | Multilingual embeddings | 118 MB | Medium | HF |
| Xenova/distilgpt2 | Text generation | 88 MB | Medium | HF |
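
The embedding models in this table return one numeric vector per input text, and those vectors are usually compared with cosine similarity. The sketch below shows that comparison in plain TypeScript. It assumes the vectors have already been produced by one of the models; `cosineSimilarity` and `rankBySimilarity` are hypothetical helper names, not part of Transformers.js.

```typescript
// Cosine similarity between two embedding vectors of equal length.
// Returns a value in [-1, 1]; higher means more similar.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length || a.length === 0) {
    throw new Error("Vectors must be non-empty and of equal length");
  }
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  // A zero vector has no direction; this sketch defines its similarity as 0.
  if (normA === 0 || normB === 0) return 0;
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Hypothetical helper: return candidate indices ordered from most to
// least similar to the query vector.
function rankBySimilarity(query: number[], candidates: number[][]): number[] {
  return candidates
    .map((vec, index) => ({ index, score: cosineSimilarity(query, vec) }))
    .sort((x, y) => y.score - x.score)
    .map((entry) => entry.index);
}
```

In a semantic-search tool, `query` would be the embedding of the user's text and `candidates` the embeddings of the stored documents.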

Vision

| Model | Task | Size | Speed | HF Link |
| --- | --- | --- | --- | --- |
| mobilenet_v2 | Image classification (1000 classes) | 14 MB | Very fast | HF |
| detr-resnet-50 | Object detection | 160 MB | Slow | HF |
| vit-base-patch16-224 | Image classification | 330 MB | Slow | HF |
| clip-vit-base-patch32 | Image-text similarity | 150 MB | Medium | HF |
| sam-vit-base | Segment anything | 380 MB | Slow | HF |

Audio

| Model | Task | Size | Speed | HF Link |
| --- | --- | --- | --- | --- |
| whisper-tiny | Speech-to-text | 39 MB | Fast | HF |
| whisper-small | Speech-to-text | 244 MB | Medium | HF |
| wav2vec2-base | Speech recognition | 95 MB | Medium | HF |
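
Because download size is the main cost of running a model in the browser, a tool may want to pick the smallest model that covers a task. The sketch below shows one way to do that with a typed catalog. The rows are copied from the Transformers.js tables in this section, while the `ModelEntry` shape and the `modelsWithinBudget` function are hypothetical and only illustrate the idea.

```typescript
type Speed = "Very fast" | "Fast" | "Medium" | "Slow";

interface ModelEntry {
  name: string;
  task: string;
  sizeMB: number;
  speed: Speed;
}

// A few rows copied from the tables in this section; extend as needed.
// The mobilenet_v2 task is shortened to "Image classification" so that it
// matches vit-base-patch16-224 when filtering.
const catalog: ModelEntry[] = [
  { name: "all-MiniLM-L6-v2", task: "Sentence embeddings", sizeMB: 23, speed: "Fast" },
  { name: "mobilenet_v2", task: "Image classification", sizeMB: 14, speed: "Very fast" },
  { name: "vit-base-patch16-224", task: "Image classification", sizeMB: 330, speed: "Slow" },
  { name: "whisper-tiny", task: "Speech-to-text", sizeMB: 39, speed: "Fast" },
  { name: "whisper-small", task: "Speech-to-text", sizeMB: 244, speed: "Medium" },
];

// Return the models for a task that fit a download budget, smallest first.
function modelsWithinBudget(
  task: string,
  maxSizeMB: number,
  entries: ModelEntry[] = catalog,
): ModelEntry[] {
  return entries
    .filter((m) => m.task === task && m.sizeMB <= maxSizeMB)
    .sort((a, b) => a.sizeMB - b.sizeMB);
}
```

For example, `modelsWithinBudget("Speech-to-text", 100)` keeps only whisper-tiny, since whisper-small exceeds the 100 MB budget.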

TensorFlow.js Models

Pre-trained models that run in the browser via TensorFlow.js. Some also support transfer learning (fine-tuning), as noted in the table.

| Model | Task | Size | Transfer Learning? | Link |
| --- | --- | --- | --- | --- |
| MobileNet v2 | Image classification | 7 MB | Yes (retrain last layer) | tfhub.dev |
| COCO-SSD | Object detection | 5 MB | No | github.com/tensorflow |
| BlazeFace | Face detection | 400 KB | No | github.com/tensorflow |
| PoseNet | Body pose estimation | 4-13 MB | No | github.com/tensorflow |
| Universal Sentence Encoder | Sentence embeddings | 28 MB | No | tfhub.dev |
| Toxicity | Toxic text detection | 25 MB | No | github.com/tensorflow |

ONNX Model Zoo

Optimized models in ONNX format that can be loaded with ONNX Runtime Web for browser inference.

| Model | Task | Size | ONNX Zoo Link |
| --- | --- | --- | --- |
| ResNet-50 | Image classification | 97 MB | github.com/onnx/models |
| EfficientNet-Lite | Image classification | 19 MB | github.com/onnx/models |
| SSD MobileNet | Object detection | 27 MB | github.com/onnx/models |
| BERT-Squad | Question answering | 130 MB | github.com/onnx/models |
| GPT-2 | Text generation | 600 MB | github.com/onnx/models |

WebGPU-Accelerated

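Libraries and models that use WebGPU, which can give up to 10-50x faster inference than WebGL depending on the model and hardware. Requires a browser with WebGPU support, such as Chrome 113+ or Edge 113+.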

| Library | Models | WebGPU Support | Link |
| --- | --- | --- | --- |
| Transformers.js v3 | All ONNX models above | Yes (via ONNX Runtime) | huggingface.co/docs/transformers.js |
| WebLLM | LLaMA, Mistral, Gemma, Phi | Yes (full LLM in browser) | webllm.mlc.ai |
| MediaPipe | Face, hand, pose, object | Yes | ai.google.dev/edge/mediapipe |
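
Since WebGPU is only available in some browsers, a page typically checks for it before choosing a backend. The sketch below shows one way to do that check, falling back to WASM when no adapter is available. `navigator.gpu.requestAdapter()` is the standard WebGPU entry point; the `pickBackend` function and the `NavigatorLike` type are hypothetical names introduced here so the example also runs outside a browser.

```typescript
// Minimal structural types so the sketch can run outside a browser;
// in a real page you would pass the global `navigator`.
interface GpuLike {
  requestAdapter(): Promise<object | null>;
}
interface NavigatorLike {
  gpu?: GpuLike;
}

type Backend = "webgpu" | "wasm";

// Choose "webgpu" only when the browser exposes navigator.gpu AND an
// adapter can actually be obtained; otherwise fall back to "wasm".
async function pickBackend(nav: NavigatorLike | undefined): Promise<Backend> {
  if (!nav || !nav.gpu) return "wasm";
  try {
    const adapter = await nav.gpu.requestAdapter();
    return adapter ? "webgpu" : "wasm";
  } catch {
    // requestAdapter can reject in restricted contexts; treat as unsupported.
    return "wasm";
  }
}
```

The result could then be passed to a library as its device setting (Transformers.js v3, for example, accepts a `device` option such as `'webgpu'`), though the exact option names should be checked against the version in use.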

Integration with FreeAgentStore

Many of these browser-ready models are already packaged as tools on FreeAgentStore. Check the Models tab for ready-to-use versions.