Vision · M87 Labs

Moondream

Tiny open vision-language model for efficient image understanding.

FREEMIUMOpen coreHybridWebAPI

An open-weights family of small vision-language models for captioning, visual Q&A, pointing, counting, and object detection — small enough to run on-device (checkpoints down to 0.5B on Hugging Face). Run it locally with the Photon engine, or call Moondream Cloud's OpenAI-compatible API with a free monthly credit tier and pay-per-image pricing.

Model support

Self-contained (on-device)

Ships its own open vision-language weights.

Where it runs

Moondream

Tiny open vision-language model for efficient image understanding.

FREEMIUMOpen coreHybridWebAPI

Model support

Self-contained (on-device)

Ships its own open vision-language weights.

Where it runs

Moondream

Self-contained (on-device)

LandingAI

Roboflow

Voxel51

Moondream

Self-contained (on-device)

LandingAI

Roboflow

Voxel51