Skip to content

Fine-tuning · Modal Labs

Modal

Serverless GPUs. Run training, inference, batch jobs from Python.

FREEMIUMAPICLI

Define cloud workloads in Python, deploy with one command — GPU access on demand, fast cold starts, fair-share pricing. The default 'I need to fine-tune a model from a Jupyter cell' platform.

Model support

Model-agnostic

Where it runs

  • API
  • CLI

Tags

  • #gpu
  • #serverless
  • #python
  • #training
Open ModalDocsPricing

Related in Fine-tuning

  • View Together AI details
    Fine-tuningFREEMIUMVetted

    Together AI

    Together

    Fine-tuning + inference for open-weights models. Broad coverage.

    Hosted inference and fine-tuning across hundreds of open-weights models (Llama, Mistral, DeepSeek, Qwen, etc.). Strong pricing for inference-at-scale; LoRA + full fine-tuning supported.

    • inference
    • fine-tuning
    • open-weights
    • lora
  • View Fireworks AI details
    Fine-tuningFREEMIUM

    Fireworks AI

    Fireworks AI

    Fast inference + fine-tuning. Production deployments at scale.

    Optimized inference platform for open-weights models with strong latency numbers and serverless + dedicated deployment options. Fine-tuning supported; vision and audio models alongside text.

    • inference
    • fine-tuning
    • low-latency
    • production
  • View OpenPipe details
    Fine-tuningFREEMIUM

    OpenPipe

    OpenPipe

    Replace frontier-model spend with a fine-tuned small model.

    Captures your production OpenAI / Anthropic calls, builds a dataset, fine-tunes a small open-weights model on your traffic, then serves the swap behind your existing SDK. The pitch: 10x cost reduction at parity.

    • fine-tuning
    • cost-reduction
    • drop-in
    • open-weights