Fine-tuning · Together

Together AI

Fine-tuning + inference for open-weights models. Broad coverage.

FREEMIUMAPIVetted

Hosted inference and fine-tuning across hundreds of open-weights models (Llama, Mistral, DeepSeek, Qwen, etc.). Strong pricing for inference-at-scale; LoRA + full fine-tuning supported.

Model support

Multi-model

Llama
Mistral
DeepSeek
Qwen

Where it runs

Tags

#inference
#fine-tuning
#open-weights
#lora

Open Together AI Docs Pricing

Related in Fine-tuning

View Fireworks AI details
Fine-tuningFREEMIUM
Fireworks AI
Fireworks AI
Fast inference + fine-tuning. Production deployments at scale.
Optimized inference platform for open-weights models with strong latency numbers and serverless + dedicated deployment options. Fine-tuning supported; vision and audio models alongside text.
- inference
- fine-tuning
- low-latency
- production
Open
View Modal details
Fine-tuningFREEMIUM
Modal
Modal Labs
Serverless GPUs. Run training, inference, batch jobs from Python.
Define cloud workloads in Python, deploy with one command — GPU access on demand, fast cold starts, fair-share pricing. The default 'I need to fine-tune a model from a Jupyter cell' platform.
- gpu
- serverless
- python
- training
Open
View OpenPipe details
Fine-tuningFREEMIUM
OpenPipe
OpenPipe
Replace frontier-model spend with a fine-tuned small model.
Captures your production OpenAI / Anthropic calls, builds a dataset, fine-tunes a small open-weights model on your traffic, then serves the swap behind your existing SDK. The pitch: 10x cost reduction at parity.
- fine-tuning
- cost-reduction
- drop-in
- open-weights
Open

Open Together AI

Multi-model

Fireworks AI

Modal

OpenPipe