Eval · Patronus AI
Automated evaluation, guardrails, and monitoring for AI systems.
A platform for evaluating, guarding, and monitoring LLM and agent applications across the deployment lifecycle. It is anchored by research-backed evaluator models: Lynx (hallucination detection), GLIDER (LLM judge), and Percival (agent-trace debugger). It offers a self-serve API with free credits, usage-based pricing, and enterprise plans.
Model support
Proprietary evaluator models that score third-party LLM and agent outputs.