Eval · Patronus AI

Patronus AI

Automated evaluation, guardrails, and monitoring for AI systems.

FREEMIUM · Cloud · Web · API

Platform for evaluating, guarding, and monitoring LLM and agent applications across the deployment lifecycle. Anchored by research-backed evaluator models — Lynx (hallucination detection), GLIDER (LLM judge), and Percival (agent-trace debugger). Offers a self-serve API with free credits, usage-based pricing, and enterprise plans.

Model support

Self-contained (on-device)

Lynx
GLIDER
Percival

Proprietary evaluator models that score third-party LLM and agent outputs.

Where it runs


Patronus AI

Self-contained (on-device)

DeepEval

Ragas

Braintrust

Promptfoo
