Vibe train SLM evals in Claude Code

Build production-ready AI judges without leaving Claude

  • Start Zero shot or with CSV

  • Get your endpoint in minutes

  • > 15% + accuracy improvement

  • < 100ms accuracy and low cost

LLM OUTPUT

"Here's a clear explanation..."

P

Eval

Helpful
Learn more about use cases and get examples here