
Examples

Advanced use cases and common recipes

  • Common Workflows
  • Use a Judge
  • Add a custom evaluator
  • Evaluate an LLM response
  • Evaluate a multi-turn chatbot conversation
  • RAG evaluation
  • Connect a model
  • Run batch evaluations
  • Comprehensively Test Your LLM Code
  • Find the best prompt and model
  • OTEL Trace Evaluation via CLI
  • CLI
  • Python SDK Examples
  • Poker app

Last updated 1 month ago