Cookbook

Advanced use cases and common recipes

  • Use a Judge
  • Add a custom evaluator
  • Evaluate an LLM response
  • Evaluate a multi-turn chatbot conversation
  • RAG evaluation
  • Connect a model
  • Run batch evaluations
  • Comprehensively Test Your LLM Code
  • Find the best prompt and model
  • CLI
  • Python SDK Examples
  • Poker app