Skip to content

Ragas

Testset Generation for Agents or Tool use cases

explodinggradients/ragas

Ragas

explodinggradients/ragas

🚀 Get Started
🚀 Get Started
📚 Core Concepts
📚 Core Concepts
- Components
  Components
  - General
    General
    
    Prompt
  - Evaluation
    Evaluation
    
    Evaluation Sample
    
    Evaluation Dataset
- Metrics
  Metrics
  - Overview
  - Available Metrics
    
    Available Metrics
    
    Retrieval Augmented Generation
    Retrieval Augmented Generation
    
    Context Precision
    
    Context Recall
    
    Context Entities Recall
    
    Noise Sensitivity
    
    Response Relevancy
    
    Faithfulness
    
    Agents or Tool Use Cases
    Agents or Tool Use Cases
    
    Agentic or Tool use
    
    Topic Adherence
    
    Tool Call Accuracy
    
    Agent Goal Accuracy
    
    Natural Language Comparison
    Natural Language Comparison
    
    Factual Correctness
    
    Semantic Similarity
    
    Traditional non LLM metrics
    Traditional non LLM metrics
    
    Traditional NLP Metrics
    
    Non LLM String Similarity
    
    BLEU Score
    
    ROUGE Score
    
    String Presence
    
    Exact Match
    
    SQL
    SQL
    
    SQL
    
    Execution based Datacompy Score
    
    SQL Query Equivalence
    
    General Purpose
    General Purpose
    
    General Purpose Metrics
    
    Aspect Critic
    
    Simple Criteria Scoring
    
    Rubrics Based Scoring
    
    Instance Specific Rubrics Scoring
    
    Other Tasks
    Other Tasks
    
    Summarization
- Test Data Generation
  Test Data Generation
  - RAG
    RAG
    
    Testset Generation for RAG
    
    KG Building
    
    Scenario Generation
  - Agents or tool use
    Agents or tool use
    
    Testset Generation for Agents or Tool use cases
- Feedback Intelligence
  Feedback Intelligence
🛠️ How-to Guides
🛠️ How-to Guides
- Customizations
  Customizations
  - General
    General
    
    Customise models
    
    Run Config
  - Metrics
    Metrics
    
    Modify Prompts
    
    Add Custom Metrics
  - Testset Generation
    Testset Generation
    
    Add custom scenarios
    
    Seed Generation with Production Data
- Applications
  Applications
  - Cost Analysis
- Integrations
  Integrations
- Migrations
  Migrations
  - From v0.1 to v0.2
📖 References
📖 References
- Core
  Core
  - Prompt
  - LLMs
  - Embeddings
  - RunConfig
  - Executor
- Evaluation
  Evaluation
- Testset Generation
  Testset Generation
  - Schemas
  - Graph
  - Transforms
  - Synthesizers
  - Generation
- Integrations
❤️ Community

Testset Generation for Agents or Tool use cases

Evaluating agentic or tool use workflows can be challenging as it involves multiple steps and interactions. It can be especially hard to curate a test suite that covers all possible scenarios and edge cases. We are working on a set of tools to generate synthetic test data for evaluating agent workflows.

Talk to founders to work together on this and discover what's coming for upcoming releases.