Ragas
Core Concepts
Metrics
Datasets and Experiment Results
Experiments