RAGChecker v0.1.0 Release Note

We are excited to announce the initial release of RAGChecker, version 0.1.0. RAGChecker is a comprehensive evaluation framework designed for in-depth analysis and diagnostics of Retrieval-Augmented Generation (RAG) systems.

Key Features

Fine-grained Evaluation: Utilizes claim-level entailment checking for detailed analysis of RAG system performance.
Comprehensive Metric Suite: Includes metrics for overall performance, retriever effectiveness, and generator capabilities.
Flexible Model Integration: Supports various LLMs for claim extraction and checking, including AWS Bedrock models.
Easy-to-use CLI: Provides a command-line interface for quick evaluation of RAG outputs.
Python API: Offers a Python API for seamless integration into existing workflows and scripts.

Metrics Included

Overall: Precision, Recall, F1 Score
Retriever: Claim Recall, Context Precision
Generator: Context Utilization, Noise Sensitivity, Hallucination, Self-knowledge, Faithfulness

Getting Started

To start using RAGChecker, install it via pip:

pip install ragchecker
python -m spacy download en_core_web_sm

For detailed usage instructions and examples, please refer to our GitHub repository: https://github.com/amazon-science/RAGChecker

Feedback and Contributions

As an open-source project, we welcome feedback, bug reports, and contributions from the community. Please use the GitHub issues section for reporting bugs or suggesting enhancements.

Thank you for your interest in RAGChecker. We look forward to seeing how it helps improve RAG systems across various applications!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RAGChecker v0.1.0

RAGChecker v0.1.0 Release Note

Key Features

Metrics Included

Getting Started

Feedback and Contributions