🔍 Evaluating RAG systems is crucial yet challenging. Nour El Mawass and Maria Knorps dive into the stability and usefulness of RAG evaluation libraries. They explore the impact of evaluator LLMs, query reformulation, and dataset characteristics on performance. This talk offers insights into selecting and interpreting LLM-based evaluations effectively.
🎥 https://youtu.be/yrWmK3pXrKk
=> More informations about this toot | More toots from PyDataParis@mastodon.social
text/gemini
This content has been proxied by September (3851b).