Raget: Possible miscalculation of all Ragas metrics, in particular Precision and Recall #1924

Chabert-Liddell · 2024-05-03T09:32:06Z

Issue Type

Bug

Source

source

Giskard Library Version

2.11

Giskard Hub Version

OS Platform and Distribution

No response

Python version

No response

Installed python packages

No response

Current Behaviour?

Giskard RAGet uses the reference context when calling Ragas. 

https://github.com/Giskard-AI/giskard/blob/main/giskard/rag/metrics/ragas_metrics.py

        ragas_sample = {
            "question": question_sample["question"],
            "answer": answer,
            "contexts": question_sample["reference_context"].split("\n\n"),
            "ground_truth": question_sample["reference_answer"],
        }

According to Ragas documentation the retrieved context should be used (the one used for the answer Generation).

As an example, when computing Precision or Recall which both uses {"question", "contexts", "ground_truth"}, if you are giving the reference context, then you are evaluating your test set generation pipeline  and not your RAG pipeline.

Standalone code OR list down the steps to reproduce the issue

Relevant log output

No response

alexcombessie · 2024-05-07T16:13:41Z

@pierlj what do you think?

pierlj · 2024-05-07T16:44:51Z

Hi @Chabert-Liddell, you are right, thanks for pointing this out. A fix will be release soon!

pierlj mentioned this issue May 10, 2024

[GSK-3513] Fix RAGAS metric computation #1925

Merged

henchaves closed this as completed Jun 4, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Raget: Possible miscalculation of all Ragas metrics, in particular Precision and Recall #1924

Raget: Possible miscalculation of all Ragas metrics, in particular Precision and Recall #1924

Chabert-Liddell commented May 3, 2024

alexcombessie commented May 7, 2024

pierlj commented May 7, 2024

Raget: Possible miscalculation of all Ragas metrics, in particular Precision and Recall #1924

Raget: Possible miscalculation of all Ragas metrics, in particular Precision and Recall #1924

Comments

Chabert-Liddell commented May 3, 2024

Issue Type

Source

Giskard Library Version

Giskard Hub Version

OS Platform and Distribution

Python version

Installed python packages

Current Behaviour?

Standalone code OR list down the steps to reproduce the issue

Relevant log output

alexcombessie commented May 7, 2024

pierlj commented May 7, 2024