Fixed: NameError during evaluation of llamaindex query engine #2331

Prigoistic · 2025-09-30T15:39:26Z

Issue Link / Problem Description

Fixes #2330
Evaluating a LlamaIndex query engine raised a runtime NameError: EvaluationResult not defined, because it was imported only under t.TYPE_CHECKING. Intermittent LlamaIndex execution failures also led to IndexError during result collection due to mismatched lengths.
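The failure mode described above can be reproduced in isolation. The sketch below uses `fractions.Fraction` as a stand-in for `EvaluationResult` (an assumption made purely for illustration; the function names are hypothetical): a name imported only under `t.TYPE_CHECKING` is visible to type checkers but does not exist at runtime.

```python
import typing as t

if t.TYPE_CHECKING:
    # Seen by type checkers only; this import never executes at runtime.
    from fractions import Fraction


def make_half() -> "Fraction":
    # The string annotation is harmless, but calling Fraction needs the real name.
    return Fraction(1, 2)


def make_half_fixed():
    # Runtime import, analogous to importing EvaluationResult at runtime.
    from fractions import Fraction

    return Fraction(1, 2)


try:
    make_half()
    raised = False
    message = ""
except NameError as exc:
    raised = True
    message = str(exc)

half = make_half_fixed()
```

Here `make_half()` raises NameError because `Fraction` was never bound at runtime, while `make_half_fixed()` succeeds.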

Changes Made

Import EvaluationResult at runtime from ragas.dataset_schema in src/ragas/integrations/llama_index.py.
Make response/context collection robust:
- Handle failed executor jobs (NaN placeholders) by inserting empty response/context to maintain alignment with dataset size.
- Prevent IndexError during dataset augmentation.
Add light defensive checks so that evaluation stays stable even when some query-engine calls fail.
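A minimal sketch of the alignment idea, not the actual code in llama_index.py: `FakeResponse` and `collect` are hypothetical names, and the sketch assumes that failed jobs arrive as NaN floats in the results list. Every input position produces exactly one response and one context entry, so indexing by dataset row stays valid.

```python
import math
import typing as t
from dataclasses import dataclass, field


@dataclass
class FakeResponse:
    """Hypothetical stand-in for a query-engine response."""

    response: t.Optional[str]
    source_texts: t.List[str] = field(default_factory=list)


def collect(results):
    """Return (responses, contexts), each the same length as results."""
    responses, contexts = [], []
    for r in results:
        if isinstance(r, float) and math.isnan(r):
            # Failed job: keep the slot so later rows do not shift.
            responses.append("")
            contexts.append([])
        else:
            responses.append(r.response or "")
            contexts.append(list(r.source_texts))
    return responses, contexts


results = [FakeResponse("a", ["c1"]), float("nan"), FakeResponse(None)]
responses, contexts = collect(results)
```

Skipping the failed entry instead would leave two responses for three dataset rows, which is the kind of length mismatch that surfaces as an IndexError later.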

Testing

Automated tests added/updated

How to Test

Manual testing steps:

Install for local dev: uv run pip install -e . -e ./examples
Follow the LlamaIndex integration guide (docs) to set up a query_engine and an EvaluationDataset.
Ensure LlamaIndex LLM is configured with n=1 (or unset) to avoid “n values greater than 1 not support” warnings.
Run an evaluation that previously failed; it should complete without the NameError and without IndexError during result collection.
Optional: run lints: uv run ruff check .

References

Related issues: #2330
Documentation: LlamaIndex integration how-to (link)

Screenshots/Examples (if applicable)

N/A

jjmachan · 2025-10-03T04:03:47Z

hey @Prigoistic I've fixed the CI - could you take a look and see if everything looks good?
we'll merge it in after that 🙂

anistark · 2025-10-03T07:50:16Z

src/ragas/integrations/llama_index.py

-        retrieved_contexts.append([n.node.text for n in r.source_nodes])
+        # Handle failed jobs which are recorded as NaN in the executor
+        if isinstance(r, float) and math.isnan(r):
+            responses.append("")


I think it's better to fail loudly than silently.

If we still need to pass through, better to keep None. The later metrics can skip None or handle them explicitly.

responses.append(None)
retrieved_contexts.append(None)
logger.warning(f"Query engine failed for query {i}: '{queries[i]}'")
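One way to read this suggestion, sketched with hypothetical helper names (`collect_responses`, `usable_rows`) and the assumption that a failed job shows up as a NaN float: record None for the failed slot, log a warning, and let downstream code decide explicitly whether to skip those rows.

```python
import logging
import math

logger = logging.getLogger("llama_index_sketch")  # hypothetical logger name


def collect_responses(queries, results):
    """results[i] is a response string, or NaN for a failed job (assumed)."""
    responses = []
    for i, r in enumerate(results):
        if isinstance(r, float) and math.isnan(r):
            responses.append(None)
            logger.warning("Query engine failed for query %d: %r", i, queries[i])
        else:
            responses.append(r)
    return responses


def usable_rows(queries, responses):
    """Drop rows whose response is None, keeping query/response pairs together."""
    return [(q, r) for q, r in zip(queries, responses) if r is not None]


queries = ["q0", "q1", "q2"]
collected = collect_responses(queries, ["a", float("nan"), "c"])
kept = usable_rows(queries, collected)
```

Unlike an empty string, None cannot be mistaken for a genuine empty answer, so a failure stays visible to whatever consumes the results.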

anistark · 2025-10-03T08:04:27Z

src/ragas/integrations/llama_index.py

+            retrieved_contexts.append([])
+        else:
+            # Cast to LlamaIndex Response type for proper type checking
+            response = t.cast("LlamaIndexResponse", r)


This'll be hard on type hints.

Probably better to import the type directly: from llama_index.core.base.response.schema import Response as LlamaIndexResponse
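For context on the cast, here is a small sketch with a hypothetical stand-in class `Response` (not the LlamaIndex one): `typing.cast` performs no runtime check and returns its argument unchanged, so a string type name is never resolved when the code runs. A real class in scope, by contrast, can be used in an `isinstance` guard.

```python
import typing as t


class Response:
    """Hypothetical stand-in for the LlamaIndex Response class."""

    def __init__(self, response):
        self.response = response


failed = float("nan")

# t.cast only informs the type checker; at runtime it returns the same
# object, and the string "LlamaIndexResponse" is never looked up.
same_object = t.cast("LlamaIndexResponse", failed)


def response_text(result):
    """Return the response text only when result really is a Response."""
    if isinstance(result, Response):
        return result.response
    return None


text = response_text(Response("hi"))
missing = response_text(failed)
```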

anistark · 2025-10-03T08:06:01Z

src/ragas/integrations/llama_index.py

+        else:
+            # Cast to LlamaIndex Response type for proper type checking
+            response = t.cast("LlamaIndexResponse", r)
+            responses.append(response.response or "")


Make this more explicit?

responses.append(response.response if response.response is not None else "")
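A short comparison of the two spellings, using hypothetical helper names. Assuming `response.response` is either a string or None, both produce the same result; they differ only for other falsy values, which is where the explicit form states the intent more clearly.

```python
def with_or(value):
    # Replaces every falsy value (None, "", 0, []) with "".
    return value or ""


def explicit(value):
    # Replaces only None with "".
    return value if value is not None else ""


# Identical for the str-or-None case.
none_case = (with_or(None), explicit(None))
empty_case = (with_or(""), explicit(""))
text_case = (with_or("answer"), explicit("answer"))

# Different for a non-string falsy value such as 0.
zero_case = (with_or(0), explicit(0))
```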

Prigoistic · 2025-10-11T16:37:14Z

@jjmachan yes everything looks good to me

Prigoistic · 2025-10-11T16:38:42Z

I see no conflicts so far and all the checks have passed too, so you can go ahead and merge this :)

anistark · 2025-10-11T16:53:17Z

@Prigoistic I don't see any changes to the comments.

Prigoistic · 2025-10-11T16:57:45Z

@anistark oh shoot i forgot to push the changes gimme a sec

Prigoistic · 2025-10-11T18:27:27Z

@anistark pushed the changes as per the comments :) please check it once

Fixed: NameError during evaluation of llamaindex query engine

1e1f601

dosubot bot added the size:S label (This PR changes 10-29 lines, ignoring generated files.) Sep 30, 2025

jjmachan added 4 commits October 1, 2025 08:05

fixed formatting

13f87ec

Merge branch 'main' into fix-2330-llamaindex-query-engine-nameerror

c93cf60

Merge branch 'main' into fix-2330-llamaindex-query-engine-nameerror

553b397

fixed type issues

bbd45b1

anistark reviewed Oct 3, 2025

Fix: Handle None responses in llama_index integration and fix import …

77f34f5

…errors

dosubot bot added the size:M label (This PR changes 30-99 lines, ignoring generated files.) and removed the size:S label (This PR changes 10-29 lines, ignoring generated files.) Oct 11, 2025

anistark reviewed Oct 11, 2025

examples/ragas_examples/__init__.py (resolved)

examples/ragas_examples/benchmark_llm/evals.py (resolved)

src/ragas/integrations/llama_index.py (resolved)

Resolved : Optional issue in llama_index.py and reverted back removal…

d5319f4

… of import in eval.py

anistark reviewed Oct 11, 2025

examples/ragas_examples/benchmark_llm/evals.py (outdated, resolved)

Resolved ragas.experiment issue

f1d9061

dosubot bot added the size:S label (This PR changes 10-29 lines, ignoring generated files.) and removed the size:M label (This PR changes 30-99 lines, ignoring generated files.) Oct 11, 2025
