
Conversation


@are-ces are-ces commented Dec 9, 2025

Description

Added parsing of rag_chunks and attached them to TurnSummary so that the RAG chunks are stored in the transcript.
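
For illustration, each stored chunk carries content, source, and score. A minimal sketch, assuming the RAGChunk model from utils.types discussed later in this thread; the values are invented:

from utils.types import RAGChunk

# Hypothetical example of one stored chunk; the field values are made up.
chunk = RAGChunk(
    content="The operator can be installed via OLM ...",
    source="file_search",
    score=0.91,
)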

Type of change

  • Refactor
  • New feature
  • Bug fix
  • CVE fix
  • Optimization
  • Documentation Update
  • Configuration Update
  • Bump-up service version
  • Bump-up dependent library
  • Bump-up library or tool used for development (does not change the final image)
  • CI configuration change
  • Konflux configuration change
  • Unit tests improvement
  • Integration tests improvement
  • End to end tests improvement

Tools used to create PR

Identify any AI code assistants used in this PR (for transparency and review context)

  • Assisted-by: Claude

Related Tickets & Documents

  • Related Issue # LCORE-1094
  • Closes # LCORE-1094

Checklist before requesting a review

  • I have performed a self-review of my code.
  • PR has passed all pre-merge test jobs.
  • If it is a core feature, I have added thorough tests.

Testing

  • Set up LCS with a RAG database, ran a query against it, and checked that the transcript stores the RAG chunks.

Summary by CodeRabbit

  • New Features
    • Responses now extract and attach RAG chunks (content, source, relevance) from tool results so conversation summaries include source-attributed snippets.
  • Bug Fixes
    • Retrieval flow now populates referenced RAG chunks instead of leaving them empty, improving source visibility.
  • Tests
    • Integration and unit tests updated to exercise richer response structures and validate RAG chunk extraction and referenced-doc listings.



coderabbitai bot commented Dec 9, 2025

Walkthrough

Added a parser parse_rag_chunks_from_responses_api() that extracts RAG chunks from Responses API outputs and wired it into retrieve_response() so TurnSummary.rag_chunks is populated. Also updated imports and expanded unit/integration tests to cover the new parsing and richer tool-result payloads.
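
As a rough usage sketch (not the PR's actual code), the parser can be exercised against a hand-built Responses API shape; the import path and the SimpleNamespace scaffolding are assumptions:

from types import SimpleNamespace

# Assumed import path for the new parser; adjust to the actual module layout.
from app.endpoints.query_v2 import parse_rag_chunks_from_responses_api

# Hand-built response containing one file_search_call output item.
fake_result = SimpleNamespace(text="Operator installation steps ...", score=0.87)
fake_output_item = SimpleNamespace(type="file_search_call", results=[fake_result])
fake_response = SimpleNamespace(output=[fake_output_item])

chunks = parse_rag_chunks_from_responses_api(fake_response)
# Assuming the parser returns RAGChunk objects (see the review fix below),
# each chunk carries content, source ("file_search"), and score — the values
# that retrieve_response() stores on TurnSummary.rag_chunks.
print(chunks[0].content, chunks[0].source, chunks[0].score)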

Changes

  • Responses API handler (src/app/endpoints/query_v2.py): Added parse_rag_chunks_from_responses_api(response_obj: Any) -> list[RAGChunk]. Updated retrieve_response() to call this parser and assign the result to TurnSummary.rag_chunks. Adjusted import ordering to include RAGChunk.
  • Unit tests for parsing (tests/unit/app/endpoints/test_query_v2.py): Replaced inline mocks with helpers for citation/file-search outputs; updated test_retrieve_response_parses_referenced_documents to validate extracted RAG chunks (content, source, score) and expanded referenced-docs assertions.
  • Integration test update (tests/integration/endpoints/test_query_v2_integration.py): Enhanced the mocked tool result in test_query_v2_endpoint_with_tool_calls to include a text attribute (richer tool payload).

Sequence Diagram(s)

(Skipped — change is localized parsing and assignment without multi-component sequential flow warranting a diagram.)

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

Suggested reviewers

  • asamal4
  • tisnik
🚥 Pre-merge checks: ✅ 3 passed
  • Description Check: ✅ Passed. Check skipped: CodeRabbit's high-level summary is enabled.
  • Title check: ✅ Passed. The title accurately captures the main change: adding RAG chunk parsing that was previously missing, matching the core objective of fixing the bug where RAG chunks were not being parsed.
  • Docstring Coverage: ✅ Passed. Docstring coverage is 100.00%, which is sufficient; the required threshold is 80.00%.



@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 1

♻️ Duplicate comments (1)
src/app/endpoints/query_v2.py (1)

422-429: Fix the parser function to return RAGChunk objects.

The integration of RAG chunk parsing is correct, but the parse_rag_chunks_from_responses_api function returns a list of dicts instead of list[RAGChunk] objects, causing the TurnSummary validation to fail.

See the detailed fix in the review comment for lines 455-482.

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 4269f0b and ad277ff.

📒 Files selected for processing (1)
  • src/app/endpoints/query_v2.py (3 hunks)
🧰 Additional context used
📓 Path-based instructions (3)
src/**/*.py

📄 CodeRabbit inference engine (CLAUDE.md)

src/**/*.py: Use absolute imports for internal modules in LCS project (e.g., from auth import get_auth_dependency)
All modules must start with descriptive docstrings explaining their purpose
Use logger = logging.getLogger(__name__) pattern for module logging
All functions must include complete type annotations for parameters and return types, using modern syntax (str | int) and Optional[Type] or Type | None
All functions must have docstrings with brief descriptions following Google Python docstring conventions
Function names must use snake_case with descriptive, action-oriented names (get_, validate_, check_)
Avoid in-place parameter modification anti-patterns; return new data structures instead of modifying input parameters
Use async def for I/O operations and external API calls
All classes must include descriptive docstrings explaining their purpose following Google Python docstring conventions
Class names must use PascalCase with descriptive names and standard suffixes: Configuration for config classes, Error/Exception for exceptions, Resolver for strategy patterns, Interface for abstract base classes
Abstract classes must use ABC with @abstractmethod decorators
Include complete type annotations for all class attributes in Python classes
Use import logging and module logger pattern with standard log levels: debug, info, warning, error

Files:

  • src/app/endpoints/query_v2.py
src/app/endpoints/**/*.py

📄 CodeRabbit inference engine (CLAUDE.md)

Use FastAPI HTTPException with appropriate status codes for API endpoint error handling

Files:

  • src/app/endpoints/query_v2.py
src/**/{client,app/endpoints/**}.py

📄 CodeRabbit inference engine (CLAUDE.md)

Handle APIConnectionError from Llama Stack in integration code

Files:

  • src/app/endpoints/query_v2.py
🧬 Code graph analysis (1)
src/app/endpoints/query_v2.py (2)
src/utils/suid.py (2)
  • normalize_conversation_id (101-122)
  • to_llama_stack_conversation_id (125-145)
src/utils/types.py (1)
  • TurnSummary (135-220)
🪛 GitHub Actions: Integration tests
src/app/endpoints/query_v2.py

[error] 425-425: TurnSummary validation failed: rag_chunks[0].content must be a string. Received MagicMock during test_query_v2_endpoint_with_tool_calls. Traceback points to query_v2.py:425.

🪛 GitHub Actions: Unit tests
src/app/endpoints/query_v2.py

[error] 476-476: AttributeError: 'dict' object has no attribute 'text' while parsing rag chunks from responses API in parse_rag_chunks_from_responses_api. Expected items to have 'text' and 'score' attributes (triggered during tests/unit/app/endpoints/test_query_v2.py::test_retrieve_response_parses_referenced_documents).

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (6)
  • GitHub Check: build-pr
  • GitHub Check: Konflux kflux-prd-rh02 / lightspeed-stack-on-pull-request
  • GitHub Check: E2E: server mode / azure
  • GitHub Check: E2E: server mode / ci
  • GitHub Check: E2E: library mode / ci
  • GitHub Check: E2E: library mode / azure
🔇 Additional comments (1)
src/app/endpoints/query_v2.py (1)

46-46: LGTM! Import consolidation improves organization.

The import reorganization properly consolidates related utility functions into a single import statement.

Comment on lines 455 to 482
def parse_rag_chunks_from_responses_api(response_obj: Any) -> list:
    """
    Extract rag_chunks from the llama-stack OpenAI response.

    Args:
        response_obj: The ResponseObject from OpenAI compatible response API in llama-stack.

    Returns:
        List of rag chunk dicts with content, source, score
    """
    rag_chunks = []

    for output_item in response_obj.output:
        if (
            hasattr(output_item, "type")
            and output_item.type == "file_search_call"
            and hasattr(output_item, "results")
        ):

            for result in output_item.results:
                rag_chunk = {
                    "content": result.text,
                    "source": "file_search",
                    "score": result.score,
                }
                rag_chunks.append(rag_chunk)

    return rag_chunks

⚠️ Potential issue | 🔴 Critical

Fix AttributeError and type mismatch issues in RAG chunk parsing.

The function has critical issues confirmed by pipeline failures:

  1. Line 476: result.text causes AttributeError: 'dict' object has no attribute 'text' when result is a dict.
  2. Return type: Returns list[dict] but TurnSummary.rag_chunks expects list[RAGChunk] objects, causing validation failures.
  3. Type annotation: Incomplete return type -> list violates coding guidelines requiring complete type annotations.

The function must handle both dict and object access patterns like other functions in this file (e.g., _build_tool_call_summary and parse_referenced_documents_from_responses_api).

Apply this diff to fix all issues:

+from utils.types import RAGChunk
+
 # ... (in the imports section)

-def parse_rag_chunks_from_responses_api(response_obj: Any) -> list:
+def parse_rag_chunks_from_responses_api(response_obj: Any) -> list[RAGChunk]:
     """
     Extract rag_chunks from the llama-stack OpenAI response.
 
     Args:
         response_obj: The ResponseObject from OpenAI compatible response API in llama-stack.
 
     Returns:
-        List of rag chunk dicts with content, source, score
+        List of RAGChunk objects with content, source, and score
     """
-    rag_chunks = []
+    rag_chunks: list[RAGChunk] = []
 
     for output_item in response_obj.output:
         if (
             hasattr(output_item, "type")
             and output_item.type == "file_search_call"
             and hasattr(output_item, "results")
         ):
-
             for result in output_item.results:
-                rag_chunk = {
-                    "content": result.text,
-                    "source": "file_search",
-                    "score": result.score,
-                }
-                rag_chunks.append(rag_chunk)
+                # Handle both dict and object access patterns
+                if isinstance(result, dict):
+                    content = result.get("text", "")
+                    score = result.get("score")
+                else:
+                    content = getattr(result, "text", "")
+                    score = getattr(result, "score", None)
+                
+                if content:  # Only add if content exists
+                    rag_chunks.append(
+                        RAGChunk(
+                            content=content,
+                            source="file_search",
+                            score=score,
+                        )
+                    )
 
     return rag_chunks

@are-ces are-ces marked this pull request as draft December 9, 2025 13:08
@are-ces are-ces force-pushed the rag-chunks-fix branch 3 times, most recently from 0411286 to 5eae3bf, on December 9, 2025 at 13:48
@are-ces are-ces marked this pull request as ready for review January 7, 2026 13:22

onmete commented Jan 7, 2026

/lgtm


@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 0

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)
tests/unit/app/endpoints/test_query_v2.py (1)

851-875: Missing conversations.create mock will cause test failure.

The test is missing a mock for mock_client.conversations.create, which is required by retrieve_response when no conversation_id is provided (as in line 871). All other similar tests in this file include this mock (see lines 120, 163, 240, 292, 334, etc.).

🔧 Add the missing mock
 async def test_retrieve_response_parses_referenced_documents(
     mocker: MockerFixture,
 ) -> None:
     """Test that retrieve_response correctly parses referenced documents from response."""
     mock_client = mocker.AsyncMock()
 
     # Create output items using helper functions
     output_item_1 = _create_message_output_with_citations(mocker)
     output_item_2 = _create_file_search_output(mocker)
 
     response_obj = mocker.Mock()
     response_obj.id = "resp-docs"
     response_obj.output = [output_item_1, output_item_2]
     response_obj.usage = None
 
     mock_client.responses.create = mocker.AsyncMock(return_value=response_obj)
+    # Mock conversations.create for new conversation creation
+    mock_conversation = mocker.Mock()
+    mock_conversation.id = "conv_abc123def456"
+    mock_client.conversations.create = mocker.AsyncMock(return_value=mock_conversation)
     mock_vector_stores = mocker.Mock()
     mock_vector_stores.data = []
     mock_client.vector_stores.list = mocker.AsyncMock(return_value=mock_vector_stores)
     mock_client.shields.list = mocker.AsyncMock(return_value=[])
🧹 Nitpick comments (1)
tests/unit/app/endpoints/test_query_v2.py (1)

851-851: Consider using consistent mock pattern.

Line 851 uses mocker.AsyncMock() directly, while other tests in this file use mocker.Mock() and explicitly set async methods (e.g., lines 111, 156, 206). While functional, consistency improves maintainability.

♻️ Use consistent mock pattern
-    mock_client = mocker.AsyncMock()
+    mock_client = mocker.Mock()
📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between ad277ff and 4cd0215.

📒 Files selected for processing (3)
  • src/app/endpoints/query_v2.py
  • tests/integration/endpoints/test_query_v2_integration.py
  • tests/unit/app/endpoints/test_query_v2.py
🚧 Files skipped from review as they are similar to previous changes (1)
  • src/app/endpoints/query_v2.py
🧰 Additional context used
📓 Path-based instructions (2)
tests/{unit,integration}/**/*.py

📄 CodeRabbit inference engine (CLAUDE.md)

tests/{unit,integration}/**/*.py: Use pytest for all unit and integration tests; do not use unittest framework
Unit tests must achieve 60% code coverage; integration tests must achieve 10% coverage

Files:

  • tests/integration/endpoints/test_query_v2_integration.py
  • tests/unit/app/endpoints/test_query_v2.py
tests/**/*.py

📄 CodeRabbit inference engine (CLAUDE.md)

Use pytest-mock with AsyncMock objects for mocking in tests

Files:

  • tests/integration/endpoints/test_query_v2_integration.py
  • tests/unit/app/endpoints/test_query_v2.py
🔇 Additional comments (4)
tests/integration/endpoints/test_query_v2_integration.py (1)

350-350: LGTM! Text attribute supports RAG chunk extraction.

The addition of the text attribute to the mock result aligns with the new RAG chunk parsing functionality, which extracts content from file search results.
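
For orientation, such an enriched mock result might look roughly like this; the actual construction in the test is not shown in this thread, so the snippet is an assumption:

# Hypothetical shape of the enriched mock tool result; the real test may differ.
mock_result = mocker.Mock()
mock_result.text = "chunk text returned by file search"  # new attribute read by the RAG chunk parser
mock_result.score = 0.75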

tests/unit/app/endpoints/test_query_v2.py (3)

790-816: LGTM! Helper function improves test maintainability.

The extraction of citation setup logic into a reusable helper function follows good testing practices and makes the test more readable.


818-844: LGTM! Helper function properly structures file search results.

The helper creates well-structured mock file search results with all necessary attributes (text, score, filename, etc.) to support RAG chunk extraction testing.
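
A sketch of what a helper along these lines could look like; the real _create_file_search_output in the test file may differ in details:

def _create_file_search_output(mocker):
    """Build a mock file_search_call output item with one scored result (sketch only)."""
    result = mocker.Mock()
    result.text = "RAG chunk content"
    result.score = 0.42
    result.filename = "docs/install.md"

    output_item = mocker.Mock()
    output_item.type = "file_search_call"
    output_item.results = [result]
    return output_item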


898-905: LGTM! RAG chunks validation is comprehensive.

The new assertions properly validate that RAG chunks are extracted from file_search_call results, checking content, source, and score for each chunk.
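
Illustrative assertions in that spirit (not the verbatim test code):

# `summary` stands for the TurnSummary produced by retrieve_response in the test;
# the expected values match the hypothetical helper sketched above.
assert len(summary.rag_chunks) == 1
chunk = summary.rag_chunks[0]
assert chunk.content == "RAG chunk content"
assert chunk.source == "file_search"
assert chunk.score == 0.42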


@tisnik tisnik left a comment


LGTM

@tisnik tisnik merged commit 2adb747 into lightspeed-core:main Jan 7, 2026
21 of 25 checks passed
