Add VDB runtime summary observability by jioffe502 · Pull Request #2198 · NVIDIA/NeMo-Retriever

jioffe502 · 2026-06-01T20:47:31Z

Summary

Stacked on #2040. The intended review surface is the final commit on jioffe502:codex/vdb-runtime-observability until #2040 lands.

Adds a VDB-neutral runtime summary block to pipeline runtime metrics so harness/session artifacts can report which backend target and retrieval mode were actually configured.

Add describe_vdb_runtime(...) for JSON-safe VDB runtime summaries
Include a vdb block beside the legacy vdb_op in graph pipeline runtime summaries
Normalize dense vs hybrid retrieval-mode metadata, including signals and uses_query_texts
Sanitize config values and omit execution-only query_texts
Cover helper behavior and CLI runtime summary output in tests

Testing

uv run --extra dev pytest -q tests/test_vdb_runtime.py tests/test_graph_pipeline_cli.py
uvx pre-commit run --all-files
Full JP20 smoke with local OpenAI-compatible embeddings stub: python -m nemo_retriever.examples.graph_pipeline /datasets/nv-ingest/jp20 --run-mode inprocess --evaluation-mode beir --vdb-kwargs-json ...hybrid=true...
- Processed 1,940 pages, ran BEIR over 115 queries, and persisted 1,884 LanceDB rows.
- Runtime summary included vdb.retrieval.mode = "hybrid", signals = ["dense_vector", "lexical_text"], and uses_query_texts = true.

…hybrid-search # Conflicts: # nemo_retriever/src/nemo_retriever/vdb/lancedb.py

…hybrid-search

jioffe502 and others added 7 commits May 14, 2026 21:25

Implement LanceDB hybrid retrieval

6410d75

Document hybrid query text ordering assumption

1bf9168

Address LanceDB hybrid review comments

d96dcf4

Merge branch 'main' into codex/lancedb-true-hybrid-search

89d07a3

Merge remote-tracking branch 'upstream/main' into codex/lancedb-true-…

892b49f

…hybrid-search # Conflicts: # nemo_retriever/src/nemo_retriever/vdb/lancedb.py

Merge remote-tracking branch 'upstream/main' into codex/lancedb-true-…

e965ae7

…hybrid-search

Tighten hybrid retrieval VDB contract

3965bfb

jioffe502 force-pushed the codex/vdb-runtime-observability branch from dc4a790 to 3f0022e Compare June 2, 2026 20:39

jioffe502 added 2 commits June 2, 2026 20:43

Clarify hybrid FTS index build

e3a5f30

Add VDB runtime summary observability

07e295a

jioffe502 force-pushed the codex/vdb-runtime-observability branch from 3f0022e to 07e295a Compare June 2, 2026 20:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add VDB runtime summary observability#2198

Add VDB runtime summary observability#2198
jioffe502 wants to merge 9 commits into
NVIDIA:mainfrom
jioffe502:codex/vdb-runtime-observability

jioffe502 commented Jun 1, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

jioffe502 commented Jun 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Testing

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

jioffe502 commented Jun 1, 2026 •

edited

Loading