Add VDB runtime summary observability #2198

Draft
jioffe502 wants to merge 9 commits into NVIDIA:main from jioffe502:codex/vdb-runtime-observability

jioffe502 (Collaborator) commented Jun 1, 2026

Summary

Stacked on #2040. The intended review surface is the final commit on jioffe502:codex/vdb-runtime-observability until #2040 lands.

Adds a VDB-neutral runtime summary block to pipeline runtime metrics so harness/session artifacts can report which backend target and retrieval mode were actually configured.

  • Add describe_vdb_runtime(...) for JSON-safe VDB runtime summaries
  • Include a vdb block beside the legacy vdb_op in graph pipeline runtime summaries
  • Normalize dense vs hybrid retrieval-mode metadata, including signals and uses_query_texts
  • Sanitize config values and omit execution-only query_texts
  • Cover helper behavior and CLI runtime summary output in tests
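
To make the bullets above concrete, here is a minimal sketch of what a summary helper along these lines might look like. It is not the code from this PR: the function name `sketch_describe_vdb_runtime`, its signature, the `backend` and `config` keys, and the redaction rule are all assumptions made for illustration. Only the `hybrid` kwarg, the omitted `query_texts` key, and the `mode`, `signals`, and `uses_query_texts` fields come from the description. The dense-mode values are likewise a guess.

```python
import json
from typing import Any, Mapping

# Only "query_texts" is named in the PR description as execution-only.
EXECUTION_ONLY_KEYS = {"query_texts"}
# Hypothetical redaction rule: treat keys containing these markers as secrets.
SECRET_MARKERS = ("key", "token", "password", "secret")


def _json_safe(value: Any) -> Any:
    """Coerce a value into something json.dumps accepts."""
    if value is None or isinstance(value, (str, int, float, bool)):
        return value
    if isinstance(value, Mapping):
        return {str(k): _json_safe(v) for k, v in value.items()}
    if isinstance(value, (list, tuple)):
        return [_json_safe(v) for v in value]
    # Anything else (paths, clients, callables) is reduced to its repr.
    return repr(value)


def sketch_describe_vdb_runtime(vdb_op: str, vdb_kwargs: Mapping[str, Any]) -> dict:
    """Build a JSON-safe summary of the configured VDB target and retrieval mode."""
    hybrid = bool(vdb_kwargs.get("hybrid", False))

    config = {}
    for key, value in vdb_kwargs.items():
        if key in EXECUTION_ONLY_KEYS:
            continue  # execution-only input, not configuration
        if any(marker in key.lower() for marker in SECRET_MARKERS):
            config[key] = "<redacted>"
        else:
            config[key] = _json_safe(value)

    # Assumed dense-mode values: one signal, no query texts.
    signals = ["dense_vector", "lexical_text"] if hybrid else ["dense_vector"]
    return {
        "backend": vdb_op,  # hypothetical key name
        "config": config,   # hypothetical key name
        "retrieval": {
            "mode": "hybrid" if hybrid else "dense",
            "signals": signals,
            "uses_query_texts": hybrid,
        },
    }


summary = sketch_describe_vdb_runtime(
    "lancedb",
    {"hybrid": True, "uri": "/tmp/db", "api_key": "abc", "query_texts": ["q"]},
)
print(json.dumps(summary, indent=2))
```

Keeping the retrieval block separate from the sanitized config is one way to let harness artifacts compare runs by mode without parsing backend-specific kwargs.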

Testing

  • uv run --extra dev pytest -q tests/test_vdb_runtime.py tests/test_graph_pipeline_cli.py
  • uvx pre-commit run --all-files
  • Full JP20 smoke with local OpenAI-compatible embeddings stub: python -m nemo_retriever.examples.graph_pipeline /datasets/nv-ingest/jp20 --run-mode inprocess --evaluation-mode beir --vdb-kwargs-json ...hybrid=true...
    • Processed 1,940 pages, ran BEIR over 115 queries, and persisted 1,884 LanceDB rows.
    • Runtime summary included vdb.retrieval.mode = "hybrid", signals = ["dense_vector", "lexical_text"], and uses_query_texts = true.
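
The smoke-run observation above could be checked mechanically with something like the sketch below. It assumes the runtime summary has already been loaded into a dict and that `signals` and `uses_query_texts` sit next to `mode` under `vdb.retrieval`. That nesting and the helper name `check_hybrid_retrieval` are assumptions, not confirmed by this PR.

```python
from typing import Any, Mapping

# Values reported for the hybrid JP20 smoke run in this PR description.
EXPECTED_HYBRID = {
    "mode": "hybrid",
    "signals": ["dense_vector", "lexical_text"],
    "uses_query_texts": True,
}


def check_hybrid_retrieval(runtime_summary: Mapping[str, Any]) -> list[str]:
    """Return human-readable mismatches; an empty list means the block matches."""
    retrieval = runtime_summary.get("vdb", {}).get("retrieval", {})
    problems = []
    for key, want in EXPECTED_HYBRID.items():
        got = retrieval.get(key)
        if got != want:
            problems.append(f"vdb.retrieval.{key}: expected {want!r}, got {got!r}")
    return problems


# Example with a hand-written summary dict standing in for a real artifact.
example = {"vdb": {"retrieval": dict(EXPECTED_HYBRID)}}
print(check_hybrid_retrieval(example))
```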

jioffe502 force-pushed the codex/vdb-runtime-observability branch from dc4a790 to 3f0022e on June 2, 2026 at 20:39
jioffe502 force-pushed the codex/vdb-runtime-observability branch from 3f0022e to 07e295a on June 2, 2026 at 20:43