Skip to content

align extraction links with quickstart, NeMo-Retriever paths, and API docs#2048

Merged
kheiss-uwzoo merged 4 commits into
NVIDIA:26.03from
kheiss-uwzoo:kheiss/6179241-batch1
May 18, 2026
Merged

align extraction links with quickstart, NeMo-Retriever paths, and API docs#2048
kheiss-uwzoo merged 4 commits into
NVIDIA:26.03from
kheiss-uwzoo:kheiss/6179241-batch1

Conversation

@kheiss-uwzoo
Copy link
Copy Markdown
Collaborator

@kheiss-uwzoo kheiss-uwzoo commented May 15, 2026

Summary

Documentation-only PR for NVBug 6179241 (extraction docs link fix, batch 1). Branch kheiss/6179241-batch1 is based on 26.03 with 4 commits; 13 files changed (+24 / −32 lines).

Link and navigation updates (11 extraction docs)

  • In-tree quickstart anchors — Replaced scattered nv-ingest / external notebook URLs with quickstart-guide.md#ingest_python_example and #ingest_cli_example where appropriate (audio.md, nemoretriever-parse.md, notebooks.md, nv-ingest-python-api.md, nv-ingest_cli.md, and related pages).
  • Official NVIDIA sources only — Removed third-party links (Gitee mirror, AI-App GitHub fork); restored github.com/NVIDIA/nv-ingest for image_viewer.py.
  • 26.03–pinned upstream paths — Air-gap and quickstart guides now reference NVIDIA/NeMo-Retriever 26.03 for compose/Helm (helm/ on that branch), including the raw GitHub Helm README used in the MIG section.
  • CLI reference — Tip points to the self-hosted quickstart anchor plus the 26.03 CLI companion notebook on NVIDIA/NeMo-Retriever (not release/25.3.0).
  • Support matrixnemotron-parse link updated to docs.api.nvidia.com.
  • UDF related topics — Example README link updated to NVIDIA/NeMo-Retriever main examples/udfs/README.md.

Review-driven cleanups

  • quickstart-guide.md — Removed self-referential !!! tip blocks that only linked to anchors on the same page; stripped accidental UTF-8 BOM so the H1 renders correctly in MkDocs.
  • python-api-reference.md — Tip aligned with nv-ingest-python-api.md (in-tree quickstart + API page cross-link).

CI / pre-commit

  • CONTRIBUTING.md and releasenotes-nv-ingest.md — Added missing EOF newlines so pre-commit run --all-files passes (end-of-file-fixer).

Base: 26.03 · Commits: 50fc8073, dde2c19d, 5f42a3fe, 4697db31

Files changed

Category Files
Deploy / air gap air-gapped-deployment.md, quickstart-guide.md
Client entry points nv-ingest-python-api.md, nv-ingest_cli.md, python-api-reference.md, cli-reference.md
Guides & matrix audio.md, nemoretriever-parse.md, notebooks.md, support-matrix.md, user-defined-functions.md
CI hygiene CONTRIBUTING.md, releasenotes-nv-ingest.md

Test plan

  • CI pre-commit (--all-files) passes on the PR.
  • mkdocs build (or docs CI) succeeds with no warnings for touched pages.
  • Spot-check internal links from notebooks.md, cli-reference.md, and python-api-reference.md to quickstart anchors.
  • Confirm air-gapped and quickstart links to NVIDIA/NeMo-Retriever 26.03 open in the browser (compose, helm/values.yaml, helm/README.md).
  • Confirm quickstart-guide.md has no BOM (first byte is #, not EF BB BF).
  • Verify image_viewer.py and nemotron-parse URLs resolve to official NVIDIA destinations.

Point notebook tips at the self-hosted quickstart anchors, refresh NeMo-Retriever Helm and compose URLs for air-gapped and quickstart flows, fix the raw GitHub helm README target, and tidy link targets across the extraction doc set.
@kheiss-uwzoo kheiss-uwzoo marked this pull request as ready for review May 15, 2026 22:17
@kheiss-uwzoo kheiss-uwzoo requested a review from a team as a code owner May 15, 2026 22:17
@kheiss-uwzoo kheiss-uwzoo requested review from jioffe502 and removed request for a team May 15, 2026 22:17
@greptile-apps
Copy link
Copy Markdown
Contributor

greptile-apps Bot commented May 15, 2026

Greptile Summary

This documentation-only PR updates cross-reference links across 11 extraction docs to align with the 26.03 self-hosted story: in-tree quickstart anchors replace scattered external notebook URLs, and NeMo-Retriever external links are pinned to the 26.03 branch instead of legacy release/26.3.0 refs or moving main refs.

  • Eight files (audio.md, nemoretriever-parse.md, nv-ingest-python-api.md, nv-ingest_cli.md, python-api-reference.md, cli-reference.md, notebooks.md) replace external nv-ingest notebook URLs with relative quickstart anchors, removing previously-flagged self-referential tips and third-party mirror links.
  • air-gapped-deployment.md and quickstart-guide.md replace release/26.3.0 / main NeMo-Retriever links with 26.03-branch URLs; support-matrix.md updates the nemotron-parse link to docs.api.nvidia.com; user-defined-functions.md moves the UDF example link from a pinned nv-ingest release to NeMo-Retriever/main.

Confidence Score: 5/5

Safe to merge — all changes are link and anchor-text updates with no code logic affected.

Every change is a documentation link correction or wording update. The in-tree quickstart anchor replacements are internally consistent, the 26.03-branch pins are the correct stable ref for this base branch, and the previously-flagged issues (Gitee mirror, AI-App fork, BOM, self-referential tips, version mismatches) have all been resolved in earlier commits on this branch.

No files require special attention; the style note on quickstart-guide.md and the moving-target main ref in user-defined-functions.md are optional editorial polish.

Important Files Changed

Filename Overview
docs/docs/extraction/quickstart-guide.md Helm chart link updated to NeMo-Retriever 26.03 branch; docker-compose link in air-gap section pinned to 26.03; self-referential tips removed; MIG section still uses a raw.githubusercontent link with the text raw.githubusercontent as the anchor.
docs/docs/extraction/air-gapped-deployment.md All three external links updated from release/26.3.0 to 26.03 branch; note text reworded to remove the main moves forward contradiction.
docs/docs/extraction/notebooks.md CLI and Python notebook links replaced with in-tree quickstart anchors; prose updated to reflect guides and notebooks.
docs/docs/extraction/support-matrix.md nemotron-parse link updated from docs.nvidia.com NIM example page to docs.api.nvidia.com API reference.
docs/docs/extraction/user-defined-functions.md UDF examples link changed from pinned nv-ingest release/26.1.2 to NeMo-Retriever main branch, which is a moving target.
docs/docs/extraction/audio.md Two Python tip links switched from external nv-ingest notebook URL to local quickstart-guide.md anchor.
docs/docs/extraction/cli-reference.md Tip now points at the 26.03-branch companion notebook and the in-tree CLI quickstart anchor.
docs/docs/extraction/nemoretriever-parse.md Two Python tip links updated to in-tree quickstart anchor, matching changes in audio.md.
docs/docs/extraction/nv-ingest-python-api.md Tip updated to reference quickstart Python anchor and Python API reference page instead of the external notebook URL.
docs/docs/extraction/nv-ingest_cli.md Tip updated to reference in-tree quickstart CLI anchor and CLI reference page instead of the external notebook URL.
docs/docs/extraction/python-api-reference.md Tip now links to in-tree quickstart Python anchor and the Python API page, replacing the previous external notebook URL.
CONTRIBUTING.md Missing newline added at end of file.
docs/docs/extraction/releasenotes-nv-ingest.md Missing newline added at end of file.

Flowchart

%%{init: {'theme': 'neutral'}}%%
flowchart TD
    A[Extraction docs tip / link] -->|old| B[External nv-ingest GitHub notebook URL]
    A -->|new| C[In-tree quickstart-guide.md anchor]
    C --> D["#ingest_python_example"]
    C --> E["#ingest_cli_example"]

    F[NeMo-Retriever external link] -->|old: release/26.3.0 or main| G[Legacy / moving ref]
    F -->|new| H[26.03 branch pin]

    H --> I[docker-compose.yaml]
    H --> J[helm/README.md]
    H --> K[helm/values.yaml]
    H --> L[cli_client_usage.ipynb]

    M[support-matrix.md] -->|nemotron-parse| N[docs.api.nvidia.com API reference]
    O[user-defined-functions.md] -->|UDF examples| P[NeMo-Retriever main branch]
Loading
Prompt To Fix All With AI
Fix the following 2 code review issues. Work through them one at a time, proposing concise fixes.

---

### Issue 1 of 2
docs/docs/extraction/user-defined-functions.md:943
**Moving-target link for versioned docs**

The "Related Topics" link was changed from the pinned `nv-ingest/release/26.1.2` ref to `NeMo-Retriever/main`, which advances continuously. If the UDF API or README changes between now and the next time a user reads the 26.03 docs, they'll follow a link to content that no longer matches what's described on the page. Consider pinning to `26.03` (matching the branch this PR targets) or to the equivalent `release/26.03` tag if one exists in NeMo-Retriever.

### Issue 2 of 2
docs/docs/extraction/quickstart-guide.md:531
**Meaningless anchor text on MIG section link**

The rendered link text `raw.githubusercontent` gives readers no context about the destination. Consider replacing it with a descriptive label such as `Helm README` so it reads "…expose the desired MIG profiles (see [Helm README](https://raw.githubusercontent.com/NVIDIA/NeMo-Retriever/26.03/helm/README.md))."

Reviews (4): Last reviewed commit: "chore: add missing EOF newlines for pre-..." | Re-trigger Greptile

Comment thread docs/docs/extraction/python-api-reference.md Outdated
Comment thread docs/docs/extraction/quickstart-guide.md Outdated
Comment thread docs/docs/extraction/cli-reference.md Outdated
Comment thread docs/docs/extraction/air-gapped-deployment.md Outdated
Comment thread docs/docs/extraction/quickstart-guide.md Outdated
… pins

Drop third-party notebook and fork URLs, pin NeMo Retriever air-gap and quickstart references to the 26.03 branch (including helm paths on that branch), point the CLI tip at the 26.03 companion notebook plus the quickstart anchor, and remove redundant same-page tips in the quickstart.
Comment thread docs/docs/extraction/quickstart-guide.md Outdated
Fix end-of-file-fixer failures in CONTRIBUTING.md and releasenotes-nv-ingest.md so CI pre-commit --all-files passes.
@kheiss-uwzoo kheiss-uwzoo added the doc Improvements or additions to documentation label May 18, 2026
## Related Topics

- [NV-Ingest UDF Examples](https://github.com/NVIDIA/nv-ingest/blob/release/26.1.2/examples/udfs/README.md)
- [NV-Ingest UDF Examples](https://github.com/NVIDIA/NeMo-Retriever/blob/main/examples/udfs/README.md)
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

remove nv-ingest references

Comment thread CONTRIBUTING.md
@@ -536,4 +536,4 @@ Portions adopted from

- [https://github.com/nv-morpheus/Morpheus/blob/branch-24.06/CONTRIBUTING.md](https://github.com/nv-morpheus/Morpheus/blob/branch-24.06/CONTRIBUTING.md)
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

remove morpheus references

@@ -319,10 +315,6 @@ image_caption:[]

### Using the `nv-ingest-cli`
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

remove nv-ingest references

!!! tip

There is a Jupyter notebook available to help you get started with the CLI. For more information, refer to [CLI Client Quick Start Guide](https://github.com/NVIDIA/NeMo-Retriever/blob/main/client/client_examples/examples/cli_client_usage.ipynb).
For CLI examples, see the [CLI section of the self-hosted quickstart](quickstart-guide.md#ingest_cli_example) or the [CLI companion notebook on the `26.03` branch](https://github.com/NVIDIA/NeMo-Retriever/blob/26.03/client/client_examples/examples/cli_client_usage.ipynb).
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why is this hardlinking a release branch?

For more Python examples, refer to the [Python Client Quick Start Guide](quickstart-guide.md#ingest_python_example).


## Use NVCF Endpoints for Cloud-Based Inference
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

NVCF== build.nvidia.com

## Helm charts and packaging artifacts

From a connected environment, download and version-control the chart archive you install, for example (see [NV-Ingest Helm README](https://github.com/NVIDIA/NeMo-Retriever/blob/release/26.3.0/helm/README.md)):
From a connected environment, download and version-control the chart archive you install, for example (see [NV-Ingest Helm README](https://github.com/NVIDIA/NeMo-Retriever/blob/26.03/helm/README.md)):
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

remove nv-ingest references

@kheiss-uwzoo kheiss-uwzoo merged commit 63595d6 into NVIDIA:26.03 May 18, 2026
4 of 5 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

doc Improvements or additions to documentation

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants