Skip to content

ci: List bundled codecs#2830

Draft
kajalj22 wants to merge 10 commits into
mainfrom
list-bundled-codecs
Draft

ci: List bundled codecs#2830
kajalj22 wants to merge 10 commits into
mainfrom
list-bundled-codecs

Conversation

@kajalj22

Copy link
Copy Markdown
Contributor

What does this PR do ?

tools: add list_codecs.sh to enumerate all bundled codecs

Summary

  • Adds tools/list_codecs.sh to enumerate all bundled codecs in the container
  • Covers video/audio (ffmpeg, GStreamer), compression (stdlib, blosc, numcodecs, pyarrow, zarr, h5py), and image/video (Pillow, OpenCV, imagecodecs, imageio, PyAV, decord, tifffile)
  • For each library: shows install path, pip location, and reverse-dependency tree (which package pulled it in)

Test plan

  • Run bash tools/list_codecs.sh inside the RL container

kajalj22 and others added 2 commits June 15, 2026 15:57
Lists video/audio (ffmpeg, GStreamer), compression (blosc, numcodecs,
pyarrow, zarr, h5py, stdlib), and image/video (Pillow, OpenCV,
imagecodecs, imageio, PyAV, decord, tifffile) codecs in the container.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Signed-off-by: Kajal Jain <kajalj@nvidia.com>
For each library, also prints the file path it was loaded from,
which pip package installed it, and the reverse-dependency tree
(via pipdeptree if available) showing what pulled it in.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Signed-off-by: Kajal Jain <kajalj@nvidia.com>
@kajalj22 kajalj22 requested a review from a team as a code owner June 15, 2026 21:17
@copy-pr-bot

copy-pr-bot Bot commented Jun 15, 2026

Copy link
Copy Markdown

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@kajalj22 kajalj22 marked this pull request as draft June 15, 2026 21:18
@kajalj22 kajalj22 changed the title List bundled codecs ci: List bundled codecs Jun 15, 2026
kajalj22 and others added 8 commits June 15, 2026 18:41
srun containers have a minimal PATH so python3/ffmpeg aren't always
found. Auto-detect both across common install locations at startup
and error-out clearly if Python is missing entirely.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Signed-off-by: Kajal Jain <kajalj@nvidia.com>
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Signed-off-by: Kajal Jain <kajalj@nvidia.com>
- Add ffmpeg -buildconf output (configure flags, required for OSRB)
- Scan all *.libs/ dirs under site-packages for bundled .so files
  (catches libx264/libx265/libavcodec bundled inside PyAV wheel etc.)
- Flag royalty-bearing codec hits (AAC, H.264, H.265 and variants)
  across every section as they are found
- Add ROYALTY-BEARING CODEC SUMMARY section at the end, ready to
  paste into OSRB submission point 12

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Signed-off-by: Kajal Jain <kajalj@nvidia.com>
Replace manual Python interpreter detection with uv run python,
which picks up the correct venv automatically.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Signed-off-by: Kajal Jain <kajalj@nvidia.com>
uv is not installed in the RL container. Use /opt/nemo_rl_venv/bin/python
directly, with fallbacks to other common locations.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Signed-off-by: Kajal Jain <kajalj@nvidia.com>
/opt/nemo_rl_venv/bin/python3.13 is the primary Python version for RL.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Signed-off-by: Kajal Jain <kajalj@nvidia.com>
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Signed-off-by: Kajal Jain <kajalj@nvidia.com>
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Signed-off-by: Kajal Jain <kajalj@nvidia.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant