added info for v100 runtime for model deployment by Milstein · Pull Request #307 · nerc-project/nerc-docs

Milstein · 2026-02-07T00:31:27Z

How to Use the NVIDIA V100 GPU Accelerator to Reduce Costs?

You can reduce costs by deploying your model on an NVIDIA V100 GPU.
To do this, select (V100 Support) vLLM NVIDIA GPU ServingRuntime for KServe as the Serving Runtime; it is customized to support the NVIDIA V100 GPU architecture. Then choose NVIDIA V100 GPU as the Accelerator and set the Number of accelerators to 1.
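
For readers who deploy from a manifest rather than the dashboard form, the same three choices (serving runtime, accelerator, accelerator count) might map onto a KServe InferenceService roughly as sketched below. This is only an illustration under assumptions: the runtime name `vllm-v100-runtime`, the model name, the storage URI, and the `vLLM` model format name are hypothetical placeholders, not values taken from this PR. The generic `nvidia.com/gpu` resource only requests a GPU count; scheduling onto V100 nodes specifically is normally handled by the accelerator selection in the dashboard, which this sketch does not reproduce.

```python
import json

# Hypothetical placeholder: use the resource name of the V100 serving runtime
# as it appears in your own project. This value is NOT from the documentation.
V100_RUNTIME_NAME = "vllm-v100-runtime"


def build_inference_service(name, storage_uri, runtime=V100_RUNTIME_NAME, gpus=1):
    """Return a KServe InferenceService manifest as a plain dict."""
    gpu_count = str(gpus)  # Kubernetes resource quantities are written as strings
    return {
        "apiVersion": "serving.kserve.io/v1beta1",
        "kind": "InferenceService",
        "metadata": {"name": name},
        "spec": {
            "predictor": {
                "model": {
                    # Assumed model format name; check the runtime's supported formats.
                    "modelFormat": {"name": "vLLM"},
                    "runtime": runtime,
                    "storageUri": storage_uri,
                    "resources": {
                        "requests": {"nvidia.com/gpu": gpu_count},
                        "limits": {"nvidia.com/gpu": gpu_count},
                    },
                }
            }
        },
    }


if __name__ == "__main__":
    # Hypothetical model name and storage location, for illustration only.
    manifest = build_inference_service("my-model", "s3://my-bucket/my-model")
    print(json.dumps(manifest, indent=2))
```

The printed JSON can be reviewed and adapted before applying it to a cluster; the single-GPU default mirrors the "Number of accelerators to 1" setting described above.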

added info for v100 runtime for model deployment

41f131e

Milstein requested a review from joachimweyl February 7, 2026 00:31

Milstein wants to merge 1 commit into main from v100_model_vllm-runtime-info

Milstein commented Feb 7, 2026
