Skip to content

added info for v100 runtime for model deployment#307

Open
Milstein wants to merge 1 commit intomainfrom
v100_model_vllm-runtime-info
Open

added info for v100 runtime for model deployment#307
Milstein wants to merge 1 commit intomainfrom
v100_model_vllm-runtime-info

Conversation

@Milstein
Copy link
Contributor

@Milstein Milstein commented Feb 7, 2026

How to Use the NVIDIA V100 GPU Accelerator to Reduce Costs?

You can use the NVIDIA V100 GPU to reduce costs when deploying your model.
To do this, make sure you select the Serving Runtime as (V100 Support) vLLM NVIDIA GPU ServingRuntime for KServe, which is customized to support the NVIDIA V100 GPU architecture. Then, choose NVIDIA A100 GPU as the Accelerator and set the Number of accelerators to 1.

@Milstein Milstein requested a review from joachimweyl February 7, 2026 00:31
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant