Remote Code Execution via Model Name in NVIDIA Triton Inference Server (Python backend)
CVE-2025-23316 Published on September 17, 2025
NVIDIA Triton Inference Server for Windows and Linux contains a vulnerability in the Python backend, where an attacker could cause remote code execution by manipulating the model name parameter in the model control APIs. A successful exploit of this vulnerability might lead to remote code execution, denial of service, information disclosure, and data tampering.
Vulnerability Analysis
CVE-2025-23316 is exploitable over the network and requires neither privileges nor user interaction. Its attack complexity is considered low. The potential impact is considered critical, because a successful exploit has a high impact on the confidentiality, integrity, and availability of the affected component.
Weakness Type
What is a Shell Injection Vulnerability?
The software constructs all or part of an OS command using externally-influenced input from an upstream component, but it does not neutralize or incorrectly neutralizes special elements that could modify the intended OS command when it is sent to a downstream component.
CVE-2025-23316 has been classified as a shell injection vulnerability or weakness.
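The weakness class described above can be illustrated with a minimal, generic sketch. This is not Triton's code and does not reproduce the actual flaw; the function names, the `echo loading` command, and the allow-list pattern are all hypothetical, chosen only to show how an externally supplied name can change a shell command and how validation plus an argument list avoids that.

```python
import re
import subprocess
from typing import List

# Hypothetical allow-list for illustration: letters, digits, dot,
# underscore, and hyphen, 1 to 128 characters. Not Triton's actual rule.
_SAFE_NAME = re.compile(r"[A-Za-z0-9._-]{1,128}")


def build_unsafe_command(model_name: str) -> str:
    """Vulnerable pattern: the name is pasted into a shell command string.

    If this string is later run with shell=True, metacharacters such as
    ';' or '$(...)' in model_name become part of the command itself.
    """
    return f"echo loading {model_name}"


def build_safe_argv(model_name: str) -> List[str]:
    """Safer pattern: validate the name, then pass it as a single argument."""
    if not _SAFE_NAME.fullmatch(model_name):
        raise ValueError(f"rejected model name: {model_name!r}")
    # An argument list is handed to the program directly, with no shell
    # in between, so the name is never parsed as shell syntax.
    return ["echo", "loading", model_name]


def run_safe(model_name: str) -> str:
    """Run the validated command without a shell (assumes an 'echo' binary)."""
    result = subprocess.run(
        build_safe_argv(model_name),
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout.strip()
```

With these hypothetical helpers, a name such as `m; id` yields the string `echo loading m; id` in the unsafe variant, which a shell would treat as two commands, while the safe variant rejects the same name with a `ValueError` before anything is executed.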
Products Associated with CVE-2025-23316
Affected Versions
NVIDIA Triton Inference Server: all versions prior to 25.08 are affected by CVE-2025-23316.
Exploit Probability
EPSS (Exploit Prediction Scoring System) scores estimate the probability that a vulnerability will be exploited in the wild within the next 30 days. The percentile shows you how this score compares to all other vulnerabilities.