NVIDIA Triton Python Backend OOB Write
CVE-2025-23319 Published on August 6, 2025
NVIDIA Triton Inference Server for Windows and Linux contains a vulnerability in the Python backend, where an attacker could cause an out-of-bounds write by sending a request. A successful exploit of this vulnerability might lead to remote code execution, denial of service, data tampering, or information disclosure.
Vulnerability Analysis
CVE-2025-23319 can be exploited with network access, and does not require authorization privileges or user interaction. This vulnerability is consided to have a high level of attack complexity. The potential impact of an exploit of this vulnerability is considered to be very high.
Weakness Type
Buffer Access with Incorrect Length Value
The software uses a sequential operation to read or write a buffer, but it uses an incorrect length value that causes it to access memory that is outside of the bounds of the buffer. When the length value exceeds the size of the destination, a buffer overflow could occur.
Products Associated with CVE-2025-23319
You can be notified by email with stack.watch whenever vulnerabilities like CVE-2025-23319 are published in NVIDIA Triton Inference Server:
Affected Versions
NVIDIA Triton Inference Server Version All versions prior to 25.07 is affected by CVE-2025-23319Exploit Probability
EPSS (Exploit Prediction Scoring System) scores estimate the probability that a vulnerability will be exploited in the wild within the next 30 days. The percentile shows you how this score compares to all other vulnerabilities.