NVIDIA Triton Python Backend OOB Write
CVE-2025-23319 Published on August 6, 2025

NVIDIA Triton Inference Server for Windows and Linux contains a vulnerability in the Python backend, where an attacker could cause an out-of-bounds write by sending a request. A successful exploit of this vulnerability might lead to remote code execution, denial of service, data tampering, or information disclosure.

NVD

Vulnerability Analysis

CVE-2025-23319 can be exploited over the network and requires neither privileges nor user interaction. The vulnerability is considered to have a high level of attack complexity. The potential impact of a successful exploit is considered to be very high.

Attack Vector: NETWORK
Attack Complexity: HIGH
Privileges Required: NONE
User Interaction: NONE
Scope: UNCHANGED
Confidentiality Impact: HIGH
Integrity Impact: HIGH
Availability Impact: HIGH
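
The metrics above can be combined into a single base score. The sketch below assumes they are CVSS v3.1 metrics (the entry does not state the CVSS version) and applies the published v3.1 base-score formula for the Scope: UNCHANGED case only. The function and table names are illustrative and do not come from any NVIDIA or NVD tooling.

```python
import math

# CVSS v3.1 metric weights (assumption: the listed metrics are v3.1).
AV = {"NETWORK": 0.85, "ADJACENT": 0.62, "LOCAL": 0.55, "PHYSICAL": 0.2}
AC = {"LOW": 0.77, "HIGH": 0.44}
PR = {"NONE": 0.85, "LOW": 0.62, "HIGH": 0.27}  # values for unchanged scope
UI = {"NONE": 0.85, "REQUIRED": 0.62}
CIA = {"HIGH": 0.56, "LOW": 0.22, "NONE": 0.0}


def roundup(value: float) -> float:
    """Round up to one decimal place, as defined in the CVSS v3.1 specification."""
    scaled = round(value * 100000)
    if scaled % 10000 == 0:
        return scaled / 100000.0
    return (math.floor(scaled / 10000) + 1) / 10.0


def base_score_scope_unchanged(av, ac, pr, ui, c, i, a) -> float:
    """Compute the base score for a vector whose scope is UNCHANGED."""
    iss = 1 - (1 - CIA[c]) * (1 - CIA[i]) * (1 - CIA[a])
    impact = 6.42 * iss
    exploitability = 8.22 * AV[av] * AC[ac] * PR[pr] * UI[ui]
    if impact <= 0:
        return 0.0
    return roundup(min(impact + exploitability, 10))


score = base_score_scope_unchanged(
    "NETWORK", "HIGH", "NONE", "NONE", "HIGH", "HIGH", "HIGH"
)
print(score)
```

Under these assumptions the listed metrics work out to a base score of 8.1, which falls in the "High" severity band (7.0 to 8.9). The high attack complexity is what keeps the score below the 9.8 that the same vector would receive with low complexity.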

Weakness Type

Buffer Access with Incorrect Length Value

The software uses a sequential operation to read or write a buffer, but it uses an incorrect length value that causes it to access memory that is outside of the bounds of the buffer. When the length value exceeds the size of the destination, a buffer overflow could occur.
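
As an illustration of this weakness class, the sketch below shows the kind of length validation whose absence leads to the problem. It is not Triton code: `checked_copy` is a hypothetical helper, and Python is memory-safe, so the example only models the check that native code must perform before a sequential copy into a fixed-size buffer.

```python
def checked_copy(dest: bytearray, src: bytes, length: int) -> None:
    """Copy `length` bytes from src into the start of dest.

    The length is treated as untrusted (for example, taken from a request)
    and is validated against both buffers before any bytes are written.
    """
    if length < 0:
        raise ValueError("length must be non-negative")
    if length > len(src):
        raise ValueError("length exceeds the source buffer")
    if length > len(dest):
        raise ValueError("length exceeds the destination buffer")
    # Equal-sized slice assignment overwrites in place without resizing dest.
    dest[:length] = src[:length]


buffer = bytearray(4)
checked_copy(buffer, b"abcdef", 4)   # fits: buffer now holds b"abcd"

try:
    checked_copy(buffer, b"abcdef", 6)   # declared length is larger than dest
except ValueError as exc:
    print("rejected:", exc)
```

In a language without bounds checking, omitting the third check and trusting the caller-supplied length is what turns a copy like this into an out-of-bounds write.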


Products Associated with CVE-2025-23319


Affected Versions

All versions of NVIDIA Triton Inference Server prior to 25.07 are affected by CVE-2025-23319.
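
A deployment can be checked against this boundary with a simple version comparison. The sketch below assumes release versions are written as "YY.MM" strings such as "25.06"; `is_affected` is a hypothetical helper, not part of any Triton tooling, and how the version string is obtained from a running server is left out.

```python
FIXED_RELEASE = (25, 7)  # first release not affected, per the entry above


def parse_release(version: str) -> tuple[int, int]:
    """Parse a 'YY.MM' release string into a (year, month) tuple of ints."""
    year, month = version.strip().split(".", 1)
    return int(year), int(month)


def is_affected(version: str) -> bool:
    """Return True if the release is earlier than the fixed release."""
    return parse_release(version) < FIXED_RELEASE


for release in ("24.12", "25.06", "25.07", "25.10"):
    status = "affected" if is_affected(release) else "not affected"
    print(release, status)
```

Comparing integer tuples rather than raw strings or floats avoids ordering mistakes such as treating "25.10" as smaller than "25.7".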

Exploit Probability

EPSS: 0.58%
Percentile: 68.16%

EPSS (Exploit Prediction Scoring System) scores estimate the probability that a vulnerability will be exploited in the wild within the next 30 days. The percentile is the share of all scored vulnerabilities whose EPSS score is the same as or lower than this one.