NVIDIA Triton Server Python Backend Shared Memory Overflow Info Disclosure
CVE-2025-23320 Published on August 6, 2025

NVIDIA Triton Inference Server for Windows and Linux contains a vulnerability in the Python backend, where an attacker could cause the shared memory limit to be exceeded by sending a very large request. A successful exploit of this vulnerability might lead to information disclosure.

NVD

Vulnerability Analysis

CVE-2025-23320 is exploitable with network access, and does not require authorization privileges or user interaction. This vulnerability is considered to have a low attack complexity. The potential impact of an exploit of this vulnerability is considered to have a high impact on confidentiality, with no impact on integrity and availability.

Attack Vector:
NETWORK
Attack Complexity:
LOW
Privileges Required:
NONE
User Interaction:
NONE
Scope:
UNCHANGED
Confidentiality Impact:
HIGH
Integrity Impact:
NONE
Availability Impact:
NONE

Weakness Type

Generation of Error Message Containing Sensitive Information

The software generates an error message that includes sensitive information about its environment, users, or associated data.


Products Associated with CVE-2025-23320

Want to know whenever a new CVE is published for NVIDIA Triton Inference Server? stack.watch will email you.

 

Affected Versions

NVIDIA Triton Inference Server Version All versions prior to 25.07 is affected by CVE-2025-23320

Exploit Probability

EPSS
0.06%
Percentile
18.05%

EPSS (Exploit Prediction Scoring System) scores estimate the probability that a vulnerability will be exploited in the wild within the next 30 days. The percentile shows you how this score compares to all other vulnerabilities.