CVE-2025-23331: Triton Inference Server DoS via Excessive Memory Allocation
CVE-2025-23331 Published on August 6, 2025

NVIDIA Triton Inference Server for Windows and Linux contains a vulnerability where a user could cause a memory allocation with excessive size value, leading to a segmentation fault, by providing an invalid request. A successful exploit of this vulnerability might lead to denial of service.

NVD

Vulnerability Analysis

CVE-2025-23331 can be exploited over the network and requires neither privileges nor user interaction. Attack complexity is low. A successful exploit has no impact on confidentiality or integrity, and a high impact on availability.

Attack Vector: NETWORK
Attack Complexity: LOW
Privileges Required: NONE
User Interaction: NONE
Scope: UNCHANGED
Confidentiality Impact: NONE
Integrity Impact: NONE
Availability Impact: HIGH
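
The metrics above can be turned into a numeric base score. The sketch below assumes the CVSS v3.1 base-score equations and metric weights for an unchanged scope; the page itself does not state the CVSS version or the score, so treat both as assumptions. The function and table names are illustrative only.

```python
import math

# Metric weights from the CVSS v3.1 specification (Scope: Unchanged).
# Assumption: the metrics listed on this page are CVSS v3.1 metrics.
AV = {"N": 0.85, "A": 0.62, "L": 0.55, "P": 0.2}
AC = {"L": 0.77, "H": 0.44}
PR = {"N": 0.85, "L": 0.62, "H": 0.27}
UI = {"N": 0.85, "R": 0.62}
CIA = {"H": 0.56, "L": 0.22, "N": 0.0}


def roundup(value: float) -> float:
    """Round up to one decimal place using the integer method from CVSS v3.1."""
    scaled = round(value * 100000)
    if scaled % 10000 == 0:
        return scaled / 100000.0
    return (math.floor(scaled / 10000) + 1) / 10.0


def base_score_unchanged(av, ac, pr, ui, c, i, a) -> float:
    """Base score for a vector whose Scope is UNCHANGED."""
    iss = 1 - (1 - CIA[c]) * (1 - CIA[i]) * (1 - CIA[a])
    impact = 6.42 * iss
    exploitability = 8.22 * AV[av] * AC[ac] * PR[pr] * UI[ui]
    if impact <= 0:
        return 0.0
    return roundup(min(impact + exploitability, 10))


# AV:N / AC:L / PR:N / UI:N / S:U / C:N / I:N / A:H
score = base_score_unchanged("N", "L", "N", "N", "N", "N", "H")
print(score)  # 7.5
```

Under these assumptions the vector scores 7.5, which falls in the HIGH severity band (7.0 to 8.9) of CVSS v3.1.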

Weakness Type

What is a Memory Allocation with Excessive Size Value Vulnerability?

The product allocates memory based on an untrusted, large size value, but it does not ensure that the size is within expected limits, allowing arbitrary amounts of memory to be allocated.

CVE-2025-23331 has been classified as a Memory Allocation with Excessive Size Value (CWE-789) weakness.
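
The usual defense against this class of weakness is to check an attacker-supplied size against an explicit limit before allocating anything. The sketch below illustrates that pattern only; it is not Triton's code or NVIDIA's fix. The wire format (an 8-byte little-endian length followed by the payload), the limit, and all names are hypothetical.

```python
import struct

# Hypothetical upper bound; a real service would derive this from configuration.
MAX_PAYLOAD_BYTES = 64 * 1024 * 1024

HEADER_SIZE = 8  # hypothetical frame: 8-byte little-endian length prefix


class RequestTooLarge(ValueError):
    """Raised when the declared size exceeds the configured limit."""


def read_payload(message: bytes, max_bytes: int = MAX_PAYLOAD_BYTES) -> bytearray:
    """Validate the declared size, then allocate and copy the payload."""
    if len(message) < HEADER_SIZE:
        raise ValueError("truncated header")

    (declared,) = struct.unpack_from("<Q", message, 0)

    # Reject oversized requests BEFORE allocating, so an untrusted size
    # value can never drive the allocation.
    if declared > max_bytes:
        raise RequestTooLarge(f"declared {declared} bytes, limit is {max_bytes}")

    # The declared size must also agree with the bytes actually received.
    if declared != len(message) - HEADER_SIZE:
        raise ValueError("declared size does not match payload length")

    buffer = bytearray(declared)  # allocation is now bounded by max_bytes
    buffer[:] = message[HEADER_SIZE:]
    return buffer


ok = read_payload(struct.pack("<Q", 3) + b"abc")
print(bytes(ok))  # b'abc'

try:
    read_payload(struct.pack("<Q", 2**60))  # absurd size, no payload
except RequestTooLarge as exc:
    print("rejected:", exc)
```

The oversized request is rejected with an ordinary error that the server can report to the client, instead of reaching the allocator and crashing the process.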


Products Associated with CVE-2025-23331

Affected Versions

All versions of NVIDIA Triton Inference Server prior to 25.06 are affected by CVE-2025-23331.
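
To check a deployment against this range, the release string can be compared with 25.06. The sketch below assumes the version is given as a plain "YY.MM" release string such as "25.05", without tag suffixes; the helper names are hypothetical.

```python
# First release that is not affected, per the statement above.
FIRST_UNAFFECTED = (25, 6)


def parse_release(version: str) -> tuple[int, int]:
    """Parse a plain 'YY.MM' release string into a (year, month) tuple."""
    year, month = version.strip().split(".")[:2]
    return int(year), int(month)


def is_affected(version: str) -> bool:
    """Return True if the release is earlier than 25.06."""
    return parse_release(version) < FIRST_UNAFFECTED


for release in ("24.12", "25.05", "25.06", "25.07"):
    print(release, "affected" if is_affected(release) else "not affected")
```

Comparing the parsed tuples rather than the raw strings avoids mistakes when a release string lacks zero padding, for example "25.6".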

Exploit Probability

EPSS: 0.15%
Percentile: 35.59%

EPSS (Exploit Prediction Scoring System) scores estimate the probability that a vulnerability will be exploited in the wild within the next 30 days. The percentile shows you how this score compares to all other vulnerabilities.