CVE-2025-23331: Triton Inference Server DoS via Excessive Memory Allocation
CVE-2025-23331 Published on August 6, 2025
NVIDIA Triton Inference Server for Windows and Linux contains a vulnerability where a user could cause a memory allocation with excessive size value, leading to a segmentation fault, by providing an invalid request. A successful exploit of this vulnerability might lead to denial of service.
Vulnerability Analysis
CVE-2025-23331 can be exploited over the network and requires neither privileges nor user interaction. Attack complexity is low. A successful exploit has no impact on confidentiality or integrity, and a high impact on availability.
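As a worked illustration, the metrics above can be plugged into the CVSS v3.1 base score formula. This is a sketch under one assumption that the text does not state: Scope is taken to be Unchanged. The weights and the round-up rule come from the CVSS v3.1 specification; the function and constant names are illustrative only.

```python
import math

# CVSS v3.1 metric weights for the values described in the analysis above.
ATTACK_VECTOR_NETWORK = 0.85
ATTACK_COMPLEXITY_LOW = 0.77
PRIVILEGES_REQUIRED_NONE = 0.85
USER_INTERACTION_NONE = 0.85
IMPACT_NONE = 0.0
IMPACT_HIGH = 0.56


def roundup(value: float) -> float:
    """Round up to one decimal place, as defined in the CVSS v3.1 specification."""
    scaled = round(value * 100000)
    if scaled % 10000 == 0:
        return scaled / 100000.0
    return (math.floor(scaled / 10000) + 1) / 10.0


def base_score_scope_unchanged(av, ac, pr, ui, c, i, a) -> float:
    """Base score for Scope: Unchanged (an assumption, not stated in the text)."""
    impact_sub_score = 1 - (1 - c) * (1 - i) * (1 - a)
    impact = 6.42 * impact_sub_score
    exploitability = 8.22 * av * ac * pr * ui
    if impact <= 0:
        return 0.0
    return roundup(min(impact + exploitability, 10))


score = base_score_scope_unchanged(
    ATTACK_VECTOR_NETWORK,
    ATTACK_COMPLEXITY_LOW,
    PRIVILEGES_REQUIRED_NONE,
    USER_INTERACTION_NONE,
    IMPACT_NONE,   # confidentiality
    IMPACT_NONE,   # integrity
    IMPACT_HIGH,   # availability
)
print(score)  # 7.5
```

Under that assumption the availability impact alone accounts for the whole impact term, which is why a pure denial-of-service issue can still land in the High severity band.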
Weakness Type
What is a Memory Allocation with Excessive Size Value Vulnerability?
The product allocates memory based on an untrusted, large size value, but it does not ensure that the size is within expected limits, allowing arbitrary amounts of memory to be allocated.
CVE-2025-23331 has been classified as a Memory Allocation with Excessive Size Value (CWE-789) weakness.
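The general defense against this class of weakness is to validate a caller-supplied size before allocating. The sketch below is a minimal illustration of that idea and is not Triton's code or its actual fix: the wire format (an 8-byte little-endian length prefix), the 64 MiB limit, and the names `read_payload`, `RequestTooLarge`, and `MAX_PAYLOAD_BYTES` are all hypothetical.

```python
import struct

# Hypothetical upper bound on a single payload; a real service would tune this.
MAX_PAYLOAD_BYTES = 64 * 1024 * 1024

# Hypothetical wire format: an 8-byte little-endian length, then the payload.
HEADER = struct.Struct("<Q")


class RequestTooLarge(ValueError):
    """Raised when the declared payload size exceeds the configured limit."""


def read_payload(request: bytes) -> bytearray:
    """Copy the payload into a new buffer only after the declared size is checked."""
    if len(request) < HEADER.size:
        raise ValueError("request is shorter than the length header")

    (declared_size,) = HEADER.unpack_from(request, 0)

    # Reject untrusted sizes before any allocation happens.
    if declared_size > MAX_PAYLOAD_BYTES:
        raise RequestTooLarge(
            f"declared size {declared_size} exceeds limit {MAX_PAYLOAD_BYTES}"
        )

    body = request[HEADER.size:]
    # The declared size must also match the bytes that actually arrived.
    if declared_size != len(body):
        raise ValueError("declared size does not match the bytes received")

    buffer = bytearray(declared_size)  # allocation is now bounded
    buffer[:] = body
    return buffer
```

A request built as `HEADER.pack(5) + b"hello"` is accepted, while one that declares a size of `2**62` is rejected with `RequestTooLarge` before any buffer is created.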
Products Associated with CVE-2025-23331
Affected Versions
All versions of NVIDIA Triton Inference Server prior to 25.06 are affected by CVE-2025-23331.
Exploit Probability
EPSS (Exploit Prediction Scoring System) scores estimate the probability that a vulnerability will be exploited in the wild within the next 30 days. The percentile shows you how this score compares to all other vulnerabilities.