NVIDIA Triton Inference Server HTTP Endpoint DoS via Large Compressed Payload
CVE-2026-24158 Published on March 24, 2026

NVIDIA Triton Inference Server contains a vulnerability in the HTTP endpoint where an attacker may cause a denial of service by providing a large compressed payload. A successful exploit of this vulnerability may lead to denial of service.

NVD

Vulnerability Analysis

CVE-2026-24158 is exploitable with network access, and does not require authorization privileges or user interaction. This vulnerability is considered to have a low attack complexity. The potential impact of an exploit of this vulnerability is considered to have no impact on confidentiality and integrity, and a high impact on availability.

Attack Vector:
NETWORK
Attack Complexity:
LOW
Privileges Required:
NONE
User Interaction:
NONE
Scope:
UNCHANGED
Confidentiality Impact:
NONE
Integrity Impact:
NONE
Availability Impact:
HIGH

Weakness Type

What is a Stack Exhaustion Vulnerability?

The product allocates memory based on an untrusted, large size value, but it does not ensure that the size is within expected limits, allowing arbitrary amounts of memory to be allocated.

CVE-2026-24158 has been classified to as a Stack Exhaustion vulnerability or weakness.


Products Associated with CVE-2026-24158

Want to know whenever a new CVE is published for NVIDIA Triton Inference Server? stack.watch will email you.

 

Affected Versions

NVIDIA Triton Inference Server Version All versions prior to 26.01 is affected by CVE-2026-24158

Exploit Probability

EPSS
0.04%
Percentile
11.26%

EPSS (Exploit Prediction Scoring System) scores estimate the probability that a vulnerability will be exploited in the wild within the next 30 days. The percentile shows you how this score compares to all other vulnerabilities.