NVIDIA Triton Inference Server HTTP Endpoint DoS via Large Compressed Payload
CVE-2026-24158 Published on March 24, 2026
NVIDIA Triton Inference Server contains a vulnerability in the HTTP endpoint where an attacker may cause a denial of service by providing a large compressed payload. A successful exploit of this vulnerability may lead to denial of service.
Vulnerability Analysis
CVE-2026-24158 is exploitable with network access, and does not require authorization privileges or user interaction. This vulnerability is considered to have a low attack complexity. The potential impact of an exploit of this vulnerability is considered to have no impact on confidentiality and integrity, and a high impact on availability.
Weakness Type
What is a Stack Exhaustion Vulnerability?
The product allocates memory based on an untrusted, large size value, but it does not ensure that the size is within expected limits, allowing arbitrary amounts of memory to be allocated.
CVE-2026-24158 has been classified to as a Stack Exhaustion vulnerability or weakness.
Products Associated with CVE-2026-24158
Want to know whenever a new CVE is published for NVIDIA Triton Inference Server? stack.watch will email you.
Affected Versions
NVIDIA Triton Inference Server Version All versions prior to 26.01 is affected by CVE-2026-24158Exploit Probability
EPSS (Exploit Prediction Scoring System) scores estimate the probability that a vulnerability will be exploited in the wild within the next 30 days. The percentile shows you how this score compares to all other vulnerabilities.