CVE-2026-54235

vLLM: temperature=NaN and temperature=Infinity bypass validation and propagate to GPU kernels

CVSS 6.9 MEDIUM · EPSS 0.3% · CWE-1287

Vexday Risk Score

13 Low

SSVC decision (CISA)

Track

No exploitation signal → monitor

CVSS 6.9 · EPSS 0.3% · KEV: no · PoC: - · Patch: -

Lifecycle

22 Jun 2026: Published on NVD

Recommendation: Monitor — no exploitation signal at the moment.

vLLM is an inference and serving engine for large language models (LLMs). Prior to 0.23.1rc0, all temperature validation gates use comparison operators (<, >), which silently evaluate to False for NaN and for positive Infinity in Python's IEEE 754 float semantics. Both values pass every guard and propagate to GPU sampling kernels, where they produce undefined behavior or CUDA errors that can crash the inference worker. This vulnerability is fixed in 0.23.1rc0.
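The comparison behavior described above can be reproduced in plain Python. The following is a minimal sketch, not vLLM's actual code or its patch: `naive_check` and `strict_check` are hypothetical names, and the lower-bound-only guard is an assumption chosen to illustrate the pattern. It shows why a comparison-only gate accepts NaN and positive Infinity, and how an explicit `math.isfinite` test closes the gap.

```python
import math


def naive_check(temperature: float) -> bool:
    """Hypothetical comparison-only guard: True means the value is accepted."""
    # NaN < 0.0 and inf < 0.0 both evaluate to False, so neither is rejected.
    if temperature < 0.0:
        return False
    return True


def strict_check(temperature: float) -> bool:
    """Hypothetical hardened guard: reject non-finite values first."""
    # math.isfinite returns False for NaN, +inf, and -inf.
    if not math.isfinite(temperature):
        return False
    return temperature >= 0.0


if __name__ == "__main__":
    for value in (0.7, -1.0, float("nan"), float("inf")):
        print(f"{value!r}: naive={naive_check(value)} strict={strict_check(value)}")
```

Rejecting non-finite values at the request boundary, before any range comparison, keeps such inputs from reaching downstream numeric code at all.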

CVSS:4.0/AV:N/AC:L/AT:N/PR:N/UI:N/VC:N/VI:N/VA:L/SC:N/SI:N/SA:N

Affected products

vllm-project · vllm

References

https://github.com/vllm-project/vllm/commit/d598d239737cfa37bcfcb98886ec3f3557fc7198
https://github.com/vllm-project/vllm/pull/45116
https://github.com/vllm-project/vllm/security/advisories/GHSA-7h4p-rffg-7823