CVE-2024-34078: Late Unicode Normalization Bypass

Performing Unicode normalization (like NFKC or NFC) at the end of a sanitization or filtering function can lead to security bypasses. Attackers can use alternative full-width Unicode characters (e.g., '<') that bypass initial HTML element filtering, and are later normalized into dangerous characters (e.g., '<') before output. Normalization should occur befor

Provally CuratedPublic repositoryMediumMedium confidenceVerifiedApache-2.0Python
greprules fetch cve-2024-34078-late-unicode-normalization-bypass --engine opengrep

Description

Performing Unicode normalization (like NFKC or NFC) at the end of a sanitization or filtering function can lead to security bypasses. Attackers can use alternative full-width Unicode characters (e.g., '<') that bypass initial HTML element filtering, and are later normalized into dangerous characters (e.g., '<') before output. Normalization should occur befor