VAITP Dataset

Dataset Statistics

> Vulnerability Details

Vulnerability:

Description:

ODC:

ODC taxonomy:

Code Defects Classification	Fault Nature	Example
Function	Missing Functionality	A new function or functionality in an existing function is missing
	Incorrect Functionality	Part of the function code structure needs to be altered
	Extraneous Functionality	Functionality that is actually not needed
Interface	Missing Interface	A parameter is missing in a function call
	Incorrect Interface	Incorrect information was passed to a function call
	Extraneous Interface	Surplus data was passed to a function call
Checking	Missing Check	Conditional logic is missing
	Incorrect Check	Incorrect logic used
	Extraneous Check	Superfluous logic should not be present
Assignment	Missing Assignment	A variable was not assigned a value or not initialized
	Incorrect Assignment	An incorrect value was assigned to a variable
	Extraneous Assignment	A variable should not have been assigned
Timing/Serialization	Timing Issues	Thread issues or race conditions
	Serialization Issues	Incorrect serialization operations
	Extraneous Serialization	Superfluous serialization
Build/Package/Merge	Building Issues	Undefined reference to function
	Packaging Issues	Conflict of dependency versions
	Merging Conflicts	Two branches of a git changed the same line of code
Algorithm	Missing Algorithm	Part of the algorithm is missing
	Incorrect Algorithm	Algorithm is not correctly coded
	Extraneous Algorithm	Algorithm has superfluous code

Category:

Subcategory:

CVEs:

CWEs:

References:

Solution:

Available Python Code:

Dataset Explorer

Vulnerable entries

Patched entries

Vulnerable Python Code

# VAITP - Vulnerability Attack and Injection Tool for Python
										# Select a vulnerable file from the explorer

Patched Python Code

# VAITP - Vulnerability Attack and Injection Tool for Python
										# Select a patched file from the explorer

Search for CVE

CVE

Vulnerability

ODC

Category

Subcategory

Accessibility Scope

Details

Total vulnerabilities in the dataset (not showing ignored and non-python related vulnerabilties): 1611

1926

CVE-2026-44345

Dockerfile injection in BentoML via `bento.yaml` allows code execution.

BentoML is a Python library for building online serving systems optimized for AI apps and model inference. Prior to 1.4.39, src/bentoml/_internal/container/frontend/dockerfile/templates/base_v2.j2 interpolates docker.base_image raw with no escaping, newline filtering, or validation. A malicious bento.yaml with a multi-line docker.base_image value smuggles arbitrary Dockerfile directives into the generated Dockerfile, and bentoml containerize then runs docker build which executes the injected RUN directives on the victim host. This vulnerability is fixed in 1.4.39.

Checking

Input Validation and Sanitization

Command Injection

Local

1925

CVE-2026-44899

Mistune Image directive allows CSS injection via width/height options.

Mistune is a Python Markdown parser with renderers and plugins. Prior to 3.2.1, the Image directive plugin validates the :width: and :height: options with a regex compiled as _num_re = re.compile(r"^\d+(?:\.\d*)?"). When the validated value is not a plain integer, render_block_image() inserts it directly into a style="width:...;" or style="height:...;" attribute. Because the value was accepted by the prefix-only regex, any CSS after the leading digits reaches the style= attribute verbatim and without escaping. This vulnerability is fixed in 3.2.1.

Checking

Input Validation and Sanitization

Cross-Site Scripting (XSS)

Remote

1924

CVE-2026-44898

Mistune vulnerable to XSS in TOC rendering via unescaped heading text.

Mistune is a Python Markdown parser with renderers and plugins. Prior to 3.2.1, render_toc_ul() builds a <ul> table-of-contents tree from a list of (level, id, text) tuples. Both the id value (used as href="#<id>") and the text value (used as the visible link label) are inserted into <a> tags via a plain Python format string — with no HTML escaping applied to either value. When heading IDs are derived from user-supplied heading text (the standard use-case for readable slug anchors), an attacker can craft a heading whose text breaks out of the href="#..." attribute context, injecting arbitrary HTML tags including <script> blocks directly into the rendered TOC. This vulnerability is fixed in 3.2.1.

Checking

Input Validation and Sanitization

Cross-Site Scripting (XSS)

Remote

1923

CVE-2026-44897

Mistune is vulnerable to XSS via unsanitized HTML heading IDs.

Mistune is a Python Markdown parser with renderers and plugins. Prior to 3.2.1, HTMLRenderer.heading() builds the opening <hN> tag by string-concatenating the id attribute value directly into the HTML — with no call to escape(), safe_entity(), or any other sanitisation function. A double-quote character " in the id value terminates the attribute, allowing an attacker to inject arbitrary additional attributes (event handlers, src=, href=, etc.) into the heading element. This vulnerability is fixed in 3.2.1.

Checking

Input Validation and Sanitization

Cross-Site Scripting (XSS)

Remote

1922

CVE-2026-44896

Mistune's image directive allows XSS via unescaped attribute injection.

Mistune is a Python Markdown parser with renderers and plugins. In 3.2.0 and earlier, in src/mistune/directives/image.py, the render_figure() function concatenates figclass and figwidth options directly into HTML attributes without escaping. This allows attribute injection and XSS even when HTMLRenderer(escape=True) is used, because these values bypass the inline renderer. Version 3.2.1 contains a patch.

Checking

Input Validation and Sanitization

Cross-Site Scripting (XSS)

Remote

1921

CVE-2026-44844

Deeply nested EML file causes denial of service via recursion.

eml_parser serves as a python module for parsing eml files and returning various information found in the e-mail as well as computed information. Prior to 3.0.1, EmlParser.get_raw_body_text() recurses unconditionally for every nested message/rfc822 attachment without any depth limit. An attacker who can supply a badly crafted EML file with approximately 120 nested message/rfc822 parts triggers an unhandled RecursionError and aborts parsing of the message. A 12 KB EML file is enough to crash a worker. Though this causes the parser to crash, it is an unlikely scenario as the suggested EML that crashes the parser would not pass basic RFC compliance tests. This vulnerability is fixed in 3.0.1.

Algorithm

Resource Management

Resource Exhaustion

Remote

1920

CVE-2026-44843

LangChain insecure deserialization allows instantiation with untrusted arguments.

LangChain is a framework for building agents and LLM-powered applications. Prior to 0.3.85 and 1.3.3, LangChain contains older runtime code paths that deserialize run inputs, run outputs, or other application-controlled payloads using overly broad object allowlists. These paths may call load() with allowed_objects="all". This does not enable arbitrary Python object deserialization, but it does allow any trusted LangChain-serializable object to be revived, which is broader than these runtime paths require. As a result, attacker-supplied LangChain serialized constructor dictionaries may cause trusted runtime paths to instantiate classes with untrusted constructor arguments. This vulnerability is fixed in 0.3.85 and 1.3.3.

Timing/Serialization

Input Validation and Sanitization

Insecure Parsing or Deserialization

Remote

1919

CVE-2026-44708

Mistune's math plugin is vulnerable to XSS via unsanitized expressions.

Mistune is a Python Markdown parser with renderers and plugins. Prior to 3.2.1, the mistune math plugin renders inline math ($...$) and block math ($$...$$) by concatenating the raw user-supplied content directly into the HTML output without any HTML escaping. This occurs even when the parser is explicitly created with escape=True, which is supposed to guarantee that all user-controlled text is sanitised before reaching the DOM. This vulnerability is fixed in 3.2.1.

Checking

Input Validation and Sanitization

Cross-Site Scripting (XSS)

Remote

1918

CVE-2026-44450

Lumiverse allows authenticated RCE via unsanitized server creation arguments.

Lumiverse is a full-featured AI chat application. Prior to 0.9.7, the MCP server creation endpoint validates the command field against an allowlist of binary names but forwards the args array to the child process without any validation. Every binary on the allowlist accepts an inline-code execution flag (-e for node/bun, -c for python3/deno), giving any logged-in user arbitrary OS-level code execution on the Lumiverse server. The route requires only requireAuth (not requireOwner). The server binds on all interfaces (::) and the host-header rebinding check is bypassed trivially by any HTTP client that sends Host: localhost:<port> directly, making this exploitable from any machine with network access to the server port. This vulnerability is fixed in 0.9.7.

Checking

Input Validation and Sanitization

Command Injection

Remote

1916

CVE-2026-44502

URL parsing mismatch in Bugsink's webhooks leads to an SSRF vulnerability.

Bugsink is a self-hosted error tracking tool. Prior to 2.1.3, Bugsink’s webhook URL validation could be (partially) bypassed because of a mismatch in URL parsing. The original validation logic parsed webhook URLs with Python’s urllib.parse.urlparse, then sent the request with requests.post. For malformed inputs involving backslashes and @, those components can disagree about where the authority ends and which hostname is the real target. A URL may therefore appear to target an allowlisted public hostname during validation, while the HTTP client actually connects to a different host. This vulnerability is fixed in 2.1.3.

Interface

Input Validation and Sanitization

Server-Side Request Forgery (SSRF)

Remote

« Previous1112 13 14 15 16 17 18 19 20 Next »

Introducing the "VAITP dataset": a specialized repository of Python vulnerabilities and patches, meticulously compiled for the use of the security research community. As Python's prominence grows, understanding and addressing potential security vulnerabilities become crucial. Crafted by and for the cybersecurity community, this dataset offers a valuable resource for researchers, analysts, and developers to analyze and mitigate the security risks associated with Python. Through the comprehensive exploration of vulnerabilities and corresponding patches, the VAITP dataset fosters a safer and more resilient Python ecosystem, encouraging collaborative advancements in programming security.

The supreme art of war is to subdue the enemy without fighting.
Sun Tzu – “The Art of War”

:: Shaping the future through research and ingenuity ::

VAITP

Newsletter Signup

Quick Links

Community Word

VAITP Dataset

Please set your Metamask wallet address before proposing changes!

Available Python Code:

Dataset Explorer

Vulnerable Python Code

Patched Python Code

Legal Disclaimer