DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

Source code vulnerability | DataSalon

Home Software Engineering & SecuritySource code vulnerability

Software Engineering & Security

Source code vulnerability

Available on 1 platform

Description

A collection of labeled source code snippets across 3+ programming languages including C++, Java, and Python. The dataset categorizes code blocks as either vulnerable or secure to facilitate training for automated security auditing.

Use Cases

Train a binary classifier to distinguish between secure and vulnerable code using the label and source_code fields.
Evaluate the cross-language generalization of security models using the C++, Java, and Python subsets.
Fine-tune a code-specific language model for vulnerability detection using the provided labeled examples.

Strengths

Includes labeled source code examples for C++, Java, and Python.
Categorizes code snippets based on the presence of security vulnerabilities.
Provides a compact data structure for cross-language vulnerability benchmarking.

English Text Computer Science Binary Classification Beginner

Related Datasets

Quality Score

D19

Description

Source

Reputation

Quality Score

D19

Description

Source

Reputation

Access

Community

0 views

Access

Community

0 views

Source code vulnerability

Description

Use Cases

Strengths

Related Topics

Related Datasets

Quality Score

Community

Community