SecCoderX Reward Model GRPO Dataset: Secure Code Generation via Reinforcement Learning

Name: SecCoderX Reward Model GRPO Dataset: Secure Code Generation via Reinforcement Learning
Creator: SecCoderX
Published: 2026-02-13T09:14:20
Keywords: Vulnerability Detection, Secure Code Generation, Text, Reinforcement Learning, Reward Model

by SecCoderXUpdated 4mo ago

Available on 1 platform

Sign in to view source links and access this dataset

Description

SecCoderX_Reward_Model_GRPO_dataset is a dataset associated with the research paper 'Secure Code Generation via Online Reinforcement Learning with Vulnerability Reward Model'. The dataset was created by the SecCoderX organization and was last updated on Hugging Face in March 2026.

Use Cases

Training a vulnerability reward model for secure code generation based on the described research.
Fine-tuning reinforcement learning agents for code generation tasks.
Benchmarking code generation models against security vulnerabilities.
Studying the relationship between code features and security flaws.

Strengths

Dataset is directly linked to a specific, cited research paper (arXiv:2602.07422).
Last update timestamp is precise (2026-03-02 18:26:53).

Limitations

Description metadata is limited; actual data quality requires manual inspection after download.
Column-level documentation is absent; field semantics must be inferred after download.
Row count and file size are unknown, which may limit suitability assessment.

Provenance

Source: SecCoderX
Collection Method: Likely generated as part of reinforcement learning research for secure code generation.
Time Range: Associated research paper is from 2026.
Freshness: Last updated 2026-03-02 18:26:53; freshness should be verified.
Geography: null

License is unknown; users should verify terms before use.

Text Vulnerability Detection Secure Code Generation Reinforcement Learning Reward Model

Related Datasets

Quality Score

D38

Description

42

Source

36

Reputation

40

Access

26

Community

22 downloads

1 likes

0 views

Dataset Info

Author: SecCoderX
Created: Feb 13, 2026
Updated: Mar 2, 2026
Last synced: Apr 27, 2026

Access

26

Community

22 downloads

1 likes

0 views

Dataset Info

Author: SecCoderX
Created: Feb 13, 2026
Updated: Mar 2, 2026
Last synced: Apr 27, 2026

SecCoderX Reward Model GRPO Dataset: Secure Code Generation via Reinforcement Learning

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info