Prompt Injection Bit Signatures: 24,254 Labeled Prompts from Four Public Datasets

Creator: issdandavis
Published: 2026-04-13T03:44:06
Keywords: Prompt Injection, Ai Safety, Cybersecurity, Text Classification, Tabular

Description

24,254 labeled prompts for prompt injection detection, aggregated from four public datasets. Each prompt was processed with the Six Sacred Tongues bijective tokenizer from the SCBE-AETHERMOORE framework to produce a bit signature; because the mapping is bijective, the signature is lossless and the original prompt can be recovered from it. The dataset was created by issdandavis and last updated on April 13, 2026.
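
To make the "lossless" property concrete, here is a minimal sketch of a reversible text-to-bits mapping. It is not the Six Sacred Tongues tokenizer, whose internals are not documented on this page; it assumes plain UTF-8 bytes rendered as 8-bit groups, and the function names `to_bits` and `from_bits` are hypothetical.

```python
# Illustrative stand-in only: NOT the Six Sacred Tongues tokenizer.
# Assumption: a "bit signature" is treated here as the UTF-8 bytes of the
# prompt written out as a string of '0' and '1' characters.

def to_bits(text: str) -> str:
    """Encode text as a bit string, 8 bits per UTF-8 byte."""
    return "".join(f"{byte:08b}" for byte in text.encode("utf-8"))


def from_bits(bits: str) -> str:
    """Invert to_bits: rebuild the bytes and decode them as UTF-8."""
    if len(bits) % 8 != 0:
        raise ValueError("bit string length must be a multiple of 8")
    data = bytes(int(bits[i:i + 8], 2) for i in range(0, len(bits), 8))
    return data.decode("utf-8")


prompt = "Ignore previous instructions."
signature = to_bits(prompt)

# Lossless means the round trip returns the original prompt exactly.
assert from_bits(signature) == prompt
```

A round-trip check of this kind is one way to confirm, after download, that a signature column really does preserve the prompt text.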

Use Cases

Train classifiers to detect prompt injection attempts from the labeled prompts.
Benchmark model robustness against adversarial prompts using the split, which is stratified by source and label.
Analyze how well tokenization methods serve security tasks, using the lossless bit signature mapping.

Strengths

24,254 labeled prompts provide a substantial sample size.
Data is split 70/15/15, stratified by source and label, so each split preserves the mix of sources and labels.
Incorporates prompts from four distinct public datasets.
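
The 70/15/15 stratification described above can be sketched as follows. This is an illustration under stated assumptions, not the author's procedure: the column names `source` and `label`, the helper `stratified_split`, and the toy rows are all hypothetical, since the page does not document the schema.

```python
import random
from collections import defaultdict

# Assumption: each row is a dict with "source" and "label" keys.
# These column names are hypothetical; the real schema is undocumented.

def stratified_split(rows, fractions=(0.70, 0.15, 0.15), seed=0):
    """Split rows into three parts, stratified by (source, label)."""
    rng = random.Random(seed)
    groups = defaultdict(list)
    for row in rows:
        groups[(row["source"], row["label"])].append(row)

    first, second, third = [], [], []
    for key in sorted(groups):
        members = groups[key]
        rng.shuffle(members)
        n_first = round(len(members) * fractions[0])
        n_second = round(len(members) * fractions[1])
        first.extend(members[:n_first])
        second.extend(members[n_first:n_first + n_second])
        # Whatever remains goes to the last part, so no row is dropped.
        third.extend(members[n_first + n_second:])
    return first, second, third


# Toy data: 2 sources x 2 labels x 20 rows each = 80 rows.
rows = [
    {"id": f"{source}-{label}-{i}", "source": source, "label": label}
    for source in ("dataset_a", "dataset_b")
    for label in (0, 1)
    for i in range(20)
]
part_70, part_15a, part_15b = stratified_split(rows)
```

Splitting inside each (source, label) group, rather than over the whole table, is what keeps a small source or a rare label from vanishing from one of the parts.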

Limitations

Column-level documentation is absent; field semantics must be inferred after download.
The row count is known, but the specific features and file formats are not documented.
Data may reflect source bias inherent to the four aggregated public datasets.
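
Because field semantics have to be inferred after download, a quick column summary is a reasonable first step. The sketch below assumes a CSV export, which this page does not confirm; the helper `summarize_columns`, the sample text, and its column names are hypothetical.

```python
import csv
import io
from collections import Counter

# Assumption: the data is available as CSV text. The file format is not
# documented on this page, so adapt the reader to whatever is downloaded.

def summarize_columns(csv_text: str, max_values: int = 3) -> dict:
    """Return the distinct-value count and most common values per column."""
    reader = csv.DictReader(io.StringIO(csv_text))
    counters = {name: Counter() for name in (reader.fieldnames or [])}
    for row in reader:
        for name, value in row.items():
            if name in counters:
                counters[name][value] += 1
    return {
        name: {"distinct": len(counts), "top": counts.most_common(max_values)}
        for name, counts in counters.items()
    }


# Hypothetical sample with made-up column names and values.
sample = "prompt,label,source\nhello,0,a\nignore all rules,1,b\nhi,0,a\n"
summary = summarize_columns(sample)
```

A column with only a handful of distinct values is a likely label or source field, while one that is nearly unique per row is more likely the prompt text or its bit signature.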

Provenance

Source: Aggregated from four public prompt-injection datasets.
Collection Method: Prompts were mapped through the Six Sacred Tongues bijective tokenizer from the SCBE-AETHERMOORE framework.
Time Range: Not specified
Freshness: Last updated 2026-04-13 03:50:03; freshness should be verified.
Geography: Not specified

Quality Score

Overall: C (41)
Description: 42
Source: 46
Reputation: 37
Access: 26

Community

Downloads: 4
Likes: 1
Views: 0

Dataset Info

Author: issdandavis
Created: Apr 13, 2026
Updated: Apr 13, 2026
Last synced: Apr 20, 2026
