DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

Education Datasets | DataSalon

Education

Student performance, MOOC logs, knowledge tracing, standardized tests, learning analytics

13,421 datasets

Teacher Beliefs on Cultural Diversity and National Culture

Data collected with the Diversity Teaching Beliefs Scale (DTBS), a multidimensional instrument for assessing teacher beliefs. The scale measures three distinct dimensions: acknowledging cultural differences (ACD), emphasizing cultural communalities (ECC), and socialization of the national culture (SNC). The data were used in three studies to validate the scale's psychometric properties and to analyze patterns of belief co-occurrence.

Tags: Social Sciences (+1 more)

Child Care Licensing Section Evaluations and Violation Counts

Child care licensing inspection data details sections evaluated, standards assessed, and violations found per activity. The dataset links to operations and activity records via Operation ID and Activity ID. It covers inspections, investigations, and assessments conducted by the City of Austin.

Tags: Hhsc, Evaluation, Ccl, Inspection, Standards, Ccl Section (+1 more)

Paderborn University Car Bearing Dataset for Fault Diagnosis

The Paderborn University Car Bearing Dataset is a collection of measurements for condition monitoring and fault diagnosis. It has been converted to CSV format for easier access. The dataset originates from research at Paderborn University.

Tags: Tabular, Condition Monitoring, Mechanical Engineering, Automotive, Bearing Fault (+1 more)

Texas County Academic Performance and University Enrollment Rankings

This dataset classifies 229 Texas counties into five strategic tiers based on composite opportunity scores for university recruitment. It identifies 6 'Prime Target' counties with strong academic readiness, large student populations, and low current university presence. The analysis weights academic readiness (40%), market size (30%), current market share (20%), and test participation (10%).

Tags: Computer and Information Science (+1 more)
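
The weighting in the description above can be read as a weighted sum. The following is a minimal sketch of that reading, not the dataset authors' method: it assumes each component has already been normalized to a 0 to 1 scale where higher means a better recruitment opportunity (so a low current market share would map to a high component score). The function name, the component keys, and the example values are all hypothetical, and the cut-offs for the five tiers are not given in the description, so they are not modeled here.

```python
# Weights as stated in the dataset description.
WEIGHTS = {
    "academic_readiness": 0.40,
    "market_size": 0.30,
    "current_market_share": 0.20,
    "test_participation": 0.10,
}


def composite_score(components):
    """Return the weighted sum of component scores.

    Assumption: every component is pre-normalized to [0, 1], with higher
    values meaning a more attractive county. The normalization itself is
    not described in the source and is left to the caller.
    """
    missing = set(WEIGHTS) - set(components)
    if missing:
        raise KeyError(f"missing components: {sorted(missing)}")
    return sum(WEIGHTS[name] * components[name] for name in WEIGHTS)


# Hypothetical county with made-up component scores.
example_county = {
    "academic_readiness": 0.8,
    "market_size": 0.5,
    "current_market_share": 0.9,
    "test_participation": 0.6,
}
score = composite_score(example_county)
print(f"{score:.2f}")
```

Because the weights sum to 1, the composite stays on the same 0 to 1 scale as its inputs, which makes scores comparable across counties.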

Exam Violation Detection Using BiLSTM Models

Exam-violation-bilstm is a dataset hosted on Kaggle, likely containing time-series or sequential data for detecting academic misconduct. The dataset's title suggests it is designed for training Bidirectional Long Short-Term Memory (BiLSTM) neural networks. Its author, organization, and specific details like size and license are currently unknown.

Tags: Tabular, Time Series, Student Behavior, Time Series Classification, Exam Violation (+1 more)

Life Cycle Assessment Data for Compostable Nappies

Output data from a Life Cycle Assessment model details the environmental impact of compostable nappies across five stages: manufacture, transport, use, collection, and disposal. The dataset includes material inputs, energy consumption, and environmental outputs for analyzing end-of-life scenarios like landfill, incineration, and composting. It was published by the Environmental Information Data Centre.

Tags: United Kingdom, Life Cycle Assessment, Human Health and Safety, Lca, Compostable Nappies, China (+1 more)

Great Britain Bedrock Geology Fence Diagram with 121 Cross-Sections

This fence diagram of Great Britain's bedrock geology, dated 2009-2012, comprises 121 cross-sections with an aggregate length of over 20,000 km. Compiled by 14 expert regional geologists from the British Geological Survey, it integrates subsurface data to depths between 1.5 and 6 km. The dataset provides a consistent 3D structural context for regional studies and education.

Tags: Geology, Nerc Ddc, Fence diagrams (+1 more)

UK Onshore Coal Resource Maps for New Exploitation Technologies

British Geological Survey maps depict the spatial extent of principal UK coal resources, overlaying existing workings and potential new technologies. The project covers all onshore coalfields, including Northern Ireland, and includes data for 21 individual regions at a 1:100,000 scale. Work was initiated in April 2002 and completed in October 2003.

Tags: Coalbed Methane, Mineral Resources, Nerc Ddc, Coal resource maps, Coal (+1 more)

QULU: DBE Exam Papers from South Africa

A collection of exam papers from the South African Department of Basic Education (DBE), likely for high school subjects. The dataset is hosted on Kaggle, but the specific subjects, years, and number of papers are unknown. The original author and last update date are not provided.

Tags: Text, Education, South Africa, Exam Papers (+1 more)

Titanic Dataset for Machine Learning Tasks

Titanic Dataset is a collection of passenger information from the RMS Titanic disaster, commonly used for introductory machine learning tasks. It is published on the Kaggle platform. The dataset's specific size, features, and last update date are unknown from the provided metadata.

Tags: Tabular, Machine Learning, Titanic, Classification, Survival Prediction (+1 more)

Iris Flower Dataset for Machine Learning Practice

A classic dataset for machine learning practice, published on Kaggle. It likely contains measurements for classifying iris flower species. The original Iris dataset is a foundational benchmark for classification algorithms.

Tags: Tabular, Machine Learning, Botany, Classification, Iris Flower (+1 more)

Mt Unlearning Checkpoints: Model States for Machine Unlearning

Mt Unlearning Checkpoints is a dataset uploaded by harishm17 to Hugging Face, last updated on March 29, 2026. The dataset's title and platform tags suggest it contains saved model states related to the machine unlearning research domain. The specific content, scale, and structure require verification after download.

Tags: Tabular, Machine Unlearning, Ai Safety, Model Checkpoints, Model Training, Checkpoints, Regionus (+1 more)

Drowsy Driver Warning System Field Test: 2.4 Million Miles of Truck Data

A field operational test of an early prototype Drowsy Driver Warning System was conducted by the National Highway Traffic Safety Administration and the Federal Motor Carrier Safety Administration. The final dataset for analysis consisted of 102 drivers from 3 for-hire trucking fleets using 46 instrumented trucks, containing nearly 12.4 terabytes of truck instrumentation, kinematic data, and video recordings for 2.4 million miles of driving. This dataset is described as the largest ever collected by the U.S. Department of Transportation.

Tags: Multimodal, Transport Engineering, Telecommunications, Engineering, Data Collection, Aeronautics, Truck, Field Operational Test, Software Deployment, Automotive Engineering, Lane Departure Warning System, Test Biology, Warning System, Situation Awareness, Driver Safety, Large Scale, Statistics, Drowsiness Detection, Truck Telematics (+1 more)

Tyrannosaurus Rex Morphological Variation and Sexual Dimorphism Data

Digitized treatments from a 2008 book chapter by paleontologist Peter Larson analyze variation and sexual dimorphism in Tyrannosaurus rex. The data is sourced from the Plazi repository and the original publication 'Tyrannosaurus rex, the tyrant king'. It likely contains detailed morphological measurements and observations for the species.

Tags: Text, Variation Astronomy, Evolutionary Biology, Biology, Morphology, Zoology, Paleontology, Sexual dimorphism (+1 more)

PredictABEL: Functions for Risk Model Assessment and Simulated Data

A package by Suman Kundu, Yurii S. Aulchenko, and A. Cecile J.W. Janssens provides functions to assess the performance of risk prediction models. It includes measures like c-statistic (AUC), Hosmer-Lemeshow test, net reclassification improvement (NRI), and integrated discrimination improvement (IDI). The package also contains functions for logistic regression analysis and to construct a simulated dataset with genotypes and disease status for evaluating genetic risk models.

Tags: Tabular, Machine Learning, Computer Science, Predictive Modelling, Risk assessment, Benchmark, Healthcare, Clinical Research, Genetic Risk, Synthetic, Computer Security (+1 more)

Research on Preventing Teen Pregnancy and STDs in the United States

A research brief synthesizing findings from over 150 studies on adolescent reproductive health in the United States. It was authored by Jennifer Manlove for Child Trends and focuses on factors influencing behaviors like sexual initiation, contraceptive use, and pregnancy. The brief includes a table of successful programs and approaches.

Tags: Text, Medicine, Environmental Health, Pregnancy, Teenage Pregnancy, Genetics, Psychology, Obstetrics, Biology, Teen Pregnancy, Social Science, Sexually Transmitted Diseases, Healthcare, Population, Finance, Adolescent Health, Public Health, Sexually Active (+1 more)

MSMARCO-Passage Teacher Training IDs for Cross-Architecture Knowledge Distillation

Teacher train files containing query and passage IDs from the MSMARCO-Passage collection, created for the paper 'Improving Efficient Neural Ranking Models with Cross-Architecture Knowledge Distillation'. Sebastian Hofstätter and colleagues at TU Wien developed this resource to support research into distilling ranking knowledge between different neural architectures. The associated documentation and code are hosted on GitHub.

Tags: Text, Machine Learning, Computer Science, Architecture, Distillation, Neural Ranking, Artificial Intelligence, Chemistry, Msmarco, Geography, Artificial Neural Network, Ranking Information Retrieval, Chromatography, Knowledge Distillation, Information Retrieval (+1 more)

PRAMS: Pregnancy Risk Assessment Monitoring System Survey Data

31 U.S. states and 1 city collect data on maternal attitudes, behaviors, and experiences before, during, and after pregnancy. The Pregnancy Risk Assessment Monitoring System (PRAMS) is a joint project between the CDC and state health departments, surveying between 1,300 and 3,400 women per state annually since 1987. Surveillance reports are available for years including 1995-2000, 2002, and 2007.

Tags: Tabular, Epidemiology, Survey Data, Computer Science, Maternal Health, Healthcare, Public Health (+1 more)

Prevalence Assessment Studies with Frequentist and Bayesian Methods

The prevalence package provides statistical tools for prevalence assessment studies. It includes frequentist and Bayesian methods; the Bayesian truePrev functions require the JAGS (Just Another Gibbs Sampler) software. The package was authored by Brecht Devleesschauwer.

Tags: Tabular, Medicine, Environmental Health, Epidemiology, Medical Research, Geography, Statistics, Bayesian methods, Public Health (+1 more)

City of Austin Small Area Plan Implementation Status Report

The City of Austin Planning Department provides a PDF report on the implementation status of adopted small area plans, such as neighborhood and station area plans. The dataset tracks the status of recommendations by planning area name, but the statistics are no longer being updated.

Tags: Tabular, English, Neighborhood Development, United States, Urban Planning, City Government (+1 more)
