DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

Education Datasets | DataSalon

All Categories

🎓

Education

Student performance, MOOC logs, knowledge tracing, standardized tests, learning analytics

13,410 datasets

Education

Student Motivation and Course Evaluations Across 30 College Courses

Comprising survey responses from 2,949 undergraduate students across 30 courses at a large public university, collected in 2022. It examines relationships between students' motivational perceptions, course ease, and their evaluations of teaching using the MUSIC Model of Academic Motivation Inventory.

Student EngagementMotivational ClimateGrading LeniencyStudent Evaluations Of TeachingStudent MotivationMUSIC Model of MotivationCourse EvaluationClass Climate+1

0 views

Education

Communication Accessibility in Rotterdam Public Transport via Mystery Visits

Featuring qualitative data from a study investigating communication accessibility in public transport for communication-vulnerable individuals in Rotterdam. It includes photographs, customer journey maps, and focus-group interview transcripts collected by Speech and Language Therapy students between September and December 2021. The data was gathered through experiential learning mystery visits conducted with travel companions who have lived experience of communication vulnerability.

Social Sciences+1

0 views

Education

Quantum Consciousness Theory Knowledge Dataset from Educational Media

A structured knowledge dataset exploring quantum consciousness theories. It was created by transforming educational media into a structured format. The dataset's author, organization, and last update date are unknown.

TextGraphQuantum ConsciousnessEducationPhilosophy Of MindTheoretical Physics+1

0 views

Education

Islamic Knowledge AI Survey: 160+ Papers on AI and Islamic Sources (2016–2026)

160+ papers from 2016 to 2026 are surveyed, examining how AI systems operationalize Islamic knowledge. The survey spans NLP, information retrieval, speech processing, multimodal learning, educational technology, and LLM alignment. It was authored by QCRI and last updated on February 25, 2026.

TextAudioIslamic KnowledgeEducational TechnologyNatural Language ProcessingAi SurveyLlm Alignment+1

0 views

Education

Contrast Learning Normal: Dataset for Contrastive Learning Experiments

Contrast_learning_normal is a dataset published on Kaggle, likely intended for machine learning experiments involving contrastive learning. The dataset's specific content, size, and features are not described in the available metadata. Its author, organization, and last update date are unknown.

TabularMachine LearningNormal DataContrastive Learning+1

0 views

Education

AI Readiness Assessment Data

AI readiness assessment data published on Kaggle. The dataset likely contains metrics for evaluating organizational preparedness for artificial intelligence adoption. Specific details regarding its size, features, and creation date are not provided in the available metadata.

TabularOrganizational MetricsAssessmentAi Readiness+1

0 views

Education

SWITCH-Basic V1: Tangible Interface Interaction Data for Embodied Agents

SWITCH-Basic V1 Open contains between 1,000 and 10,000 records of real-world Tangible Computer Interface (TCI) interaction data for embodied agents. Developed by BAAI-Agents and released in early 2026, this multimodal collection includes images and videos of physical interface interactions and verification tasks.

MultimodalIMAGEFOLDERSize Categories1 Kn10 KLanguagezhUiLanguageenTask Categoriesvisual Question AnsweringTangible InterfacesTask CategoriesroboticsLibrarymlcroissantModalityimageLibrarydatasetsBenchmarkModalityvideoLicensecc By Nc 40Task CategoriesotherRegionusEmbodied AiGui AgentArxiv251117649+1

0 views

Education

Andrew Ng Machine Learning Tweets with Sentiment Polarity Labels

8 columns include Tweet, polarity, username, and timestamp. The polarity column provides sentiment labels ranging from neutral to positive and negative. This dataset consists of tweets posted by Andrew Ng, co-founder of Coursera and adjunct professor at Stanford University.

TextTabularMachine LearningSocial MediaSentiment AnalysisEducationNatural Language ProcessingTweets+1

0 views

Education

Gold Digger Job Applicant Classification Case Study with 20,000 Observations

Case-Study-Applicants-for-a-Gold-Digger-position is a synthetic dataset from OpenML containing 20,000 fictional job applications. It includes applicant characteristics such as age, diploma, salary expectation, and exam score, with a binary hiring outcome. The dataset is intended as a playground for data science skill development and interview preparation.

TabularClassificationJob ApplicantsCase StudyPersonnel Selection+1

0 views

Education

Machine Learning Learning: Educational Dataset

A dataset titled 'ML _LEARNING' published on Kaggle. The dataset's content likely relates to machine learning concepts or educational exercises. No further metadata on size, columns, or origin is available.

TabularMachine LearningEducationTutorial+1

0 views

Education

EEOS (Coleman Study): 1966 U.S. Student and School Survey on Educational Opportunity

Commissioned by the U.S. Department of Health, Education, and Welfare in 1966, the Equality of Educational Opportunity Study (EEOS) is a landmark social survey used for national policy-making. It includes test scores and questionnaire responses from a national sample of first-, third-, sixth-, ninth-, and twelfth-grade students, as well as their teachers and principals. The data captures student demographics, socioeconomic background, attitudes, and performance on standardized tests of verbal skills, reading, and mathematics.

TabularSurvey DataEducation EquityStudent PerformancePsychologyStandardized TestingHealthcareMathematics EducationSociology+1

0 views

Education

HBSC: U.S. Adolescent Health Behavior Survey, 2005-2006

A U.S. national school-based survey from the 2005-2006 school year, part of the WHO-sponsored Health Behavior in School-Aged Children (HBSC) study. The data capture health-related attitudes and behaviors of young people across more than 40 countries. It was conducted by Ronald J. Iannotti under the Eunice Kennedy Shriver National Institute of Child Health and Human Development.

TabularSubstance UseSurvey DataPsychologyHealthcareComputer VisionAdolescent HealthSchool SurveyCross NationalSchool Health+1

0 views

Education

Word Order Data for Malayo-Polynesian Languages of Southeast Asia

A dataset compiled by Mark Donohue for a chapter in 'The Oxford Guide of Malayo-Polynesian languages'. It contains linguistic data on word order for Malayo-Polynesian languages in Southeast Asia. The dataset is associated with the Living Tongues Institute for Endangered Languages.

TabularSoutheast AsiaEndangered LanguagesHistoryMalayo-PolynesianOrder ExchangeWord OrderEthnologyGeographyPhilosophyBusinessWord Group TheoryLinguistics+1

0 views

Education

Impact of Language Policy on Indian Education Outcomes

This dataset supports an analysis of the impact of official language policies on educational outcomes in India, using historical state formation. It examines literacy and college graduation rates in districts where the official language did or did not match the local language. The analysis suggests political reorganization can mitigate negative effects of language mismatch.

Language PolicyCensus RecordsEducation+1

0 views

Education

Reskilling Impact on Antidepressant Use in Danish Injured Workers

This dataset supports research on the effects of reskilling education on antidepressant use among injured workers and their partners in Denmark. The analysis is based on the universe of the Danish population, exploiting institutional variation in access to higher education following work accidents. The study finds reskilling prevents antidepressant use for one in three participants, with comparable spillover effects on partners.

ReskillingMental HealthDomestic PartnershipEducationSpillover Effects+1

0 views

Education

Reskilling Impact on Antidepressant Use in Danish Injured Workers

ReskillingMental HealthDomestic PartnershipEducationSpillover Effects+1

0 views

Education

Education Data from Kaggle

Kaggle hosts a dataset titled 'Education'. The dataset's specific contents, such as student records, test scores, or institutional metrics, are not detailed in the provided metadata. Its size, structure, and creation date are unknown.

TabularStudent PerformanceEducationSchool Data+1

0 views

Education

Dolci Instruct SFT: 2.1 Million Multilingual Samples for Olmo 3 Training

2,152,112 instruction-tuning samples comprise this mixture released by Allen Institute for AI in 2026 for training the Olmo 3 7B Instruct model. It aggregates prompts from sources like OpenThoughts 3, featuring a 32K context length and a blend of expert, machine, and crowdsourced annotations.

LanguagedanLanguageamhLanguagecebLanguagefilLanguagedeuLanguagebenLanguageellLanguagearzLanguagearyLanguageeusLanguagearbLanguageacqMultilingualitymultilingualTask CategoriesotherAnnotations CreatorscrowdsourcedLanguageengLanguageapcLanguagearsAnnotations Creatorsexpert Generated+1

0 views

Education

Middle School Student Knowledge Tracing with 1.05 Million Interactions

A dataset of 1.05 million student interactions from 4,939 middle school students. It is hosted on Kaggle and appears to be designed for modeling student knowledge acquisition over time. The specific author, organization, and time range of data collection are not provided.

TabularMiddle SchoolStudent PerformanceKnowledge Tracing+1

0 views

Education

School Characteristics and NEET Student Outcomes in the United Kingdom

Supplementary materials for a study on the relationship between school characteristics and students becoming 'Not in Education, Employment or Training'. The dataset, authored by Sam Denny, was last updated in March 2026 and is licensed under CC BY 4.0.

PsychologyStatistics+1

0 views

PreviousPage 388 of 669Next