Loading...
Loading...
Student performance, MOOC logs, knowledge tracing, standardized tests, learning analytics
13,410 datasets
Comprising survey responses from 2,949 undergraduate students across 30 courses at a large public university, collected in 2022. It examines relationships between students' motivational perceptions, course ease, and their evaluations of teaching using the MUSIC Model of Academic Motivation Inventory.
Featuring qualitative data from a study investigating communication accessibility in public transport for communication-vulnerable individuals in Rotterdam. It includes photographs, customer journey maps, and focus-group interview transcripts collected by Speech and Language Therapy students between September and December 2021. The data was gathered through experiential learning mystery visits conducted with travel companions who have lived experience of communication vulnerability.
A structured knowledge dataset exploring quantum consciousness theories. It was created by transforming educational media into a structured format. The dataset's author, organization, and last update date are unknown.
160+ papers from 2016 to 2026 are surveyed, examining how AI systems operationalize Islamic knowledge. The survey spans NLP, information retrieval, speech processing, multimodal learning, educational technology, and LLM alignment. It was authored by QCRI and last updated on February 25, 2026.
Contrast_learning_normal is a dataset published on Kaggle, likely intended for machine learning experiments involving contrastive learning. The dataset's specific content, size, and features are not described in the available metadata. Its author, organization, and last update date are unknown.
AI readiness assessment data published on Kaggle. The dataset likely contains metrics for evaluating organizational preparedness for artificial intelligence adoption. Specific details regarding its size, features, and creation date are not provided in the available metadata.
SWITCH-Basic V1 Open contains between 1,000 and 10,000 records of real-world Tangible Computer Interface (TCI) interaction data for embodied agents. Developed by BAAI-Agents and released in early 2026, this multimodal collection includes images and videos of physical interface interactions and verification tasks.
8 columns include Tweet, polarity, username, and timestamp. The polarity column provides sentiment labels ranging from neutral to positive and negative. This dataset consists of tweets posted by Andrew Ng, co-founder of Coursera and adjunct professor at Stanford University.
Case-Study-Applicants-for-a-Gold-Digger-position is a synthetic dataset from OpenML containing 20,000 fictional job applications. It includes applicant characteristics such as age, diploma, salary expectation, and exam score, with a binary hiring outcome. The dataset is intended as a playground for data science skill development and interview preparation.
A dataset titled 'ML _LEARNING' published on Kaggle. The dataset's content likely relates to machine learning concepts or educational exercises. No further metadata on size, columns, or origin is available.
Commissioned by the U.S. Department of Health, Education, and Welfare in 1966, the Equality of Educational Opportunity Study (EEOS) is a landmark social survey used for national policy-making. It includes test scores and questionnaire responses from a national sample of first-, third-, sixth-, ninth-, and twelfth-grade students, as well as their teachers and principals. The data captures student demographics, socioeconomic background, attitudes, and performance on standardized tests of verbal skills, reading, and mathematics.
A U.S. national school-based survey from the 2005-2006 school year, part of the WHO-sponsored Health Behavior in School-Aged Children (HBSC) study. The data capture health-related attitudes and behaviors of young people across more than 40 countries. It was conducted by Ronald J. Iannotti under the Eunice Kennedy Shriver National Institute of Child Health and Human Development.
A dataset compiled by Mark Donohue for a chapter in 'The Oxford Guide of Malayo-Polynesian languages'. It contains linguistic data on word order for Malayo-Polynesian languages in Southeast Asia. The dataset is associated with the Living Tongues Institute for Endangered Languages.
This dataset supports an analysis of the impact of official language policies on educational outcomes in India, using historical state formation. It examines literacy and college graduation rates in districts where the official language did or did not match the local language. The analysis suggests political reorganization can mitigate negative effects of language mismatch.
This dataset supports research on the effects of reskilling education on antidepressant use among injured workers and their partners in Denmark. The analysis is based on the universe of the Danish population, exploiting institutional variation in access to higher education following work accidents. The study finds reskilling prevents antidepressant use for one in three participants, with comparable spillover effects on partners.
This dataset supports research on the effects of reskilling education on antidepressant use among injured workers and their partners in Denmark. The analysis is based on the universe of the Danish population, exploiting institutional variation in access to higher education following work accidents. The study finds reskilling prevents antidepressant use for one in three participants, with comparable spillover effects on partners.
Kaggle hosts a dataset titled 'Education'. The dataset's specific contents, such as student records, test scores, or institutional metrics, are not detailed in the provided metadata. Its size, structure, and creation date are unknown.
2,152,112 instruction-tuning samples comprise this mixture released by Allen Institute for AI in 2026 for training the Olmo 3 7B Instruct model. It aggregates prompts from sources like OpenThoughts 3, featuring a 32K context length and a blend of expert, machine, and crowdsourced annotations.
A dataset of 1.05 million student interactions from 4,939 middle school students. It is hosted on Kaggle and appears to be designed for modeling student knowledge acquisition over time. The specific author, organization, and time range of data collection are not provided.
Supplementary materials for a study on the relationship between school characteristics and students becoming 'Not in Education, Employment or Training'. The dataset, authored by Sam Denny, was last updated in March 2026 and is licensed under CC BY 4.0.