DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

Education Datasets | DataSalon

All Categories

🎓

Education

Student performance, MOOC logs, knowledge tracing, standardized tests, learning analytics

13,313 datasets

Education

Global K-12 STEM, Robotics, AI and Engineering Education Data

Global K-12 STEM, Robotics, AI & Engineering Education Dataset (Grades 1–12) is aggregated from Kaggle. Its specific size, source, and update frequency are not detailed in the available metadata. The dataset likely contains information on educational programs, resources, or outcomes related to STEM fields.

TabularK 12RoboticsStem EducationEngineering EducationAi Education+1

0 views

Education

PolyglotTeachers SFT Synth: Multilingual Supervised Fine-Tuning Examples

Synthetic supervised fine-tuning examples were generated by teacher models evaluated in the Polyglot Teachers paper. The dataset contains examples across six languages: Arabic, Czech, German, Indonesian, Japanese, Spanish, and Tagalog. It was created by ljvmiranda921 and last updated on April 5, 2026.

TextMultilingualLanguage ModelsSynthetic DataSupervised Fine TuningSynthetic+1

0 views

Education

Personality Assistant 9B Data: Mixed Persona and Tool Calling Examples

2,997 samples totaling approximately 5.8 million tokens were created by ToastyPigeon for fine-tuning the Qwen3.5-9B model. The dataset comprises 1,925 personality-based conversation examples and 1,072 tool calling examples. It was last updated on March 23, 2026.

TextConversational AiTool CallingPersonality AssistantLlm Training+1

0 views

Education

Bengaluru Real Estate Price Data for Housing Market Analysis

A dataset concerning housing prices in Bengaluru, India, compiled by AmitabhaChakraborty. The description references a study indicating property prices in the city fell by almost 5 percent in the second half of 2017. It is released under a CC0-1.0 license.

Tabular🇮🇳 IndiaTabular DataUrban PlanningBengaluruHousing PricesReal Estate+1

0 views

Education

FitBit Steps: Minute-Level Physical Activity Records from Wearable Devices

FitBit_Steps provides minute-level step counts recorded by wearable devices for multiple users. The dataset, authored by Mobius and sourced from OpenML, contains user IDs, timestamps, and step counts for precise activity tracking. It is released under a CC0-1.0 license.

TabularTime SeriesHealth MonitoringHealthcareFitnessTabular DataFitbitWearable DevicesPhysical Activity+1

0 views

Education

CPU Activity: System Performance Metrics from a Multi-User Sun Workstation

Two collection epochs of system activity data gathered every 5 seconds from a Sun Sparcstation 20/712 in a multi-user university department. The dataset contains 22 attributes measuring memory, process, and system call activity, with the goal of predicting the portion of time CPUs run in user mode. The final dataset includes an equal number of observations from each collection period.

TabularSystem MonitoringCpu ActivityMultivariateComputer HardwarePerformance MetricsComputer Performance+1

0 views

Education

student_performance_por

649 student records from two Portuguese secondary schools, collected via school reports and questionnaires. The dataset includes final year grades (G3) alongside demographic, social, and school-related features for the Portuguese language subject. It was uploaded to OpenML under a CC-BY-4.0 license.

TabularAcademic AchievementSecondary EducationStudent PerformanceHealthcareEducationDemographics+1

0 views

Education

BLM OR Stand Exam Publication Point Hub: Forest Plot Locations for Growth Modeling

STAND_EXAM_PUB_PT is a spatial dataset of Forest Stand Exam point locations recorded via the EcoSurvey application. The data, published by the Department of the Interior, is used to generate stand-level statistics and export files for growth and yield models. It was last updated on 2026-03 26.

TabularGeospatialGeospatial PointsTimber StandForest RegenerationEcosurveyWashingtonManagementOregonTimberTreesNatural ResourcesForest RestockingForest Stand ExamPacific NorthwestBiotaForest coverForestForest Management+1

0 views

Education

Chlorophyll-A and Phaeophytin Measurements from the North American Atlantic Coast

Chlorophyll-A and phaeophytin data were collected by Florida State University along the North American Atlantic coastline during three periods in 1986. The dataset likely contains position and concentration measurements in milligrams per cubic meter, with samples taken every two hours. It provides a snapshot of phytoplankton biomass and water quality for a specific region and year.

TabularTime SeriesPhytoplanktonOceanographyCoastal Water QualityChlorophyll A+1

0 views

Education

Arabian Sea CTD and Ocean Station Data from the MASAI Project

From December 1986 to August 1987, conductivity, temperature, depth, and oxygen data were collected aboard the RRS Charles Darwin in the Indian Ocean and Arabian Sea. The dataset was part of the Monsoon And Sea-Air Interaction (MASAI) project and is available in processed NODC C100 and F-022 formats. It provides high-resolution vertical profiles for studying oceanographic conditions during the monsoon period.

TabularTime SeriesOceanographyIndian OceanCtd DataOcean Station DataSea Air InteractionMonsoon Research+1

0 views

Education

Ryans Computers Product Catalog Archive

A structured archive of product catalogs from ryans.com, a major retail chain for computer hardware and electronics in Bangladesh. The dataset contains technical specifications, pricing, and descriptions. It was scraped and published by sayurio.

TabularJSONSize Categories10 Kn100 KLibrarypolarsE CommerceModalitytextBangladesh MarketLibrarymlcroissantLibrarydatasetsLibrarypandasRegionusComputer HardwareRetail Products+1

0 views

Education

Florida Reef Coral Damage Assessment After Hurricane Irma 2017

October 9-18, 2017 surveys document the impact of Hurricane Irma on the Florida Reef Tract. National Oceanic and Atmospheric Administration researchers collected coral demographic data and roving diver observations across 57 sites from Biscayne Bay to the Marquesas. The dataset includes detailed belt transect records and broad-scale photographic documentation of damage and disease.

TabularGeospatialCoral ReefHurricane ImpactsHealthcareField SurveyCoastal EcologyMarine Biology+1

0 views

Education

Psychological Resistance to Kurdistan Independence in Human and AI Responses

A 2026 study by Davin Nabizadehchianeh from Harvard Dataverse compares human and AI psychological responses to Kurdish independence. It analyzes autoethnographic data from interactions with individuals from Turkey, Iran, Iraq, and Syria, alongside responses from 37 variants across nine large language model platforms. The analysis reveals a stark contrast, with 70.27% of LLM variants supporting independence versus near-universal human resistance.

Social Sciences+1

0 views

Education

CTD Oceanographic Profiles from Gulf of Alaska 1986 Cruise

University of Alaska Institute of Marine Science collected this dataset aboard the R/V Alpha Helix during cruise HX94 from December 15 to 19, 1986. It contains high-resolution conductivity-temperature-depth (CTD) and salinity-temperature-depth (STD) profiles from 17 stations in the Gulf of Alaska and Prince William Sound. Data is processed to the NODC standard High-Resolution STD/CTD Data (F022) format.

TabularAudioTime SeriesOceanographyCtd ProfilesPhysical OceanographyGulf Of Alaska+1

0 views

Education

WFD RBMP2: Economic Analysis for Water Body Improvements in England

WFD RBMP2 Economic analysis 2015_Scenario 3 and 4_v1.10 is an impact assessment dataset for updated river basin management plans in England, created by the Environment Agency in 2016. It contains data for all water bodies in England, comparing scenarios of technically feasible improvements regardless of cost versus those where benefits exceed costs. The dataset was built from multiple data sources and is based on many assumptions.

TabularEconomic AnalysisBenchmarkWater QualityRiver BasinFinanceEnvironmentWater Resources Management+1

0 views

Education

Underwater Video and Images from Browse Basin Marine Survey

49 survey stations provide underwater video footage and still images from the Leveque Shelf in the Browse Basin, collected in May 2013. The data includes real-time onboard characterizations and USBL navigational files for each video transect. This marine survey was conducted by Geoscience Australia to assess seabed geology and CO2 storage potential.

Earth sciencesMarine DataMarinePublished ExternalHvc 144640+1

0 views

Education

London Pupil Home-to-School Distance Percentages by Borough and Phase, 2018

Greater London Authority data shows the percentage of pupils at state-funded schools who live more than 2 miles from school (for those under 8) or 3 miles from school (for those over 8). The data is derived from the DfE National Pupil Database and was used to create the GLA London Schools Atlas. The dataset was last updated on the platform on 2026-03-25.

TabularGeospatialPupil DistanceLondonSchoolsEducation FacilitiesEducationGeospatial Analysis+1

0 views

Education

Supplemental Data for a Scoping Review on Systematized Review Labels

Supplemental data for a scoping review titled 'Examining the meaning and methodological characteristics of the systematized review label'. The data includes an extraction dictionary, extracted data for each review, and citations for excluded non-English studies. It was authored by Zahra Premji and last updated on April 25, 2026.

TabularSystematic ReviewBibliographic DataResearch MethodsScoping review+1

0 views

Education

South Korean Private Tutoring Expenditures and Regional Indicators, 2010-2024

South Korean district-year panel data from the Survey on Private Tutoring Expenditures in Primary and Secondary Education, spanning 2010 to 2024. The dataset is merged with regional indicators from official statistical sources, including lagged repeater rates and university admission competition ratios. It was authored by Jieun Hong and published via Harvard Dataverse in April 2026.

TabularSouth KoreaEducation PolicyPrivate TutoringRegional Panel+1

0 views

Education

Avatars and Learning Outcomes: A Critical Literature Review on Virtual Patients

A literature review authored by David Topps, harvested from Borealis Dataverse and last updated on April 25, 2026. The work critically examines claims about the efficacy of AI-assisted avatars in medical education, specifically regarding virtual patients and learning outcomes beyond engagement.

TextMedical EducationVirtual PatientsLiterature ReviewHealthcareAvatars+1

0 views

PreviousPage 290 of 664Next