DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

Media & Communication Datasets | DataSalon

Media & Communication

News corpora, social media analysis, movie/music metadata, sports data, cultural datasets, misinformation

10,957 datasets

EPIC-KITCHENS-100: 37,455 Extracted Kitchen Activity Video Clips

37,455 video clips extracted from the EPIC-KITCHENS-100 dataset, comprising the extension not included in EPIC-KITCHENS-55. The clips are stored as MP4 files, organized by participant and narration ID, and have been downscaled and compressed. The dataset was created by lightly-ai and last updated on Hugging Face in March 2026.

Tags: Video, Multimodal, CSV, size_categories:10K<n<100K, library:polars, Action Recognition, modality:text, modality:tabular, library:mlcroissant, Kitchen Scenes, library:datasets, library:pandas, modality:video, Video Clips, license:cc-by-nc-4.0, region:us, Human Activity

Atomic Weights of the Elements: Review 2000

A review of internationally accepted standard atomic weights for the elements, published by the IUPAC Commission on Atomic Weights and Isotopic Abundances. The work summarizes the development of atomic-weight determinations in the 20th century and variations recognized in the latter part of the century. It reflects the increasing scientific interest in source-specific atomic weight differences and their impact on fundamental constants.

Tags: Tabular, Isotope, Atomic Physics, Atomic Mass, Chemical Nomenclature, Analytical Chemistry Journal, Chemistry, Physics, Natural Abundance, Environmental Chemistry, Nuclear Physics, Atomic Number

Antarctic Oceanographic CTD Data from 1995/96 Campaign

CTD data from 16 oceanographic stations in Mar de la Flota, Antarctica, collected aboard the icebreaker Almirante Irizar during the 1995/96 Summer Antarctic Campaign. The Argentine Antarctic Institute gathered the data using a Neil Brown Instrument Systems Model MKIIIb CTD profiler. The dataset was last updated in May 1996.

Tags: Tabular, Geospatial, Oceanography, Marine Science, Antarctic Research, CTD Profiles

TEMPO Gridded Cloud Fraction and Pressure (O2-O2 dimer) Near Real-Time

TEMPO satellite observations provide cloud information on a regular grid covering its field of regard. Level 3 files are derived by combining Level 2 scan cycles and contain effective cloud fraction and cloud optical centroid pressure. The near real-time product is optimized for low-latency applications, with processing completed within 2-3 hours after observation.

Tags: Geospatial, Satellite Imagery, Benchmark, Computer Vision, Cloud Properties, Near Real Time

Depression Covariate Percentages Among Older Adults in Four Districts

Percentage distributions of selected covariates associated with depression levels among older adults across four districts. The data covers factors such as social isolation, financial stress, and perceived health status. The dataset is 13.5 KB in size.

Tags: Social Isolation, E Financial, Various Risk Factors, High Perceived Stress, Stress Levels, Depression, Mental Health Problems, Poor Health Status, Perceived Social Status, Across Four Districts, Inadequate Family Relationships, Study Intends, Loneliness Influence Depression, Significantly Associated, Severe Depression, Confirmatory Factor Analysis, Social Discrimination

Film Genre Statistics

A Kaggle dataset that likely contains movie genre classifications and associated metrics. Its specific contents, size, and creation details are unknown, and its author and last-updated date are unspecified.

Tags: Tabular, Film Genre, Statistics, Entertainment

ChatGPT Google Play Store Reviews

Kaggle hosts a dataset of user reviews for the ChatGPT application from the Google Play Store. The dataset likely contains textual feedback, ratings, and metadata submitted by users. Its exact size, column structure, and collection period are unspecified.

Tags: Text, ChatGPT, App Feedback, Sentiment Analysis, User Reviews, Google Play Store

TMDB Movies Dataset

TMDB Movies Dataset is a collection of movie-related data from The Movie Database (TMDB). It was published on Kaggle, but the specific number of records, columns, and creation details are unknown. The dataset's content and structure require verification after download.

Tags: Tabular, Movies, TMDB, Entertainment

Fake News Dataset

Fake News is a dataset hosted on Kaggle. The dataset likely contains text articles or social media posts labeled for veracity. Metadata is minimal; actual content and labeling methodology require verification after download.

Tags: Text, Misinformation, Fake News, Media Content, Text Classification

Best Movies List

A list of movies curated as 'best' and published on Kaggle. The dataset's specific contents, such as titles, ratings, genres, or release years, are not detailed in the available metadata. The author, organization, and temporal coverage are unknown.

Tags: Tabular, Recommendation, Movies, Entertainment

Bangla Real and Fake News Dataset for Detection Tasks

A Bangla fake news detection dataset containing both real and fake news articles, likely as text labeled for authenticity. Its author, organization, and specific size are unknown.

Tags: Text, News Classification, Fake News, Bangla Language, Text Data

EGU26 Compression SC2.5: Materials for a Scientific Session

Materials for the EGU26 SC2.5 session, published by climet-eu on GitHub under a CC-BY-4.0 license. The repository was last updated on 2026-05-12. The specific content and scale of the materials are not detailed in the available metadata.

Tags: Multimodal, Climate, SC2.5, Materials, EGU26

NOAA Milford Microalgal Culture Collection Transfer and Test Records

The Milford Microalgal Culture Collection, maintained by NOAA under the U.S. Department of Commerce, holds over 200 live cultures representing 13 classes of algae. Cultures are maintained in three different media and undergo routine transfers, with testing for bacterial and mold contaminants four times a year. Transfer dates and test results are recorded, and the dataset was last updated in March 2026.

Tags: Tabular, Excel, Aquaculture, US Department of Commerce, Milford Laboratory, NOAA, Aquaculture Sustainability Branch (ASB), National Marine Fisheries Service, Biology, Cultures, Northeast Fisheries Science Center, Laboratory Data, Culture Collection, Continuous, Microalgae, DOC/NOAA/NMFS/NEFSC

Replication Data for Superconducting Gap and Electron-Boson Coupling in Nickelate Films

Replication data for the paper 'Nodeless superconducting gap and electron-boson coupling in (La,Pr,Sm)3Ni2O7 films'. Jianchang Shen deposited this dataset on Harvard Dataverse in May 2026. The data likely contains experimental measurements supporting the findings on superconductivity in rare-earth nickelate films.

Tags: Tabular, Condensed Matter Physics, Nickelates, Superconductivity

MPRA Expression Analysis: 5'UTR Activity in HuH7 and HepG2 Cells

32.7 KB of expression ratios for candidate regulatory sequences (CRS) in two human liver cell lines. The data, authored by Kyle Leix and last updated in March 2026, quantifies activity by comparing cDNA to genomic DNA barcode counts across five independent replicates. Ratios are normalized to negative control sequences for each sample.

Tags: Tabular, Gene Expression, LDLR Cis, Sterol Response Elements, Interrogate Candidate CREs, Important Role, Gene Expression Plays, Well Characterized, Healthcare, Insertion Upstream, MPRA, Disruption Reduced Endogenous, Exhibits Biochemical Hallmarks, Significant Impact, Regulatory Elements, Noncoding Sequences, Native Genomic Context, Cis, 129 bp Interval, Cell Line, Particularly Strong Activity, LDLR, Common Genetic Variant, Wide Association Studies

E. coli Lipidomics Dataset for Tox-Act1 Wild-Type and Mutant Strains

A full dataset of identified lipids from E. coli bacteria expressing either the wild-type Tox-Act1 protein or its C151A mutant variant. The data was generated using LC-MS/MS methods and includes phospholipids and lysophospholipids analyzed in negative mode. Author Gianlucca G. Nicastro published the dataset on figshare under a CC-BY-4.0 license, with a last update in March 2026.

Tags: Tabular, Excel, Widespread Phenomenon, E. coli, Bacteria Deploy Toxins, Immunity Gene Acquisition, Targeting Enzymes, Intoxicated Cells Showed, Salmonella, Lipidomics, Mouse Gut Colonization, Permuted NlpC, Study Broadens, Kill Competitors, Evolutionary Analysis Revealed, Systematic Identification, Targeting NlpC, Bacterial Toxins, Toxin Domain Diversity, Depth Characterization, First Direct Characterization, Silico, P60 Papain, Targeting Activities

Recycled Aggregate Concrete Compressive Strength with 14 Mix Design Inputs

Sandeerah Choudhary published this dataset on figshare in March 2026. It contains 14 input features related to the mix design and aggregate properties of recycled aggregate concrete. The dataset is stored in an XLSX file sized at 46.3 KB.

Tags: Tabular, Excel, Material Properties, Civil Engineering, Compressive Strength, Mix Design, Recycled Aggregate Concrete

Global Wilderness Areas Inventory from 1989 Reconnaissance Study

The World Wilderness Areas dataset is a reconnaissance-level inventory of undeveloped land shaped primarily by natural forces. It was created by the Sierra Club and the World Bank's Center for Earth Resource Analysis, integrated by UNEP/GRID, and published in 1989. The data covers approximately 50 million square kilometers, representing about one-third of the Earth's land surface, and includes 1089 polygons in ARC/INFO vector format.

Tags: Geospatial, Environmental Science, Global Coverage, Wilderness Conservation, Land Use, Finance, Large Scale

Historical Northern Hemisphere Cyclone Pressure Center Tracks 1899-1939

Northern Hemisphere maps document the trajectories of all atmospheric pressure centers from January 1899 to June 1939. The data was encoded from a 40-year map series and archived by the National Climatic Data Center. It includes separate decade-by-decade track maps for low and high pressure systems.

Tags: Time Series, Geospatial, Historical Weather, Cyclone Tracks, Pressure Centers, Northern Hemisphere

Russian Podcast Audio and Transcripts Generated by NotebookLM

Russian-language podcast recordings generated by Google NotebookLM. The dataset contains 417 episodes totaling 77 hours, 23 minutes, and 22 seconds of audio in WAV format at 24 kHz. It was uploaded by AigizK to Hugging Face and last updated on March 26, 2026.

Tags: Text, Audio, Russian, OPTIMIZED-PARQUET, Parquet, Text To Speech, task_categories:text-to-speech, library:polars, library:dask, Russian Language, Speech Synthesis, size_categories:n<1K, modality:text, library:mlcroissant, Podcast, library:datasets, Emotional Speech, license:cc-by-nc-4.0, region:us, task_categories:automatic-speech-recognition, language:ru, Automatic Speech Recognition, NotebookLM
