DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

Media & Communication Datasets | DataSalon

Media & Communication

News corpora, social media analysis, movie/music metadata, sports data, cultural datasets, misinformation

10,984 datasets

Archaeological Linen Textile Analysis Data

Data from a textile analysis of archaeological linen textiles. The dataset is authored by Payton Becker and was last updated in March 2026. It is a small dataset of 17.8 KB with an unknown number of rows and columns.

Egypt, Textile Archive, Colonialism History

LSIP Sector Skills Analysis: London Labour Market Evidence (2025–26)

December 2025 slide decks summarising sector-level evidence for the 2025–26 refresh of London's Local Skills Improvement Plan (LSIP). These early resources reflect data and stakeholder input available at the time, with a fully reviewed evidence base to follow. The materials were produced by the Greater London Authority.

Text, Skills, Jobs, London, Labour Market, Sector Analysis, Employment

UK Fixed Broadband Speeds from Major Internet Service Providers

Ofcom annual reports detail the performance of the UK's fixed broadband, mobile, WiFi, digital television, and radio networks. The data is gathered from major fixed broadband Internet Service Providers, including BT, KCOM, Sky, TalkTalk, and Virgin Media, covering both retail and wholesale services. The dataset was last updated on March 25, 2026.

Tabular, 🇬🇧 United Kingdom, Broadband, Telecommunications, Internet Infrastructure, ISP

Tech News Daily: Aggregated Technology News Articles

Tech News Daily is a dataset published on HuggingFace by author shaurya03. The title suggests it likely contains daily technology news articles or headlines. The dataset was last updated on 2026-04-18 17:29:00.

Text, Tech News, Daily Updates, Text Data

Review Checkpoints: Machine Learning Model Evaluation Data

review-chekpoints--2026-06-05--13275-13275 is a dataset hosted on Kaggle. The title suggests it contains information related to checkpoints, likely for evaluating or reviewing machine learning models. No further metadata, such as author, size, or column descriptions, is available.

Tabular, Machine Learning, Model Evaluation, Review Checkpoints

Media Coverage of the Vietnam War from 1961-1973

Daniel C. Hallin's study analyzes the complete New York Times corpus from 1961 to 1965 and a sample of hundreds of television reports from 1965 to 1973. The work draws on interviews with journalists and Defense Department footage to critique conventional wisdom about media influence. It provides a detailed account of what Americans read and watched about the Vietnam War.

Text, History, Media Studies, Television Analysis, Elite, World War II, Newspaper Analysis, Adversary, Law, Vietnam War, Foreign Policy, Government Linguistics, Sociology, Political Science, Politics, Computer Security

Khmer ASR Cultural: 134 Hours of Curated Cambodian Speech

134.6 hours of manually curated Khmer speech-text pairs focusing on Cambodian cultural topics, produced by DDD-Cambodia. The collection features recordings from 8 native speakers with an average duration of 8.54 seconds per utterance and includes detailed speaker metadata.

Parquet, Size Categories: 10K<n<100K, Library: polars, arXiv: 2406.10118, Library: dask, Modality: audio, Modality: text, Language: km, Library: mlcroissant, License: CC BY-SA 4.0, Library: datasets, Region: us, Task Categories: automatic speech recognition, Task Categories: text classification

TMDB Movies: Film Metadata Collection

TMDB.MOVIES.CSV is a dataset from Kaggle, likely containing information about movies sourced from The Movie Database. The specific number of records, columns, and data collection timeframe are unknown. The dataset's author and organization are not provided.

Tabular, Movies, TMDB, Entertainment

Movies

Movies is a dataset published on Kaggle. Its specific contents, size, and origin are not detailed in the provided metadata. The dataset likely contains information related to films, such as titles, genres, or ratings.

Tabular, Movies, Media, Entertainment

Olympic Medal Prediction Analysis with Coaching Impact Models

You Liu's dataset on figshare analyzes the allocation and impact of great coaches on Olympic medal predictions. The 5.5 KB XLS file contains methodological analysis related to coaching systems like the Brazilian Swimming Team and Chinese Diving Team. It was last updated in March 2026.

Time Attenuation Rule, Dynamic Influence Mechanism, Improved CUSUM Algorithm, Traditional Methods Ignoring, Systematic Coaching System, Decay Model, Brazilian Swimming Team, AI Tactical Optimization, Point Driven Difference, Dynamic Confounding Bias, Competition Size Stability, Dynamic Causal Inference, Empirical Studies Show, Exponential Decay Function, Dual Test Mechanism, Zhou Jihong Coaching, Chinese Diving Team, Dynamic Difference Method

Olympic Coaching Allocation and Impact Analysis

You Liu's research dataset analyzes the allocation and impact of great coaches on Olympic performance. The dataset is a 5.5 KB XLS file. It employs methods such as the Improved CUSUM Algorithm and the Decay Model for analysis.

Most Popular Movies

Most Popular Movies is a dataset hosted on Kaggle. Its specific contents, such as the number of records, columns, and time period covered, are not detailed in the available metadata. The dataset likely contains information related to movie popularity metrics.

Tabular, Popularity, Movies, Entertainment

Wine Reviews Dataset

Kaggle hosts a dataset of wine reviews. The dataset's specific size, origin, and content are not detailed in the provided metadata. Its actual scope and quality require verification after download.

Tabular, Reviews, Wine, Consumer Feedback

Data File 052825: Btof and Family Firm Metrics

Vincent Barker's 182 KB dataset, shared under a CC-BY-4.0 license, supports a paper currently under review. Its title and tags suggest it contains metrics related to Btof and family firms, likely in a tabular format. Metadata is minimal; actual content requires verification after download.

Tabular, CSV, Btof, Family Firms, Corporate Finance, Firm Performance

Movies 5000: A Collection of Film Data

Movies_5000 is a dataset published on Kaggle. The title suggests it contains information related to films, likely including titles and other attributes. The dataset's specific content, size, and origin require verification after download.

Tabular, Movies, Film, Entertainment

Movies Dataset from Kaggle

A dataset titled 'movies' published on the Kaggle platform. The specific content, size, and origin are unknown from the provided metadata. Further details such as columns, sample data, and license require verification after download.

Tabular, Movies, Media, Entertainment

DigiFakeAV: Synthetic and Real Video Clips with Audio

The DigiFakeAV dataset contains synthetic and real video clips, with video frames stored as raw numpy arrays and audio provided in uncompressed WAV format. It was created by author 'cambrain' and last updated on March 12, 2026. The dataset is part of a project focused on audio-visual media.

Audio, Video, Audio Visual, Deepfake Detection, Synthetic Video, Multimedia, Synthetic

Movies Recommendation Dataset for Collaborative Filtering

Kaggle hosts a dataset focused on movie recommendations. The dataset likely contains user-movie interaction data, such as ratings or watch history, suitable for building recommendation systems. Its specific scale, origin, and creation date are not detailed in the provided metadata.

Tabular, Movie Recommendation, Collaborative Filtering, Entertainment

Depression Counsellor Dataset

A dataset hosted on Kaggle by user Harshit76. The dataset likely contains text data related to depression counseling. Its specific contents, size, and creation details are not provided in the available metadata.

Text, Mental Health, Counseling, Depression

Movies Dataset from Kaggle

Movies dataset is a collection of data related to films, published on the Kaggle platform. The specific contents, scale, and creation details are unknown from the provided metadata. Users must download the dataset to verify its actual scope and features.

Tabular, Movies, Media, Entertainment
