DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

Media & Communication Datasets | DataSalon

All Categories

📺

Media & Communication

News corpora, social media analysis, movie/music metadata, sports data, cultural datasets, misinformation

11,040 datasets

Erosion Hazard Zone Buffers for Austin Streams

100-foot buffers of stream centerlines with drainage areas of 64 acres or greater, created by the City of Austin. The dataset includes locations of structures, trees, and other features of concern threatened by erosion.

Erosion Hazard ZoneErosionTreesAgoltosocrataThreatenedCreekStructuresConcernBufferLocations+1

0 views

Media & Communication

Moroccan Public Opinion on AI from Reddit Discussions

Moroccan discussions and public opinion about artificial intelligence were collected from Reddit. The dataset appears to contain real-world text data from social media posts and comments. The author, organization, and specific collection date are unknown.

TextSocial MediaArtificial IntelligencePublic Opinion+1

0 views

Media & Communication

Political Science Under Pressure: 20-Year Disciplinary Replication Data

This replication package provides the data and code documenting academic competition and collaboration trends in political science from 2003 to 2023. Authored by Carolina Torreblanca for Perspectives on Politics, it contains the necessary materials to reproduce all figures and tables from the associated study.

Social SciencesAcademic Competition Knowledge Production Discipli+1

0 views

Media & Communication

Lessons from the Great Depression: An Integrated Economic History of Four Nations

Peter Temin's integrated analysis of the Great Depression covers the economic experience in Britain, France, Germany, and the United States. The work describes the causes, widespread and prolonged nature, and eventual recovery from the depression. It also draws parallels to more recent economic policies, such as those of the early 1980s and the Reagan administration.

TextDevelopment EconomicsHistoryAdministration Probate LawPolitical EconomyMacroeconomicsDeflationDepression EconomicsParallelsLawGreat DepressionEconomicsComparative AnalysisMonetary PolicyEconomic PolicyFinanceGovernment LinguisticsEconomic HistoryKeynesian EconomicsPolitical ScienceCarry InvestmentOperations ManagementSynthetic+1

0 views

Media & Communication

Intercultural Communication Role-Playing Simulation Exercises

Edward C. Stewart designed training to increase cultural awareness through simulated role-playing exercises. The report describes the development of paper-and-pencil tests as interim estimates of training objectives and presents preliminary data on efficacy. Possible variations in training format are suggested.

TextComputer SciencePsychologyRole PlayingIntercultural CommunicationComputer VisionCommunicationCultural AwarenessTraining SimulationSynthetic+1

0 views

Media & Communication

U.S. Arts and Cultural Participation Survey Data, 1997

A national survey dataset on arts and cultural participation, likely containing responses on attendance, engagement, and demographics. The data originates from a 1997 initiative by the Wallace-Reader's Digest Funds to encourage community foundations to invest in broadening cultural engagement. The dataset author is Chris Walker and it is hosted on the paperswithcode platform.

TabularArts ParticipationElectronic Dance MusicMedia StudiesSurvey DataAttendanceEvent AttendanceDanceSociologyArtPolitical ScienceEvent Particle PhysicsCULTURAL STUDIESVisual ArtsThe Arts+1

0 views

Media & Communication

Arts and Culture Indicators Framework for Community Building

Urban Institute's Arts and Culture Indicators in Community Building Project presents a conceptual framework for measurement. The work, authored by Maria-Rosario Jackson, outlines guiding principles for quantifying culture's role in communities. It is hosted on the paperswithcode platform, which aggregates resources for data science and AI practitioners.

TabularEnvironmental scienceComputer ScienceData ScienceSocial ScienceCommunity IndicatorsGeographyUrban Studies+1

0 views

Media & Communication

AG News New Non: News Article Dataset for Text Classification

AG News New Non is a text dataset hosted on Kaggle. The dataset likely contains news articles, as suggested by its title. Its specific content, size, and origin require verification after download due to minimal provided metadata.

TextNewsText ClassificationNatural Language Processing+1

0 views

Media & Communication

A study of personal and cultural values: American, Japanese and Vietnamese

Roy G. D’Andrade's study compares personal and cultural values across American, Japanese, and Vietnamese populations. The dataset likely contains questionnaire responses organized around a conceptual framework for analyzing institutionalized values. The data source is paperswithcode, but the license is closed.

TabularCultural ValuesSurveyPsychologyVietnameseSocial SciencePhilosophySociologyConceptual FrameworkLinguisticsGender StudiesCross Cultural+1

0 views

Media & Communication

TMDB 5000 Movies: Cleaned Film Metadata

Kaggle hosts a dataset titled 'TMDB 5000 Movies - Cleaned Dataset'. The dataset likely contains metadata for approximately 5,000 films sourced from The Movie Database (TMDB). The cleaning process suggests it has been processed for consistency, but the specific columns, time range, and original author are unknown.

TabularMoviesTmdbFilm IndustryEntertainment Data+1

0 views

Media & Communication

TMDB Movie Data for Pandas and NumPy Cleaning Exercises

Movie data from The Movie Database (TMDB) intended for data cleaning practice. The dataset is hosted on Kaggle, but its specific size, row count, and column details are unknown. The author, organization, and last update date are also unspecified.

TabularMovie DataNumpyTmdbPandasData Cleaning+1

0 views

Media & Communication

Top Rated Movies on TMDB as of April 2026

TMDB user ratings provide a list of top-rated movies. The dataset likely contains movie titles, ratings, and other metadata. Its origin and specific contents are not detailed in the provided metadata.

TabularRatingsMoviesTmdbEntertainment+1

0 views

Media & Communication

SportSense-Fatigue: EEG, ECG, and IMU Signals for Mental Fatigue Prediction

A dataset containing physiological and motion signals for modeling mental fatigue. It includes EEG, ECG, and IMU data, likely collected from participants during physical or cognitive tasks. The dataset is intended for use in federated learning models, suggesting a structure designed for distributed training across multiple clients.

Time SeriesMultimodalMental FatigueImu SignalsEeg SignalsEcg SignalsFederated Learning+1

0 views

Media & Communication

Twitter Data from Kaggle

Twitter data sourced from Kaggle. The dataset likely contains text-based content from the social media platform. Metadata is minimal; specifics such as size, columns, and license are unknown.

TextTwitterSocial MediaText Data+1

0 views

Media & Communication

IMDB Popular Movies, TV Shows, and Web Series with Ratings and Genre Data

IMDB Popular Movies, TV Shows, Web Series - 2026 is a dataset from Kaggle. It contains information on movies, TV shows, and web series, including ratings, votes, and genre data. The author, organization, and specific temporal coverage are not provided in the input metadata.

TabularRatingsWeb SeriesMoviesTv ShowsGenres+1

0 views

Media & Communication

Psychological Prisoners: FoMO and Cyberloafing in Professional Settings

Abdullah ALDEMİR published this social science dataset in 2026 to examine the non-linear relationships between Fear of Missing Out (FoMO), cyberloafing, and social media addiction. The data focuses on identifying behavioral threshold effects and digital dependency tipping points specifically within professional environments.

Social Sciences+1

0 views

Media & Communication

Indian News Articles with Author and Publication Date Metadata

Indian-News-Articles is a collection of news content created due to a lack of available Indian news data on Kaggle. The dataset includes article text along with metadata such as author, link, and publish date, scraped from firstpost.com using Python and BeautifulSoup. The license is Database: Open Database, Contents: Original Authors.

TextNews ArticlesNatural Language ProcessingIndian MediaText Corpus+1

0 views

Media & Communication

ReviewBeauty2018: Beauty Product Reviews

ReviewBeauty2018 is a dataset of user reviews for beauty products, sourced from Kaggle. The dataset likely contains textual feedback and ratings from consumers. Its specific content and scale require verification after download.

TextSentiment AnalysisBeautyConsumer behaviorProduct Reviews+1

0 views

Media & Communication

News Articles Collection

News_df is a dataset of news articles published on Kaggle. The dataset's specific source, size, and time period are not detailed in the available metadata. Further inspection after download is required to determine its exact scope and content.

TextNewsMedia ContentText Data+1

0 views

Media & Communication

Review Checkpoints: Model Evaluation Data for Machine Learning

A dataset titled 'review-chekpoints--2026-04-28--13237-13237' published on Kaggle. The platform tags suggest it relates to model evaluation and review checkpoints, likely containing tabular data for assessing machine learning models. The author, organization, and specific data characteristics are unknown.

TabularMachine LearningModel EvaluationReview Checkpoints+1

0 views

PreviousPage 394 of 552Next