DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

Media & Communication Datasets | DataSalon

All Categories

📺

Media & Communication

News corpora, social media analysis, movie/music metadata, sports data, cultural datasets, misinformation

11,019 datasets

Movie Ratings Dataset

A dataset of movie ratings, likely sourced from a user community. The dataset is hosted on Kaggle, a platform for data science and machine learning projects. Specific details such as the number of records, time period, and contributing organization are not provided.

TabularRatingsMoviesEntertainment+1

0 views

Media & Communication

Cultural Games Benchmark for AI Evaluation

Kaggle hosts the Cultural Games Benchmark dataset. The dataset likely contains performance metrics or tasks related to games from various cultures. Its specific contents, such as column names and data volume, require verification after download.

TabularAi EvaluationBenchmarkCultural Games+1

0 views

Media & Communication

Indonesian TikTok App Reviews from 2017 to 2026 for NLP

Four million Indonesian-language reviews for the TikTok mobile application, collected from the Google Play Store between 2017 and 2026. The dataset is intended for natural language processing and sentiment analysis tasks. Its author, organization, and specific license are not provided in the input metadata.

TextApp ReviewsSocial MediaSentiment AnalysisLarge ScaleNatural Language Processing+1

0 views

Media & Communication

TRNEWS-2025: Turkish News Articles Dataset

TRNEWS-2025 is a dataset of Turkish news articles published on Kaggle. The dataset's specific size, source, and collection period are not detailed in the available metadata. Its content and structure require verification after download.

TextNewsLanguage DataText Corpus+1

0 views

Media & Communication

Mindmovieai: AI-Generated Text Content

A text dataset published on Hugging Face by author dev1601, with a last recorded update in April 2026. The dataset's title and platform tags suggest it contains content generated by or for creative artificial intelligence processes. The specific volume, source, and detailed structure of the data are not provided in the available metadata.

TextText GenerationModalitytextRegionusCreative AiLicensemitAi Generated Content+1

0 views

Media & Communication

Enjoysports: Sports Activity Preferences

Kaggle hosts a dataset titled 'enjoysports'. The dataset's specific content, size, and creation details are not provided in the available metadata. Its title suggests it relates to sports or recreational activities.

TabularRecreationActivity ClassificationSports+1

0 views

Media & Communication

Synthetic Thai E-Book Platform Data for Recommendation Systems

A synthetic dataset simulating user interactions on a Thai-language e-book platform, intended for building and benchmarking recommendation algorithms. The dataset's creator, size, and specific temporal coverage are not provided. It is hosted on the Kaggle platform.

TabularBook RecommendationE CommerceSynthetic DataThai LanguageSynthetic+1

0 views

Media & Communication

Anime Data from Anime News Network and MyAnimeList, 1563 Titles

1563 anime titles with associated metadata were collected from Anime News Network and MyAnimeList. The dataset includes features such as title, type, and number of episodes. It is shared under a CC-BY-NC-SA-4.0 license.

TabularJapanese CultureAnimeMediaEntertainment+1

0 views

Media & Communication

Short-Form Videos from TikTok and YouTube Collected in the Philippines

January to May 2025 saw the collection of 2,000 publicly available short-form videos from TikTok and YouTube. The dataset focuses on content from the Philippines and is intended for analyzing video categories. The author and license information are not provided.

VideoSocial MediaContent ClassificationVideo Content+1

0 views

Media & Communication

MM 2024/W16: Best Performing Marvel Movies

A dataset from Kaggle concerning the performance of Marvel movies. The title suggests it contains rankings or metrics for films released up to week 16 of 2024. The specific columns, data volume, and collection methodology are unknown.

TabularPerformanceMoviesMarvelEntertainment+1

0 views

Media & Communication

Khmer Entertainment Translations for Language Model Development

SeyhaLite curated and cleaned this dataset to support high-quality Khmer Language Models and translation systems. The data provides information about entertainment, media, and arts. The dataset page was last updated on February 13, 2026.

TextTranslationKhmer LanguageComputer VisionLanguage ModelEntertainment+1

0 views

Media & Communication

IMDB Movie Data with Ratings and Revenue, 1972-2019

IMDB_movie_1972-2019 contains information for 5,834 movies scraped from the IMDB website. The data was preprocessed and cleaned for machine learning applications, such as building a recommendation model. It is shared under a CC0 1.0 license.

TabularFilm RatingsMoviesImdbEntertainment+1

0 views

Media & Communication

Review Checkpoints: Text Data for Analysis

review-chekpoints--2026-05-10--13249-13249 is a dataset published on Kaggle. The title suggests it likely contains review text data, possibly with associated checkpoints or labels. Metadata is minimal; actual content requires verification after download.

TextText AnalysisReviewsCheckpoints+1

0 views

Media & Communication

Complete Filmography from IMDb

IMDb filmography data aggregated on Kaggle. The dataset likely contains records for movies and television shows, including titles, cast, crew, and ratings. Its specific size, columns, and update date are not provided in the available metadata.

TabularMoviesFilmographyImdbEntertainment+1

0 views

Media & Communication

Wikipedia Movie Metadata from 2016 to 2026

Wikipedia Movies Dataset 2016-2026 contains 10 years of movie metadata from Wikipedia. The data likely includes titles, descriptions, and release dates. The dataset's author, organization, and exact size are unknown.

TextTabularMoviesWikipediaFilm Industry+1

0 views

Media & Communication

Ask Boris: Twitter Session Metrics for London Mayor Public Engagement

Monthly Tweetreach reports track the online reach and activity of the Mayor of London's 'Ask Boris' Twitter sessions. Each report includes metrics like unique reach, total impressions, and tweet volume for sessions monitored via the hashtag #askboris. The data is produced by the Greater London Authority, with the last recorded metadata update in March 2026.

TabularConsultationTweetreachTwitterSocial MediaPublic engagementSocial Media MonitoringPolitical CommunicationUk GovernmentSynthetic+1

0 views

Media & Communication

The Old Christian Right: The Protestant Far Right from the Great Depression to the Cold Wa

This text-based work examines Protestant far-right opposition to internationalism in the United States from the Great Depression through the Cold War. It analyzes theological perspectives from denominations including Dispensationalists, Calvinists, and Lutherans, and their political engagement concerning bodies like the League of Nations. The source is a scholarly monograph from the paperswithcode platform.

TextDissenting OpinionKingdomCold WarLawReligious StudiesInternationalismProtestantismPhilosophyInternationalism PoliticsPolitical TheologyTheologyLeaguePolitical ScienceEvangelismPolitics+1

0 views

Media & Communication

Research on Culture Change in Closed Environments for Human Rights

An article analyzing processes for positive culture change in closed environments, drawing on organizational theory and expert interviews. The work is sourced from the Association for the Prevention of Torture (APT) and existing research in the field. The temporal coverage and specific data volume are not provided.

TextClosed EnvironmentsHuman RightsEpistemologyEnvironmental EthicsMathematicsOrganizational CultureCulture ChangeLawSocial ScienceMagic TelescopeTorturePhilosophyPublic RelationsSociologyPolitical ScienceDignityTorture PreventionField Mathematics+1

0 views

Media & Communication

Nollywood: History and Impact of the Nigerian Video Film Industry

Nigeria's Nollywood video film industry is the subject of this collection of essays and analyses. The content likely includes historical perspectives, market statistics, and discussions on censorship, distribution, and cultural impact across Africa. The source is paperswithcode, but the original author, organization, and specific data format are unknown.

TextConquestHausaTelecommunicationsHistoryMedia StudiesEngineeringMovie TheaterState Computer ScienceBandwagon EffectLawAdvertisingAncient HistoryAfrican CinemaNollywoodDigital VideoBusinessNigeriaSociologyArtFilm IndustryPolitical ScienceVisual ArtsCensorship+1

0 views

Media & Communication

Medical Chinese News Articles

Medical_chinese_news is a dataset of news articles in Chinese, published on Kaggle. The dataset's content likely pertains to medical and health topics, though specific details such as article count, source, and time period are not provided. Further verification after download is required to confirm its scope and structure.

TextHealth CommunicationMedical NewsChinese LanguageText Data+1

0 views

PreviousPage 370 of 551Next