DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

Media & Communication Datasets | DataSalon

All Categories

📺

Media & Communication

News corpora, social media analysis, movie/music metadata, sports data, cultural datasets, misinformation

11,003 datasets

News Articles Collection

A dataset titled 'news.csv 3' published on Kaggle. The title suggests it contains news-related information, likely in a tabular format. No further details on size, origin, or specific content are available from the provided metadata.

TabularNewsMediaText Data+1

0 views

Media & Communication

NFL Team Instagram Relationship Management Strategies

A dataset from a study exploring the NFL's use of Instagram to manage fan relationships. It examines three teams and their application of six relationship cultivation strategies derived from relationship management theory and marketing literature. The dataset was authored by Emily Jones and last updated in March 2026.

Relationships CommunicationsSocial MediaRelationship Management StrategiesRelationship managementRelationship Marketing RmNFLInstagram+1

0 views

Media & Communication

Synthetic Dataset for Incompressible 2D Fluids

A synthetic dataset for incompressible 2D fluids, published on Kaggle. The dataset likely contains simulated data relevant to computational fluid dynamics. Specific details on size, columns, and creation method are unavailable from the provided metadata.

TabularComputational Fluid DynamicsPhysics SimulationFluid dynamicsSynthetic Data+1

0 views

Media & Communication

Amazon Reviews: Customer Feedback Dataset

Amazon Reviews likely contains customer feedback text posted on the Amazon marketplace. The dataset is hosted on Kaggle, but its specific size, creation date, and authorship are unknown. Columns and content details require verification after download.

TextCustomer FeedbackText DataAmazonProduct Reviews+1

0 views

Media & Communication

Documented Fake News Site URLs

A list of URLs for websites documented as containing fake news by fact-checking organizations. The dataset was compiled by researcher Joakim Jansson and was last updated in April 2026. The specific size and number of sites are not provided.

TabularFact CheckingFake NewsUrl ListsMedia Reliability+1

0 views

Media & Communication

Historical Analysis of Colonial North American Cultural Contest, 150-Year Period

James Axtell's historical analysis examines the three-sided struggle for empire in colonial North America over a period of nearly 150 years. The work contrasts English and French colonial strategies regarding native allies and religious conversion, framing the conflict as a contest of cultures.

TextHistoryEcologyColonialismArchaeologyBiologyLawEthnologyAncient HistoryCultural CompetitionEmpireNative American StudiesCompetition BiologyPolitical ScienceContestColonial History+1

0 views

Media & Communication

Navigating Cross-Pressures: 4,785 Latino Political Behavior Records

4,785 records from the 2006 Latino National Survey processed by Jessala Grijalva to analyze how acculturation orientations predict political behavior. The data includes four bidimensional categories—Culture Affirming, Assimilationist, Bicultural, and Demicultural—derived through Gaussian Mixture Model clustering. It contains the processed data and R/Quarto code required to replicate analysis of Latino ideology, party identification, and immigration attitudes.

Latino PoliticsCross PressuresSocial SciencesBidirectional Acculturation ModelAcculturationImmigration AttitudesPolitical Behavior+1

0 views

Media & Communication

Supporting Healthy Communities through Sports and Recreation Programs

Vicki-Ann Ware at the Australian Institute of Health and Welfare authored a literature review summarizing evidence on the benefits of sports and recreation programs for Aboriginal and Torres Strait Islander communities. The review synthesizes findings from critical program descriptions and systematic reviews, noting improvements in areas like school retention, health, and social cohesion. It also identifies gaps in the literature regarding causal links, barriers to participation, and program suitability for different demographics.

TextGeospatial🇦🇺 AustraliaSports ParticipationEcologySocial OutcomesBiologyRecreationHealthcareGeographyFinanceAboriginal HealthPublic Health+1

0 views

Media & Communication

Review Checkpoints Data from May 2026

Kaggle dataset titled 'review-chekpoints--2026-05-25--13264-13264'. The title suggests it contains review-related data, possibly involving checkpoints. The dataset is hosted on Kaggle, but no further metadata is available.

TextReview AnalysisCheckpointsText Data+1

0 views

Media & Communication

Turkish Technology News Articles from ShiftDelete.Net

A structured collection of Turkish-language technology news articles sourced from the ShiftDelete.Net website. The dataset is intended for natural language processing, machine learning, and artificial intelligence research. The author, organization, and specific scale of the collection are unknown.

TextTurkish LanguageNlp ResearchNatural Language ProcessingTechnology NewsText Corpus+1

0 views

Media & Communication

Ahmed Asya Podcast Episodes

ahmed_asya_podcast is a dataset published on Kaggle. The title suggests it contains audio or transcript data from a podcast series. The dataset's specific contents, size, and origin are not detailed in the available metadata.

TextAudioMedia AnalysisAudio ContentPodcast+1

0 views

Media & Communication

Preaching Conservative Ideas: 7 Tolerance Experiments in Norway, Sweden, and Germany

Seven classical tolerance experiments conducted in Norway, Sweden, and Germany between 2020 and 2022 by Lise Bjånesøy. The data captures non-Muslim responses to Muslims exercising freedom of assembly to preach conservative religious values. It includes survey responses and background characteristics alongside Stata replication code.

Social SciencesSpeech ActsValue ConflictReligionSurvey ExperimentsCivil LibertiesTolerance Experiments+1

0 views

Media & Communication

Los Angeles Building Permits Issued Since 2020

City of Los Angeles Department of Building and Safety provides data on permits issued for construction, remodeling, and repair. Permits are categorized into building, electrical, and mechanical types, with issuance processes ranging from same-day Express Permits to those requiring plan review. The dataset includes records from 2020 to the present.

Building Permit+1

0 views

Media & Communication

University-Level Math Reasoning Problems With Grading Rubrics

BakeAI's preview dataset contains 50 challenging university-level mathematics reasoning problems. Each problem includes a detailed reference solution, a structured grading rubric, and an anonymized model evaluation result.

JSONTask Categoriestext GenerationLibrarypolarsTask Categoriesquestion AnsweringLanguageenSize Categoriesn1 KModalitytextLibrarymlcroissantLibrarydatasetsLibrarypandasEducationRegionusReasoningReinforcement LearningMathLicenseapache 20Reward Model+1

0 views

Media & Communication

News Image Sentiment Dataset for Visual Story Analysis

A multimodal dataset likely containing news images paired with sentiment labels. The description suggests it is designed for exploring whether models can interpret narratives from visual content alone. The dataset originates from Kaggle, but its size, author, and specific creation details are unknown.

MultimodalNewsSentiment AnalysisComputer VisionNatural Language ProcessingMedia+1

0 views

Media & Communication

Twitter BBC 20NG Topic: Text Data for Topic Classification

A text dataset likely containing content from Twitter, BBC articles, and the 20 Newsgroups corpus for topic classification tasks. It was published on Kaggle, but the author, organization, and specific collection details are unknown. The original creation date and last update are not provided.

TextNews ArticlesSocial MediaText ClassificationTopic Modeling+1

0 views

Media & Communication

Yoruba Cultural Reasoning Blind Spots in Frontier Models

10 diverse examples demonstrate 'generative collapse' and 'Cultural Hallucination' in frontier base models evaluating Yoruba proverbs. The dataset was created by author 'saaga' and last updated on February 23, 2026. It captures model 'blind spots' for non-Western abstract reasoning.

TextModel BlindspotsProverbsCultural ReasoningNlp EvaluationBenchmarkYoruba Language+1

0 views

Media & Communication

Misinformation Commands Attention: An English-Tagalog dataset of political discussions on

A collection of four annotated datasets submitted to ICWSM'25, created by Marc Riven Herrera and hosted on Harvard Dataverse. It analyzes political and non-political content from Philippine Facebook pages, providing insights into user engagement, sentiment, civility, and misinformation. The datasets support applications in social media analysis, political communication, and machine learning.

TextSocial SciencesComputer and Information ScienceMisinformationNatural Language ProcessingFacebookPolitical CommunicationSocial Media Analysis+1

0 views

Media & Communication

Bollywood Movies Dataset

Bollywood movies dataset published on Kaggle. The dataset likely contains information about films from the Hindi-language film industry. Metadata is minimal; actual content requires verification after download.

TabularMoviesBollywoodEntertainment+1

0 views

Media & Communication

Movie Recommender Data

A dataset titled 'movie_recommender.csv' sourced from Kaggle. The title suggests it contains data for building or testing movie recommendation systems. No further metadata is available to confirm its specific contents, size, or origin.

TabularMovie RecommendationCollaborative FilteringEntertainment+1

0 views

PreviousPage 344 of 550Next