DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

Media & Communication Datasets | DataSalon

Media & Communication

News corpora, social media analysis, movie/music metadata, sports data, cultural datasets, misinformation

11,003 datasets

Misinformation Commands Attention: An English-Tagalog dataset of political discussions on

A collection of four annotated datasets submitted to ICWSM'25, created by Marc Riven Herrera and hosted on Harvard Dataverse. It analyzes political and non-political content from Philippine Facebook pages, providing insights into user engagement, sentiment, civility, and misinformation. The datasets support applications in social media analysis, political communication, and machine learning.

Text, Social Sciences, Computer and Information Science, Misinformation, Natural Language Processing, Facebook, Political Communication, Social Media Analysis, +1

Bollywood Movies Dataset

Bollywood movies dataset published on Kaggle. The dataset likely contains information about films from the Hindi-language film industry. Metadata is minimal; actual content requires verification after download.

Tabular, Movies, Bollywood, Entertainment, +1

Movie Recommender Data

A dataset titled 'movie_recommender.csv' sourced from Kaggle. The title suggests it contains data for building or testing movie recommendation systems. No further metadata is available to confirm its specific contents, size, or origin.

Tabular, Movie Recommendation, Collaborative Filtering, Entertainment, +1

wine_reviews

A dataset named 'wine_reviews' sourced from the OpenML platform. No information is available regarding its contents, size, or structure.

Tokopedia Product Reviews Across 7 Categories

A dataset of customer reviews from Tokopedia, a major Indonesian e-commerce platform. The raw description indicates the data is organized into 7 categories, suggesting a multi-class structure. It was published on Kaggle, but details on volume, authorship, and update recency are unavailable.

Text, Indonesia, E Commerce, Customer Sentiment, Product Reviews, +1

Review Checkpoints: Machine Learning Model Evaluation Data

review-chekpoints--2026-05-24--13263-13263 is a dataset published on Kaggle. The title suggests it likely contains data related to checkpoints or evaluation points for reviews, possibly for model training or assessment. The dataset's specific content, size, and origin require verification after download.

Tabular, Machine Learning, Model Evaluation, Review Checkpoints, +1

Bombax Ceiba Flower Extract Effects on Immunosuppression

Experimental data evaluating the therapeutic potential of aqueous Bombax ceiba flower extract for alleviating cyclophosphamide-induced immunosuppression. The dataset was contributed by Wang, Liuping via Harvard Dataverse and was last updated in April 2026.

Tabular, Pharmacology, Plant Extracts, Experimental Data, Immunology, +1

Health and Fitness App Reviews from 23 Major Applications

180,000 user reviews for health and fitness applications, including Strava, Calm, Nike, Adidas, Lose It!, Garmin, and Google Fit. The dataset is sourced from Kaggle and is intended for natural language processing tasks. The author, organization, and specific collection date are unknown.

Text, App Reviews, Healthcare, Natural Language Processing, Health Fitness, Consumer behavior, +1

v3 cvd pressure shortwin: Cardiovascular Disease and Blood Pressure Data

A dataset concerning cardiovascular disease and blood pressure, published on Kaggle. The title suggests it may contain health metrics related to CVD. Specifics regarding its size, origin, and creation date are unknown.

Tabular, Health Metrics, Blood Pressure, Cardiovascular Disease, +1

Nguyetnga Podcast: Audio and Metadata Collection

Nguyetnga_podcast is a dataset published on Kaggle. The dataset likely contains audio files and associated metadata for a podcast series. The specific content, size, and collection details are not provided in the available metadata.

Tabular, Audio, Media Analysis, Audio Content, Podcast, +1

Columbia River Plume Hydrographic Casts from the 1990-91 Study

Hydrographic cast data from the 1990-1991 Columbia River Plume Study collected by NOAA NCEI. The dataset includes CTD measurements of temperature, salinity, pressure, and conductivity, along with dynamic height, to map the plume's extent and thickness off Oregon and Washington. Observations were taken aboard the R/V Wecoma during the fall season.

Tabular, Time Series, Oceanography, CTD Data, Coastal Oceanography, Hydrographic Casts, +1

Antarctic Meteorological Observations From 1976-1977 Traverse

Meteorological observations of air temperature and pressure were recorded at multiple cane sites during the 1976-1977 Mirny-Dome C traverse in Antarctica. The data was collected by personnel involved in the traverse to aid in precise location determination. Records are archived by the Australian Antarctic Division and represent a snapshot from the late 1970s.

Tabular, Pressure, Temperature, Antarctic Meteorology, Polar Research, Historical Observations, +1

Tropical Sea Level Pressure Grid Data from the International Geophysical Year

German IGY Tropical Sea Level Pressure data provides daily sea-level pressure measurements on a 5-degree latitude/longitude grid. It covers the global tropics from 25°S to 25°N. The dataset was created by SCIOPS and records conditions during the International Geophysical Year from June 1957 to December 1958.

Time Series, Geospatial, Geospatial Grid, Historical Climate, Sea Level Pressure, +1

Hourly Barometric Pressure Observations from Batavia Observatory 1866-1944

Hourly barometric measurements were recorded at the Dutch Royal Magnetic and Meteorological Observatory in Batavia (Djakarta). The dataset covers a 79-year period from 1866 to 1944. Kevin Hamilton and Rolando Garcia keypunched the data in 1986 from original Observatory Yearbooks.

Time Series, Historical Weather, Atmospheric Pressure, Climate Data, Observatory Records, +1

Historical Northern Hemisphere Forecast Grids from 1967-1971

NMC forecast grids provide daily meteorological predictions on a 47x51 Northern Hemisphere polar-stereographic grid centered on the North Pole. The grids contain parameters like geopotential height, temperature, wind, and pressure across various tropospheric levels. Data from the SCIOPS organization is available for sporadic periods between November 1967 and December 1971.

Time Series, Geospatial, Tropospheric Data, Historical Climate, Weather Forecasting, Polar Stereographic Grid, +1

Facebook Stock Prices with Daily Open, Close, and High Values, 2012-2020

Daily stock metrics for Facebook Inc. in 7 columns, including opening price, closing price, and daily high. The data covers a period from 2012 to August 2020. It is shared under a CC0 1.0 license on the OpenML platform.

Tabular, Time Series, Meta Platforms, Finance, Financial Data, Stock Market, +1

Amazon Halloween Costume Reviews with User Text and Scores, November 2020

User reviews for popular Halloween costumes sold on Amazon as of November 2020. The dataset includes review text, titles, scores, publishing dates, and reviewer locations. It is provided under a CC0-1.0 license and is intended as an exercise for text preprocessing and feature extraction.

Text, Tabular, Halloween Costumes, Product Feedback, Sentiment Analysis, Natural Language Processing, Amazon Reviews, Consumer behavior, +1

Trending Movies Over the Years, 2026

A dataset listing movies that were trending over time, sourced from Kaggle. The specific temporal coverage and data collection method are not detailed in the available metadata. The dataset likely contains information about movie popularity across different years.

Tabular, Time Series, Movies, Trends, Entertainment, +1

IMDb Top 10,000 Movies and TV Series with Genre and Year Data

A dataset of top-rated entertainment titles from IMDb, containing over 3 million data points. The dataset includes nested genre information and release years. It was sourced from Kaggle, but the original author, specific license, and last update date are unknown.

Tabular, Movies, TV Series, Entertainment Ratings, IMDb, +1

Movies Data Collection

Movies data is a dataset hosted on Kaggle. The dataset's specific contents, size, and origin are not detailed in the available metadata. Further inspection after download is required to confirm its scope and structure.

Tabular, Movies, Film, Entertainment, +1
