DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

Media & Communication Datasets | DataSalon

Media & Communication

News corpora, social media analysis, movie/music metadata, sports data, cultural datasets, misinformation

11,012 datasets

Partial pressure of carbon dioxide, salinity, temperature and other variables collected fr

The National Oceanic and Atmospheric Administration (NOAA) collected surface underway chemical, meteorological, and physical data from the R/V F.G. Walton Smith in 2023. The dataset includes the air-sea difference in partial pressure of carbon dioxide (pCO2), pCO2 in the atmosphere and in water, barometric pressure, sea surface salinity, and temperature. These measurements are part of the Global Coastal Carbon Data Project, which focuses on understanding the carbon cycle on continental margins.

Tabular, Time Series, 33wa20230722, Surface Underway, Dfco2 Uatm, 33wa20230808, Coastal Cruises, Focus, Fco2 Atm Interpolated Uatm, 33wa20230605, Fco2 Swsst Uatm, Carbon dioxide, 33wa20230302, 33wa20230818, 33wa20230224, 33wa20231210, Gulf Of Mexico, 33wa20230623, 33wa20230520, Caribbean Sea, Carbon cycle, Nerc Nfsgeo, Aoml Soop Oa, 33wa2023022527n, Coastal Oceanography, +1 more

Intervertebral Disc MRI: T1/T2 Relaxometry and Strain in Human Cadavers

A collection of MRI-derived mechanical and structural measurements from human cadaveric intervertebral discs, produced by Deva Chan and hosted on Harvard Dataverse. It quantifies T1 and T2 relaxation times alongside in-plane strains and estimated shear modulus under physiological compression and bending loads.

Engineering, Medicine Health And Life Sciences, +1 more

Letterboxd Film Metadata and User Reviews

Extracted film metadata, ratings, cast, genres, and user reviews from Letterboxd. The dataset is hosted on Kaggle, but its author, size, and temporal coverage are unknown. The description suggests it contains scraped data from the Letterboxd platform.

Text, Tabular, Social Media, User Reviews, Entertainment, +1 more

Google News Articles Scraped by Keyword Across 50+ Countries

Scraped news articles from Google News based on keywords, brands, or topics. The scraper returns canonical URLs and covers over 50 countries. The author, organization, and specific temporal coverage are unknown.

Text, Media Analysis, News Scraping, Brand Tracking, +1 more

Hacker News Search Scraper via Algolia API

Hacker News stories, comments, and polls scraped via the Algolia API. The dataset likely contains user-generated content from the technology and startup discussion forum. The author, organization, and last update date are unknown.

Text, Web Scraping, Social Media, Hacker News, Discussion Forum, +1 more

Steam Game Metadata and User Reviews Scraped from the Platform

Steam game metadata, pricing, genres, Metacritic scores, and user reviews scraped from the platform. The dataset likely contains structured information on games available via the Steam digital distribution service. The author, organization, and specific data volume are unknown.

Tabular, User Reviews, Game Metadata, Steam, Video Games, Scraped Data, +1 more

Roblox Indonesian Reviews for Sentiment Analysis Experiments

Roblox Indonesian Reviews Dataset is a text collection intended for sentiment analysis experiments using machine learning and deep learning. The dataset is hosted on Kaggle, but its author, size, and license are unknown. Its last update date is also unknown.

Text, Indonesian, Reviews, Sentiment Analysis, Roblox, +1 more

Output Espresso: Coffee-Related Data

A dataset titled 'output_epresso' published on Kaggle. The title suggests the data likely contains information related to espresso or coffee. Specifics regarding its contents, size, and origin are unavailable from the provided metadata.

Tabular, Coffee, Food Drink, Beverage, Espresso, +1 more

ISOT Fake and Real News Dataset for Binary Text Classification

A preprocessed derivative of the ISOT Fake and Real News Dataset, designed for binary text classification tasks. The original dataset contains collections of both fake and real news articles. This version has been processed for machine learning applications, though specific preprocessing steps are not detailed.

Text, News Articles, Fake News, Binary Classification, Text Classification, +1 more

Turkish Technology News Articles from HWP.com.tr

Turkish Technology News Dataset (HWP.com.tr) is a collection of news articles focused on technology topics in the Turkish language. The dataset is intended for natural language processing, machine learning, and text analysis projects. The source is the Turkish news website HWP.com.tr, but the author, license, and specific data volume are unknown.

Text, Turkish Language, Natural Language Processing, Technology News, Text Corpus, +1 more

IMDB Dataset of 50,000 Movies

50,000 movie entries sourced from the Internet Movie Database (IMDB). The dataset is hosted on Kaggle, a platform for data science competitions and projects. The specific collection date, author, and update frequency are not provided in the available metadata.

Tabular, Movies, Film Industry, Imdb, Entertainment, +1 more

IMDB Movies Dataset with 50,000 Entries

A dataset of 50,000 movies sourced from IMDB. The dataset is hosted on Kaggle, but the author, specific columns, and update history are unknown. The content likely includes movie titles and associated metadata.

Tabular, Movies, Film, Imdb, Entertainment, +1 more

News Articles Categorized into 20 Classes

News Category Dataset (20 Classes) is a text dataset hosted on Kaggle. The title suggests it contains news articles labeled into 20 distinct categories. The dataset's author, size, and specific source are unknown.

Text, News Classification, Multi Class, Media Content, Text Data, +1 more

Vietnamese News Articles Collection

Vietnamese_news_10m is a dataset hosted on Kaggle. Its title suggests it likely contains a corpus of news articles written in the Vietnamese language. The dataset's scale, source, and creation details are not provided in the available metadata.

Text, News Articles, Natural Language Processing, Vietnamese Language, Text Corpus, +1 more

Fss Writen Artifacts Ver D 2026 04

Synthetic binary blobs used for measuring sequential write throughput and latency. The dataset is strictly for infrastructure verification purposes. New data for version 2026 04 was uploaded by author micmicmicmicmicchan on 2026-03-21.

Tabular, Time Series, Infrastructure Benchmarking, Licenseopenrail, Write Throughput, Regionus, Synthetic Data, Latency Measurement, +1 more

York County Chapter News Article References

News articles referenced in the York County Chapter's social media posts. The dataset was created by Jennifer D. McGrew of the York County Chapter and was last updated on April 1, 2026.

Text, Media Analysis, News Articles, Social Sciences, Social Media, Local News, +1 more

Brevard Chapter News Video References

News videos referenced in the Brevard County Chapter's social media posts. The dataset was authored by Jennifer D McGrew and last updated in April 2026.

Video, Social Sciences, Social Media, News Videos, Media Content, Local News, +1 more

Brevard Chapter News Articles Referenced In Social Media

News articles referenced by the Brevard Chapter's social media posts, compiled by Jennifer D McGrew of the Brevard County Chapter. The dataset was last updated in April 2026.

Text, Media Analysis, News Articles, Social Media References, Social Sciences, Local News, +1 more

Placer County Chapter News Video Archive

An archive of news videos referenced in the Placer County Chapter's social media posts. The dataset was created by Jennifer D McGrew of the Placer County Chapter and was last updated in April 2026. The exact volume of videos and their publication dates are not specified.

Video, Social Media Content, Media Archive, Social Sciences, News Videos, Local Government, +1 more

News Articles Referenced by Placer County Social Media

Placer - News Articles contains news articles referenced by social media posts from the Placer County Chapter. The dataset was created by Jennifer D McGrew of the Placer County Chapter organization. It was last updated on April 1, 2026.

Text, Media Analysis, News Articles, Social Sciences, Social Media, Local News, +1 more
