News corpora, social media analysis, movie/music metadata, sports data, cultural datasets, misinformation
11,020 datasets
A practice dataset from Kaggle concerning books. The data likely contains price information and user reviews. The original author, organization, and specific details like size and license are unknown.
imdb_urdu_reviews is a dataset of movie reviews in the Urdu language, sourced from the IMDb platform. The dataset is hosted on Kaggle, but its specific scale, creation date, and authorship are not provided in the available metadata. The content likely contains user-generated text for films listed on IMDb.
News Metadata Dataset (7K) is a metadata-only collection from Kaggle containing structural, bias, and narrative signals for news content. The dataset's author, organization, and specific source are unknown. The last update date is also unknown.
WHO Global Health Observatory data on the existence of national targets for blood pressure and hypertension control. The dataset likely contains indicators tracking policy adoption across member states. It is published by the World Health Organization.
WHO data likely tracks compliance scores for bans on advertising specific products on national television and radio. The dataset's columns suggest it contains quantitative measures of adherence to these media regulations. It is published by the World Health Organization on the WHO GHO platform.
WHO data on compliance with bans on advertising in local magazines and newspapers. The dataset likely contains scores or metrics assessing adherence to public health advertising regulations. It is published by the World Health Organization on the WHO GHO platform.
A dataset from the World Health Organization (WHO) concerning regulations on product placement in national television broadcasts. The dataset likely contains tabular information on policy measures, potentially covering various countries or regions. Its specific temporal coverage, row count, and column details are not provided in the metadata.
A dataset from the World Health Organization (WHO) on restrictions for product placement on national television. The data likely contains information on regulatory policies concerning advertising and public health. The specific columns, time range, and geographic coverage are not detailed in the available metadata.
Global data on advertising restrictions for products like tobacco, alcohol, and unhealthy foods on national television. The dataset is published by the World Health Organization (WHO) through its GHO platform. The specific temporal coverage, number of countries, and update frequency are not detailed in the available metadata.
WHO GHO data on advertising restrictions for products like tobacco, alcohol, or unhealthy foods on national television. The dataset likely contains policy indicators or regulatory statuses for different countries. Its specific temporal coverage, column details, and update schedule are not provided in the metadata.
World Health Organization data tracks compliance with bans on advertising in local magazines and newspapers, likely related to tobacco control or other health policies. The dataset's specific row count and temporal coverage are not provided. It originates from the WHO's monitoring of global health regulations.
World Health Organization data on advertising bans for local magazines and newspapers. The dataset covers policies related to media regulation and public health. The WHO likely compiled this information to monitor global health policy implementation.
Factors associated with depression likely form the core of this dataset. Published on Kaggle, its specific variables and collection method are unknown. The data may be useful for exploring correlations with mental health outcomes.
Rebuttal Filtered Original 1 Low 0.7 16384 Gemini 3 Pro Preview is a dataset published on Hugging Face by connections-dev. The title suggests it contains text data, likely filtered for rebuttal or safety training purposes, and may be associated with the Gemini 3 Pro Preview model. It was last updated on March 28, 2026.
News headlines are collected from 18 major media sources including Fox Business, Reuters, BBC, and The New York Times. A script checks for new headlines every 20 minutes, with data acquisition starting on March 21, 2020. The dataset creator intends to update it daily, subject to system availability.
36,622 comments from 209 frequent Amazon customers on book content, reflecting user preferences and tastes. This dataset is part of the HumanLM benchmark for training user simulators that accurately reflect real user behavior. The data was sourced from Amazon Reviews 2023 and spans from 1998-01-25 to 2023-05-10.
Systematic literature review of academic papers on remittance costs, compiled from queries run on Ideas-RePec, EconLit, Web of Science, and Scopus databases. The review covers articles published in English from 2015 to 2025, filtered by accessibility, and uses a multi-dimensional framework for analysis.
This entry presents a systematic literature review of academic papers on remittance costs, compiled from queries run on Ideas-RePec, EconLit, Web of Science, and Scopus databases. The review covers articles published in English from 2015 to 2025, filtered by accessibility, and uses a multi-dimensional framework for analysis.
Kaggle hosts a dataset titled 'facebook'. The dataset's content and structure are unspecified. Metadata is minimal; actual content requires verification after download.
A dataset of news articles published on Kaggle. The title suggests it likely contains textual news content. Specific details such as the number of articles, publication dates, and original sources are unknown.