Loading...
Loading...
News corpora, social media analysis, movie/music metadata, sports data, cultural datasets, misinformation
11,020 datasets
January 2020 through April 2026 covers 100,286 stories from the Hacker News platform that received at least 100 points. The dataset likely contains popular tech news and discussion topics from this period.
A historical and political analysis by Professor Melvyn C. Goldstein examining the status of Tibet from the 17th century to the late 20th century. The text provides a detailed account of political negotiations, cultural changes, and international responses, culminating in a proposed compromise for the conflict. It is sourced from the paperswithcode platform.
Manhood and American Political Culture in the Cold War is a text-based dataset from paperswithcode. The work by K. A. Cuordileone analyzes the intersection of masculinity, liberalism, and anti-communism in post-war American political discourse. It likely contains structured arguments and historical analysis from the described chapters.
A collection of 16 scholarly essays examining the mechanics of internationalism across culture, society, and politics from the 1840s to the First World War. The work, edited by Martin H. Geyer and Johannes Paulmann, includes contributions from multiple authors on topics like trade, law, sport, and women's organizations. The dataset likely contains the full text of these historical analyses.
Procopius's 'Secret History' is a text providing a critical account of the 6th-century CE Byzantine emperor Justinian and empress Theodora. The author, the Byzantine historian Procopius, wrote in the late fifth century to after 558 CE. The work alleges the rulers' ruinous effect on the Roman empire and portrays Theodora's character in sharp detail.
From the early 1850s until 1959, this work by Louis A. Perez Jr. examines the cultural encounter between Cuba and the United States. It uses a range of Cuban and US sources, including archival records, oral interviews, magazines, novels, and motion pictures. The analysis focuses on how US cultural forms influenced Cuban identity, nationality, and the context for the 1959 revolution.
Richard Kuisel's book explores France's response to American influence from the late 1940s to the mid-1980s. The analysis includes case studies such as the Coca-Cola controversy, the Marshall Plan, and de Gaulle's policies. It draws from perspectives across French society, including politicians, businessmen, trade unionists, and intellectuals.
From the 19th century to the present day, this dataset charts the cultural diffusion of British and American sports globally. It traces how key sports like cricket and soccer have affected cultural development in different countries, such as India and Africa. The data was authored by Allen Guttmann and sourced from paperswithcode.
Marita Sturken's book analyzes American cultural responses to national trauma over two decades, focusing on the September 11th attacks and the Oklahoma City bombing. The work investigates consumerism, memorial design debates, and the political implications of memory and kitsch. It is a critical text in media and cultural studies, sourced from the paperswithcode platform.
Philip J. Deloria's book explores cultural discordance through accounts of Native Americans in unexpected contexts like Wild West shows, Hollywood films, and sports. The work examines hidden narratives and stereotypes, suggesting new directions for American Indian history. The text is sourced from the paperswithcode platform and is associated with fields including history, anthropology, and sociology.
Encyclopaedia Britannica's 11th edition, a landmark reference work published in 1911. The dataset contains the full text of this historical encyclopedia, digitized by Wolfram Research, Inc. It provides a snapshot of early 20th-century knowledge and perspectives.
Kaggle hosts a multi-source dataset designed for heritage risk prediction and management analysis. The dataset's specific sources, size, and creation details are not provided. Its intended application is for analytical tasks related to cultural heritage preservation.
Kaggle hosts a dataset titled 'Movies'. The dataset's specific contents, such as the number of records, columns, and temporal coverage, are not detailed in the available metadata. Its origin and creation date are also unspecified.
A dataset of customer reviews from an e-commerce platform, published on Kaggle. The specific source, size, and time range are not detailed in the available metadata. Columns and data specifics require verification after download.
SAPR is a dataset for analogical reasoning in Arabic. It focuses on native, culturally-grounded scenarios involving proverbs. The dataset was uploaded to Kaggle, but its author, size, and update history are unknown.
A dataset designed for sentiment analysis tasks on user reviews written in the Kannada language but using Roman script. The dataset's author, size, and specific collection details are not provided in the available metadata. Its last update date and licensing terms are also unknown.
A dataset titled 'review-chekpoints--2026-05-08--13247-13247' published on Kaggle. The title suggests it may contain information related to checkpoints for reviewing or evaluating models. The specific content, size, and origin are unknown from the provided metadata.
Cook County of Illinois data reflects potential defendants in cases brought for review by the State's Attorney's Office. The dataset is no longer actively maintained as of December 2024, with users directed to newer data dashboards. Row and column counts are unknown.
Steam-review-model is a dataset from Kaggle, likely containing user reviews from the Steam digital game distribution platform. The dataset's specific size, columns, and creation date are unknown. Its content is inferred to be text data suitable for modeling tasks related to game reviews.
A text dataset of NSFW writing prompts sourced from Reddit and shared via the ShareGPT platform. The dataset was uploaded by author 'lipilipic' to Hugging Face and was last updated on April 4, 2026. The specific content, size, and structure require verification after download.