Loading...
Loading...
News corpora, social media analysis, movie/music metadata, sports data, cultural datasets, misinformation
10,085 datasets
Nimbus-7 satellite Temperature-Humidity Infrared Radiometer (THIR) Level 1 data provides calibrated infrared radiances from two spectral channels. The 11.5-micron channel measures cloud top and surface temperatures with 6.7 km resolution, while the 6.7-micron channel detects upper atmospheric water vapor with 20 km resolution. Data coverage spans from October 30, 1979, to May 13, 1985, and is archived in its original proprietary format recovered from historical 9-track tapes.
The Western Hemisphere, as viewed from geostationary orbit at 115°W longitude, is covered by this historical weather imagery. It consists of visible spectrum (0.55–0.70 µm) scans from the first GOES satellite, originally recorded on 70mm film and later digitized into TIFF files. Each image includes metadata titles and a 33-level grayscale for brightness temperature calibration, with a maximum spatial resolution of 3.7 km per pixel.
SMS-2 satellite imagery from 1975 was originally recorded on 70mm film and later digitized into TIFF files. Each scan contains 2 or 3 pictures with a maximum effective size of 500 sq km and a resolution of 3.7 km per pixel. The images include metadata titles and a 33-level gray scale for brightness temperature analysis.
Synchronous Meteorological Satellite-2 (SMS-2) captured infrared imagery of the Western Hemisphere from geostationary orbit, initially at 105°W and later at 135°W. This dataset consists of digital scans of 70mm film containing 10.5–12.5 micrometer IR pictures, each with embedded metadata titles and a 33-level brightness temperature grayscale. Individual pictures can cover up to 500 sq km with a maximum spatial resolution of 3.7 km per pixel and may include contrast enhancement, sectorization, or reduced-size options.
GOES-1, the first Geostationary Operational Environmental Satellite, captured this infrared imagery from a fixed point over the equator at 115°W longitude starting December 18, 1975. The data was originally printed on 70mm film from digital tapes and later scanned into multi-picture TIFF files, each containing a title block and a 33-level grayscale for temperature calibration. Images have a maximum native resolution of 3.7 km per pixel and could be contrast-enhanced, sectorized, or reduced to 1/16 size.
1.2 GB benchmark dataset pairing high-frequency electricity price data from New South Wales with temporally aligned market news. It was created by Zhaoge Bi and last updated on May 7, 2026, to evaluate the use of textual context in forecasting models. The dataset demonstrates that current numerical models and large language models do not yet use market news consistently or reliably.
Real-time vehicle detection data collected from Singapore's 90 LTA traffic cameras using a novel Context-Aware Traffic Intelligence (CATI) system. The dataset contains per-camera detection results from the Land Transport Authority expressway network, continuously updated by author SuhxsReddy. The last recorded update was on May 12, 2026.
1.6 KB of Crystallographic Information Files (CIF) detail the atomic structures of two synthesized magnesium carbonate phases. The data includes lattice parameters, space groups, and calculated densities for monohydrated magnesium carbonate (MHMC) and the β-MgCO3 polymorph. The dataset was authored by Ryo Yamane and last updated on April 15, 2026.
A 2026 study by Ryo Yamane characterized two novel magnesium carbonate phases synthesized under high pressure. The dataset includes crystallographic information files (CIF) detailing the unit cell parameters, space groups, and calculated densities for monohydrated magnesium carbonate and the β-MgCO3 polymorph. Structural data was determined using microelectron diffraction and powder X-ray diffraction Rietveld refinement.
Transcriptional profiling data characterizing the attachment, maturation, and dispersal stages of Pseudomonas aeruginosa PAO1 biofilms in closed cultures. The dataset, authored by Xavier Bertran Forga and last updated in April 2026, includes fold-change measurements for genes related to systems like Pil-Chp, Pel polysaccharide synthesis, and cis-2-decenoic acid sensing. It identifies fourteen genes as transcriptional biomarkers of the dispersal stage.
A dataset from figshare containing transcriptional profiling data for the model bacterium Pseudomonas aeruginosa PAO1. The work characterizes gene expression across three biofilm life cycle stages—attachment, maturation, and dispersal—in closed cultures. The dataset, authored by Xavier Bertran Forga and last updated in April 2026, identifies stage-specific gene upregulation and proposes fourteen genes as transcriptional biomarkers of dispersal.
A 110.5 KB PDF authored by Xinyi Qi and published on figshare in April 2026. The document presents a perspective paper proposing a four-part psychological framework for understanding AI's role in natural science research. It addresses themes of labor visibility, identity stability, accountability, and institutional climate in the context of AI tools like large language models and autonomous laboratories.
Wisely Kola's MSc dissertation dataset contains figures and tables summarizing experimental findings on the maize fungal pathogen Exserohilum turcicum. The 19.1 MB collection includes microscopy images and quantitative tables on biofilm development under varying pH and temperature, architecture, and responses to difenoconazole. The dataset was last updated on April 20, 2026, and is shared under a CC-BY-4.0 license.
Sixteen studies examining the relationship between cortical auditory evoked potentials (CAEPs) and speech-in-noise (SPiN) performance are reviewed. The review includes data from 238 participants with sensorineural hearing loss and 204 participants with normal hearing. It was authored by Lana Biot and published on figshare in April 2026.
A 34.6 KB document by Qi-Hang Pan, uploaded to figshare on 2026-04-16, details a study on depression following traumatic brain injury in mice. The research investigates the role of the TRPC1 protein, neuroinflammation, and synaptic function using behavioral tests and molecular analysis.
Titanium 4 is an upcoming dataset of 4,900 rows focused on real-world, challenging agentic coding tasks. Generated by the DeepSeek-V4-Pro model, it prioritizes DevOps and architecture problems across multiple programming languages. The preview was released by sequelbox on Hugging Face in June 2026.
Survey results examining the relationship between sports participation and psychological state among the Chinese population, considering conditions of access to sports resources. The dataset was authored by Bin Li and is available under a CC-BY-4.0 license. It was last updated on May 19, 2026.
Bin Li's dataset on figshare contains results analyzing the impact of sports participation on the psychological state of the Chinese people. The data is stored in an XLS file sized 9.5 KB and was last updated on 2026-05-19. It is shared under a CC-BY-4.0 license.
Monthly data tracks the average number of business days Winnipeg's development services department takes to notify applicants of a decision. Metrics include the city's service level standard and the percentage of time its target is met, updated monthly by data.winnipeg.ca. The dataset distinguishes between simple permits requiring only zoning review and complex permits reviewed by multiple departments.
Iron and manganese Indicators of Reduction in Soils (IRIS) devices were placed in rice paddies under variably flooded conditions. The dataset correlates paint removal from these devices with soil redox, porewater chemistry, methane emissions, and water level. It is a 242.3 KB CSV file authored by Matt Limmer and last updated on 2026-05 04.