5.7 KB of derived data from a study of award-winning children's book reviews. The dataset includes sentiment patterns, rating distributions, cosine similarity, and rating–sentiment discrepancies, but excludes raw text and user identifiers. Yunseon Choi published this dataset on figshare in 2026.
Use Cases
- Analyze sentiment patterns in online reviews of award-winning children's books.
- Study discrepancies between numerical ratings and sentiment scores.
- Calculate cosine similarity between review sentiments.
- Model rating distributions for children's literature.
Strengths
- Data is de-identified, protecting user privacy.
- Derived metrics like cosine similarity and rating–sentiment discrepancies are included.
- File size is 5.7 KB, making it easy to download and process.
Limitations
- Row count is unknown, which may limit suitability assessment.
- Column-level documentation is absent; field semantics must be inferred after download.
- Raw review text and user information are excluded, limiting text-based NLP tasks.
Provenance
- Source
- figshare
- Collection Method
- Derived from a study of online reviews of award-winning children's books.
- Freshness
- Last updated 2026-05-05 01:16:14; freshness should be verified.