Sign in to view source links and access this dataset
Description
Synthetic guest reviews for homestays in Uttarakhand, India, intended for NLP and sentiment analysis tasks. The dataset was uploaded to Kaggle, but its creator, size, and specific creation date are unknown. The data is described as synthetic, meaning it was artificially generated rather than collected from real guests.
Use Cases
Train sentiment analysis models based on synthetic guest review text.
Benchmark NLP pipelines for review classification based on the described content.
Create BI dashboards for hospitality performance metrics based on review data.
Strengths
Data is explicitly synthetic, which may allow for controlled experimentation and avoid privacy concerns associated with real user data.
Limitations
Description metadata is limited; actual data quality requires manual inspection after download.
Row count is unknown, which may limit suitability assessment.
Column-level documentation is absent; field semantics must be inferred after download.
Provenance
Source
Kaggle
Collection Method
Synthetically generated, as stated in the description.
Geography
Uttarakhand, India
License is unknown; users should verify terms before use.