Brilliant-Diamonds: 119K Natural and Lab-Created Diamond Listings
arff
Available on 1 platform
Sign in to view source links and access this dataset
Description
119,000 diamond listings scraped from brilliantearth.com to demystify the value of the 4 Cs (cut, color, clarity, carat). The dataset includes attributes like price, carat weight, shape, and certificate type, and was collected using a tool called DiamondScraper.
Use Cases
Predict diamond price based on carat, cut, color, and clarity features.
Analyze market trends for lab-created versus natural diamonds based on the 'type' attribute.
Study the relationship between diamond shape and price based on the 'shape' column.
Identify common grading report providers based on the 'report' field.
Strengths
Contains 119,000 diamond records, providing a substantial sample for analysis.
Includes key pricing factors like carat, cut, color, and clarity as described.
Limitations
Row count for the specific dataset instance is unknown, which may limit suitability assessment.
Column-level documentation is absent; field semantics must be inferred after download.
Last update date is unknown; freshness unverified.
Provenance
Source
Brilliant Earth (brilliantearth.com)
Collection Method
Scraped using DiamondScraper
License is CC-BY-NC-4.0, which prohibits commercial use.