WebSight contains between 1 and 10 million pairs of synthetic website screenshots and their corresponding HTML/CSS code, released by HuggingFaceM4 in March 2024. The collection features two distinct versions covering standard HTML/CSS and modern HTML/Tailwind CSS implementations for English-language websites.
The dataset is distributed in Parquet format. Users choose between v0.1 (standard CSS) and v0.2 (Tailwind CSS) depending on which styling approach their model should produce.
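The version choice can be sketched in Python as below. This is a minimal illustration, not an official loader: the helper names `pick_version` and `stream_websight` are hypothetical, and the repository ID `HuggingFaceM4/WebSight` and the configuration names `v0.1` and `v0.2` are assumptions based on the publisher and version labels given above. Check them against the dataset card before relying on them.

```python
# Map the CSS flavor a model should emit to the WebSight version described above.
VERSION_BY_CSS = {
    "css": "v0.1",       # standard HTML/CSS
    "tailwind": "v0.2",  # HTML/Tailwind CSS
}


def pick_version(css_flavor: str) -> str:
    """Return the version label for a CSS flavor ('css' or 'tailwind')."""
    key = css_flavor.strip().lower()
    if key not in VERSION_BY_CSS:
        raise ValueError(f"unknown CSS flavor: {css_flavor!r}")
    return VERSION_BY_CSS[key]


def stream_websight(css_flavor: str, repo_id: str = "HuggingFaceM4/WebSight"):
    """Open the chosen version lazily instead of downloading every Parquet shard.

    Assumes the Hugging Face `datasets` library is installed and that the
    repository exposes configurations named after the version labels
    (an assumption, not confirmed by the text above).
    """
    from datasets import load_dataset  # imported here so pick_version works without it

    return load_dataset(
        repo_id,
        pick_version(css_flavor),
        split="train",
        streaming=True,
    )
```

Calling `stream_websight("tailwind")` would then return an iterable over the v0.2 records. Streaming is used here because a collection in the 1 to 10 million range is expensive to download in full just to inspect a few examples.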