Tabular & Structured Datasets
Clean, well-documented tabular datasets for regression, classification, and exploratory analysis.
UCI ML Repository Highlights
| Dataset | Rows | Cols | Task | License | Browser? | Link |
| --- | --- | --- | --- | --- | --- | --- |
| Abalone | 4,177 | 9 | Regression | CC-BY-4.0 | Yes | UCI |
| Auto MPG | 398 | 8 | Regression | CC-BY-4.0 | Yes | UCI |
| Bank Marketing | 45,211 | 17 | Classification | CC-BY-4.0 | Yes | UCI |
| Mushroom | 8,124 | 23 | Classification | CC-BY-4.0 | Yes | UCI |
| Covertype | 581,012 | 54 | Classification | CC-BY-4.0 | Yes | UCI |
| Online Retail | 541,909 | 8 | Clustering | CC-BY-4.0 | Yes | UCI |
| Dry Bean | 13,611 | 17 | Classification | CC-BY-4.0 | Yes | UCI |
| Student Performance | 649 | 33 | Classification | CC-BY-4.0 | Yes | UCI |
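
Several of the UCI files ship as plain comma-separated text without a header row, so column names have to be supplied by the reader. The sketch below shows one way to parse Abalone-style rows with only the Python standard library. The function name `read_abalone`, the snake_case column names, and the sample values are assumptions made for illustration; the column order follows the dataset's documented attribute list (nine fields, matching the Cols figure above), and it is worth checking against the copy you download.

```python
import csv
import io

# Column names supplied by hand because the raw file has no header row.
# Naming style (snake_case) is our own choice, not part of the dataset.
ABALONE_COLUMNS = [
    "sex", "length", "diameter", "height",
    "whole_weight", "shucked_weight", "viscera_weight", "shell_weight",
    "rings",
]


def read_abalone(handle):
    """Parse header-less Abalone rows from an open text handle into dicts."""
    rows = []
    for record in csv.reader(handle):
        if not record:  # csv.reader yields [] for blank lines; skip them
            continue
        if len(record) != len(ABALONE_COLUMNS):
            raise ValueError(
                f"expected {len(ABALONE_COLUMNS)} fields, got {len(record)}"
            )
        row = dict(zip(ABALONE_COLUMNS, record))
        # Every field except `sex` is numeric; `rings` is an integer count.
        for name in ABALONE_COLUMNS[1:-1]:
            row[name] = float(row[name])
        row["rings"] = int(row["rings"])
        rows.append(row)
    return rows


# Illustrative rows in the file's format (made-up values, not real records).
SAMPLE = (
    "M,0.45,0.36,0.10,0.50,0.22,0.10,0.15,15\n"
    "F,0.53,0.42,0.14,0.68,0.26,0.14,0.21,9\n"
)

rows = read_abalone(io.StringIO(SAMPLE))
print(len(rows), rows[0]["sex"], rows[0]["rings"])
```

To read a downloaded copy instead of the in-memory sample, open the file with `open(path, newline="")` and pass the handle to `read_abalone`; the path is whatever you saved the file as.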
Kaggle Favorites
| Dataset | Rows | Size | Topic | Browser? | Link |
| --- | --- | --- | --- | --- | --- |
| House Prices | 1,460 | 460 KB | Real estate regression | Yes | Kaggle |
| Credit Card Fraud | 284,807 | 150 MB | Anomaly detection | Yes | Kaggle |
| Telco Churn | 7,043 | 955 KB | Churn prediction | Yes | Kaggle |
| Spaceship Titanic | 8,693 | 600 KB | Classification | Yes | Kaggle |
| NYC Taxi Trips | 1.1B+ | 250 GB | Regression/Geospatial | No | NYC TLC |
| Airbnb Listings | Varies | 10-100 MB | Pricing/Geo | Yes | Inside Airbnb |
Financial