Skip to content

Tabular & Structured Datasets

Clean, well-documented tabular datasets for regression, classification, and exploratory analysis.

UCI ML Repository Highlights

Dataset Rows Cols Task License Browser? Link
Abalone 4,177 9 Regression CC-BY-4.0 Yes UCI
Auto MPG 398 8 Regression CC-BY-4.0 Yes UCI
Bank Marketing 45,211 17 Classification CC-BY-4.0 Yes UCI
Mushroom 8,124 23 Classification CC-BY-4.0 Yes UCI
Covertype 581,012 54 Classification CC-BY-4.0 Yes UCI
Online Retail 541,909 8 Clustering CC-BY-4.0 Yes UCI
Dry Bean 13,611 17 Classification CC-BY-4.0 Yes UCI
Student Performance 649 33 Classification CC-BY-4.0 Yes UCI

Kaggle Favorites

Dataset Rows Size Topic Browser? Link
House Prices 1,460 460 KB Real estate regression Yes Kaggle
Credit Card Fraud 284,807 150 MB Anomaly detection Yes Kaggle
Telco Churn 7,043 955 KB Churn prediction Yes Kaggle
Spaceship Titanic 8,693 600 KB Classification Yes Kaggle
NYC Taxi Trips 1.1B+ 250 GB Regression/Geospatial No NYC TLC
Airbnb Listings Varies 10-100 MB Pricing/Geo Yes Inside Airbnb

Financial

Dataset Coverage Size Format License Link
Yahoo Finance Stock prices API CSV Free for personal finance.yahoo.com
FRED Economic indicators API CSV Open fred.stlouisfed.org
World Bank Development indicators 300 MB CSV CC-BY-4.0 data.worldbank.org
SEC EDGAR Company filings Varies XBRL/JSON Open sec.gov