Name Instances Attributes Missing Values Tasks Dataset Types Attribute Types Area Hits Date
Abalone 4177 8 No Classification Multivariate Categorical, Integer, Real Life 996004 1995-12-01
Qualitative Structure Activity Relationships - - N/A N/A Domain-Theory N/A Physical 27562 N/A
Prodigy - - N/A N/A Domain-Theory N/A N/A 34635 N/A
Primary Tumor 339 17 Yes Classification Multivariate Categorical Life 135961 1988-11-01
Quadruped Mammals - 72 No Classification Multivariate, Data-Generator Real Life 68363 1992-08-25
Post-Operative Patient 90 8 Yes Classification Multivariate Categorical, Integer Life 109124 1993-06-01
Pen-Based Recognition of Handwritten Digits 10992 16 No Classification Multivariate Integer Computer 232761 1998-07-01
Optical Recognition of Handwritten Digits 5620 64 No Classification Multivariate Integer Computer 285071 1998-07-01
Page Blocks Classification 5473 10 No Classification Multivariate Integer, Real Computer 99972 1995-07-01
Othello Domain Theory - - No N/A Domain-Theory N/A Game 28392 1991-02-01
Nursery 12960 8 No Classification Multivariate Categorical Social 204439 1997-06-01
Musk (Version 2) 6598 168 No Classification Multivariate Integer Physical 73938 1994-09-12
Musk (Version 1) 476 168 No Classification Multivariate Integer Physical 68456 1994-09-12
Mushroom 8124 22 Yes Classification Multivariate Categorical Life 569337 1987-04-27
Moral Reasoner 202 - N/A N/A Domain-Theory N/A Computer 40596 1994-06-01
Multiple Features 2000 649 No Classification Multivariate Integer, Real Computer 114363 N/A
Intelligent Media Accelerometer and Gyroscope (IM-AccGyro) Dataset 800 9 N/A Classification Time-Series Real Physical 714 2020-09-03
Russian Corpus of Biographical Texts 200 2 N/A Classification Text N/A N/A 1173 2020-06-03
Codon usage 13028 69 Yes Classification, Clustering Multivariate N/A Life 2263 2020-10-03
IIWA14-R820-Gazebo-Dataset-10Trajectories - - N/A Regression N/A Integer Computer 1858 2020-06-09
Guitar Chords finger positions 2633 5 N/A Classification Text N/A N/A 7656 2020-06-05
South German Credit (UPDATE) 1000 21 N/A Classification, Regression, Clustering Multivariate Integer, Real Business 8864 2020-06-20
Taiwanese Bankruptcy Prediction 6819 96 N/A Classification Multivariate Integer Business 12237 2020-06-28
HCV data 615 14 Yes Classification, Clustering Multivariate Integer, Real Life 12702 2020-06-10
CLINC150 23700 - N/A Classification Text N/A N/A 1564 2020-05-08
COVID-19 Surveillance 14 7 N/A Classification Multivariate N/A Computer 9034 2020-04-24
Refractive errors 467 79 Yes Classification Multivariate Integer Life 1392 2020-04-27
Bone marrow transplant: children 187 39 Yes Classification, Regression Multivariate Integer, Real Life 4035 2020-04-21
Unmanned Aerial Vehicle (UAV) Intrusion Detection 17256 55 N/A Classification Multivariate Real Computer 2347 2020-04-12
Iranian Churn Dataset 3150 13 N/A Classification, Regression Multivariate Integer Business 4285 2020-04-09
Shill Bidding Dataset 6321 13 N/A Classification, Clustering Multivariate N/A Computer 1847 2020-03-10
Seoul Bike Sharing Demand 8760 14 N/A Regression Multivariate Integer, Real Computer 5421 2020-03-01
Person Classification Gait Data 48 321 N/A Classification Multivariate Real Computer 1459 2020-03-02
Monolithic Columns in Troad and Mysia Region 11 19 N/A Classification Multivariate Real Computer 484 2020-02-08
Nasarian CAD Dataset 150 52 N/A Classification Multivariate N/A Life 763 2020-02-28
: Simulated Data set of Iraqi tourism places 232 16 N/A Classification, Clustering Multivariate N/A Computer 2095 2020-01-10
Apartment for rent classified 10000 22 N/A Classification, Regression, Clustering Multivariate N/A Business 10217 2019-12-26
CNNpred: CNN-based stock market prediction using a diverse set of variables 1985 84 Yes Classification, Regression Sequential, Time-Series Real Computer 2500 2019-12-26
clickstream data for online shopping 165474 14 N/A Classification, Regression, Clustering Multivariate, Sequential Integer, Real Business 7521 2019-12-09
Activity recognition using wearable physiological measurements 4480 533 N/A Classification Multivariate Real Life 2835 2019-12-04
UrbanGB, urban road accidents coordinates labelled by the urban center 360177 2 N/A Clustering Univariate Real Computer 970 2019-11-22
Gas Turbine CO and NOx Emission Data Set 36733 11 N/A Regression, Clustering Multivariate Real Computer 3019 2019-11-29
Horton General Hospital 139 6 N/A Causal-Discovery Multivariate, Time-Series Integer Life 1411 2019-11-13
Breath Metabolomics 104 1656 N/A Classification, Clustering Multivariate, Time-Series Real Life 2257 2019-11-08
Algerian Forest Fires Dataset 244 12 N/A Classification, Regression Multivariate Real Life 4613 2019-10-22
Vehicle routing and scheduling problems 18 9 N/A Clustering Multivariate Integer, Real Business 2132 2019-10-07
Rice (Cammeo and Osmancik) 3810 8 N/A Classification Multivariate Real Computer 2420 2019-10-06
Estimation of obesity levels based on eating habits and physical condition 2111 17 N/A Classification, Regression, Clustering Multivariate Integer Life 7672 2019-08-27
User Profiling and Abusive Language Detection Dataset 65919 3 N/A Classification N/A N/A Computer 1114 2019-04-25
Malware static and dynamic features VxHeaven and Virus Total 2955 1087 N/A Classification Multivariate Integer, Real Computer 908 2019-01-31
Internet Firewall Data 65532 12 N/A Classification Multivariate N/A Computer 2893 2019-02-04
3W dataset 1984 8 Yes Classification, Clustering Multivariate, Time-Series Integer, Real Computer 5158 2019-08-15
Sattriya_Dance_Single_Hand_Gestures Dataset 1450 - N/A Classification Multivariate N/A Computer 376 2019-07-22
Cervical Cancer Behavior Risk 72 19 N/A Classification, Clustering Multivariate, Univariate Integer Life 3112 2019-07-17
Pedestrian in Traffic Dataset 4760 14 Yes Classification, Regression, Causal-Discovery Multivariate, Sequential, Time-Series Real Computer 3417 2019-07-04
Youtube cookery channels viewers comments in Hinglish 9800 3 N/A Classification Multivariate, Text N/A Computer 1169 2019-07-03
Detect Malware Types 7107 280 N/A Classification Multivariate, Time-Series, Text N/A Computer 3132 2019-06-03
Demand Forecasting for a store 28764 8 N/A Regression Multivariate Integer N/A 4400 2019-05-14
Stock keeping units 2279 9 Yes Clustering Multivariate Integer, Real Business 1931 2019-04-10
Turkish Spam V01 826 2 N/A Classification Text N/A Social 847 2019-04-07
Early stage diabetes risk prediction dataset. 520 17 Yes Classification Multivariate N/A Computer 19528 2020-07-12
Amphibians 189 23 N/A Classification Multivariate Integer, Real Life 13140 2020-07-17
Facebook Large Page-Page Network 22470 4714 N/A Classification Multivariate N/A Social 18232 2020-07-22
BitcoinHeistRansomwareAddressDataset 2916697 10 N/A Classification, Clustering Multivariate, Time-Series Integer, Real Computer 12852 2020-06-17
Crop mapping using fused optical-radar data set 325834 175 N/A Classification Multivariate, Time-Series Real N/A 6352 2020-06-16
Swarm Behaviour 24017 2400 N/A Classification Multivariate Real Computer 10430 2020-06-16
Exasens 399 4 Yes Classification, Clustering Multivariate Integer Life 6526 2020-04-22
selfBACK 26136 6 N/A Classification, Clustering Time-Series Real Computer 6879 2020-06-15
South German Credit 1000 21 N/A Classification, Regression, Clustering Multivariate Integer, Real Business 5929 2019-11-29
Deepfakes: Medical Image Tamper Detection 20000 200000 N/A Classification Multivariate Real Computer 5039 2020-03-11
Heart failure clinical records 299 13 N/A Classification, Regression, Clustering Multivariate Integer, Real Life 22665 2020-02-05
Speaker Accent Recognition 329 12 N/A Classification Multivariate Real Social 6228 2020-03-04
Shoulder Implant X-Ray Manufacturer Classification 597 1 N/A Classification Multivariate Real Life 3848 2020-05-20
Kitsune Network Attack Dataset 27170754 115 N/A Classification, Clustering, Causal-Discovery Multivariate, Sequential, Time-Series Real Computer 118787 2019-10-16
Bar Crawl: Detecting Heavy Drinking 14057567 3 N/A Classification, Regression Multivariate, Time-Series Real Life 57274 2020-02-24
Bias correction of numerical prediction model temperature forecast 7750 25 Yes Regression Multivariate Real Physical 17350 2020-02-18
Real-time Election Results: Portugal 2019 21643 29 N/A Regression Multivariate, Time-Series, Text Integer, Real Social 14075 2019-12-05
A study of Asian Religious and Biblical Texts 590 8265 N/A Classification, Clustering Multivariate, Text Integer Social 20673 2019-12-24
QSAR fish bioconcentration factor (BCF) 1056 7 N/A Regression Multivariate Integer, Real Life 7137 2019-11-27
QSAR Bioconcentration classes dataset 779 14 N/A Classification, Regression Multivariate N/A Life 6998 2019-10-11
QSAR androgen receptor 1687 1024 N/A Classification Multivariate N/A Physical 3938 2019-10-01
QSAR oral toxicity 8992 1024 N/A Classification Multivariate N/A Physical 7481 2019-10-01
WISDM Smartphone and Smartwatch Activity and Biometrics Dataset 15630426 6 N/A Classification Multivariate, Time-Series Real Computer 59921 2019-10-06
QSAR aquatic toxicity 546 9 N/A Regression Multivariate Real Physical 16317 2019-09-23
Human Activity Recognition from Continuous Ambient Sensor Data 13956534 37 Yes Classification Multivariate, Sequential, Time-Series Integer, Real N/A 25420 2019-09-20
QSAR fish toxicity 908 7 N/A Regression Multivariate Real Physical 27696 2019-09-23
Hepatitis C Virus (HCV) for Egyptian patients 1385 29 N/A Classification Multivariate Integer, Real Life 50828 2019-09-30
Online Retail II 1067371 8 Yes Classification, Regression, Clustering Multivariate, Sequential, Time-Series, Text Integer, Real Business 84410 2019-09-21
Beijing Multi-Site Air-Quality Data 420768 18 Yes Regression Multivariate, Time-Series Integer, Real Physical 47858 2019-09-20
MEx 6262 710 N/A Classification, Clustering Time-Series Real Computer 13160 2019-09-20
Opinion Corpus for Lebanese Arabic Reviews (OCLAR) 3916 3916 N/A Classification Text Integer Computer 4590 2019-06-17
Incident management process enriched event log 141712 36 Yes Regression, Clustering Multivariate, Sequential Integer Business 24238 2019-07-14
Divorce Predictors data set 170 54 N/A Classification Multivariate, Univariate Integer Life 85191 2019-07-24
Alcohol QCM Sensor Dataset 125 8 N/A Classification, Regression, Clustering Multivariate Real Computer 49561 2019-07-22
PPG-DaLiA 8300000 11 N/A Regression Multivariate, Time-Series Real Computer 19986 2019-07-30
Wave Energy Converters 288000 49 Yes Regression Multivariate Real Computer 20186 2019-06-30
Query Analytics Workloads Dataset 260000 8 N/A Regression, Clustering Multivariate Real Computer 15328 2019-06-22
Metro Interstate Traffic Volume 48204 9 N/A Regression Multivariate, Sequential, Time-Series Integer, Real N/A 51720 2019-05-07
Parkinson Dataset with replicated acoustic features 240 46 N/A Classification Multivariate N/A Life 21557 2019-04-10
Facebook Live Sellers in Thailand 7051 12 N/A Clustering Multivariate Integer Business 38287 2019-04-22
Gas sensor array temperature modulation 4095000 20 N/A Classification, Regression Multivariate, Time-Series Real Computer 23045 2019-04-15
Rice Leaf Diseases 120 - N/A Classification Multivariate Integer Computer 44344 2019-04-14
Tarvel Review Ratings 5456 25 N/A Classification, Clustering Multivariate, Text Real N/A 53987 2018-12-19
Travel Reviews 980 11 N/A Classification, Clustering Multivariate, Text Real N/A 75520 2018-12-19
Behavior of the urban traffic of the city of Sao Paulo in Brazil 135 18 N/A Classification, Regression Multivariate, Time-Series Integer, Real Computer 49814 2018-12-12
Parking Birmingham 35717 4 Yes Classification, Regression, Clustering Multivariate, Univariate, Sequential, Time-Series Real Computer 52450 2019-01-02
EMG data for gestures 30000 6 N/A Classification Time-Series Real Life 51736 2019-01-07
2.4 GHZ Indoor Channel Measurements 7840 5 N/A Classification Multivariate Real Computer 63884 2018-11-30
Somerville Happiness Survey 143 7 N/A Classification N/A Integer Life 20918 2018-05-24
Real estate valuation data set 414 7 N/A Regression Multivariate Integer, Real Business 82553 2018-08-18
BuddyMove Data Set 249 7 N/A Classification, Clustering Multivariate, Text Real N/A 29594 2018-07-01
Audit Data 777 18 Yes Classification Multivariate Real N/A 43450 2018-07-14
BAUM-2 1047 - N/A Classification Time-Series N/A Computer 11114 2018-11-09
BAUM-1 1184 - N/A Classification Time-Series N/A Computer 16919 2018-11-09
Caesarian Section Classification Dataset 80 5 N/A Classification Univariate Integer Life 41469 2018-11-02
Electrical Grid Stability Simulated Data 10000 14 N/A Classification, Regression Multivariate Real Physical 49257 2018-11-16
Parkinson's Disease Classification 756 754 N/A Classification Multivariate Integer, Real Computer 64876 2018-11-05
PMU-UD 5180 9 N/A Classification Univariate N/A Computer 11307 2018-08-05
Online Shoppers Purchasing Intention Dataset 12330 18 N/A Classification, Clustering Multivariate Integer, Real Business 98570 2018-08-31
Student Academics Performance 300 22 N/A Classification Multivariate N/A Computer 68014 2018-09-16
GNFUV Unmanned Surface Vehicles Sensor Data Set 2 10190 6 Yes Regression Multivariate, Sequential, Time-Series Real Computer 10707 2018-09-13
WESAD (Wearable Stress and Affect Detection) 63000000 12 N/A Classification, Regression Multivariate, Time-Series Real Computer 42954 2018-09-14
Superconductivty Data 21263 81 N/A Regression Multivariate Real Physical 45713 2018-10-12
Physical Unclonable Functions 6000000 129 N/A Classification Multivariate Integer Computer 16537 2018-10-08
Drug Review Dataset (Drugs.com) 215063 6 N/A Classification, Regression, Clustering Multivariate, Text Integer Life 81565 2018-10-04
Drug Review Dataset (Druglib.com) 4143 8 N/A Classification, Regression, Clustering Multivariate, Text Integer N/A 54617 2018-10-02
PANDOR - - N/A Recommendation Multivariate Categorical Life 15228 2018-10-02
Avila 20867 10 N/A Classification Multivariate Real Computer 33990 2018-06-20
Roman Urdu Data Set 20000 2 N/A Classification Text N/A Computer 22119 2018-08-29
EEG Steady-State Visual Evoked Potential Signals 9200 16 N/A Classification, Regression Multivariate, Time-Series Integer Life 35863 2018-07-13
Multimodal Damage Identification for Humanitarian Computing 5879 - N/A Classification Multivariate, Text Integer Social 12896 2018-06-01
Simulated Falls and Daily Living Activities Data Set 3060 138 Yes Classification Time-Series Integer Life 82273 2018-06-06
Victorian Era Authorship Attribution 93600 1000 N/A Classification Text N/A Computer 17361 2018-05-31
Dishonest Internet users Dataset 322 5 N/A Classification, Clustering Multivariate N/A Computer 47847 2018-03-20
GNFUV Unmanned Surface Vehicles Sensor Data 1672 5 N/A Regression Multivariate, Time-Series Real Computer 22378 2018-05-06
Breast Cancer Coimbra 116 10 N/A Classification Multivariate Integer Life 94411 2018-03-06
Sports articles for objectivity analysis 1000 59 N/A Classification Multivariate, Text Integer Social 36399 2018-04-09
Optical Interconnection Network 640 10 N/A Classification, Regression Multivariate Integer, Real Computer 27086 2018-03-29
Carbon Nanotubes 10721 8 N/A Regression Univariate Real Computer 40226 2018-04-05
Condition monitoring of hydraulic systems 2205 43680 N/A Classification, Regression Multivariate, Time-Series Real Computer 55591 2018-04-26
SCADI 70 206 N/A Classification, Clustering Multivariate N/A Life 30661 2018-04-14
Absenteeism at work 740 21 N/A Classification, Clustering Multivariate, Time-Series Integer, Real Business 185596 2018-04-05
detection_of_IoT_botnet_attacks_N_BaIoT 7062606 115 N/A Classification, Clustering Multivariate, Sequential Real Computer 70507 2018-03-19
Repeat Consumption Matrices 130000 21000 N/A Clustering Multivariate Real Computer 21979 2018-03-22
SGEMM GPU kernel performance 241600 18 N/A Regression Multivariate Integer Computer 27219 2018-02-27
chipseq 4960 - N/A Classification Sequential Integer Life 21904 2018-02-21
Health News in Twitter 58000 25000 N/A Clustering Text Real Computer 51526 2018-02-19
Residential Building Data Set 372 105 N/A Regression Multivariate Real Computer 48971 2018-02-19
Container Crane Controller Data Set 15 3 N/A Classification, Regression Univariate, Domain-Theory Real Computer 32467 2018-01-01
BLE RSSI Dataset for Indoor localization and Navigation 6611 15 Yes Classification, Clustering Multivariate, Sequential, Time-Series Integer Computer 32095 2018-01-25
ICMLA 2014 Accepted Papers Data Set 105 5 N/A Classification, Clustering Multivariate N/A N/A 15998 2018-02-19
Ultrasonic flowmeter diagnostics 540 173 N/A Classification Multivariate Real Computer 17963 2018-01-13
News Popularity in Multiple Social Media Platforms 93239 11 N/A Regression Multivariate, Time-Series, Text Integer, Real Computer 60102 2018-02-20
Discrete Tone Image Dataset 71 11 N/A Classification Multivariate N/A Computer 20622 2018-01-20
OCT data & Color Fundus Images of Left & Right Eyes 50 2 N/A Classification Multivariate Real Computer 16636 2016-11-01
Cryotherapy Dataset 90 7 N/A Classification Univariate Integer, Real Life 45153 2018-01-04
Immunotherapy Dataset 90 8 N/A Classification Univariate Integer, Real Life 52550 2018-01-04
Activity recognition with healthy older people using a batteryless wearable sensor 75128 9 N/A Classification Sequential Real Life 36792 2016-12-12
Autism Screening Adult 704 21 Yes Classification N/A Integer Social 63291 2017-12-24
University of Tehran Question Dataset 2016 (UTQD.2016) 1175 3 Yes Classification Text N/A N/A 13196 2017-09-27
CSM (Conventional and Social Media Movies) Dataset 2014 and 2015 217 12 Yes Classification, Regression Multivariate Integer Computer 32306 2017-10-11
HCC Survival 165 49 Yes Classification Multivariate Integer, Real Life 35321 2017-11-29
Wireless Indoor Localization 2000 7 N/A Classification Multivariate Real Computer 47698 2017-12-04
APS Failure at Scania Trucks 60000 171 Yes Classification Multivariate Integer, Real Computer 59364 2017-12-08
Autistic Spectrum Disorder Screening Data for Adolescent 104 21 Yes Classification Multivariate Integer Life 31974 2017-12-24
Autistic Spectrum Disorder Screening Data for Children 292 21 Yes Classification Multivariate Integer Life 46435 2017-12-24
DeliciousMIL: A Data Set for Multi-Label Multi-Instance Learning with Instance Labels 12234 8519 N/A Classification Text Integer Computer 19110 2016-10-27
Character Font Images 745000 411 N/A Classification Multivariate Integer, Real Computer 35491 2016-08-14
Mturk User-Perceived Clusters over Images 180 500 N/A Clustering Multivariate, Text Integer Computer 9312 2016-11-02
DSRC Vehicle Communications 10000 5 N/A Clustering Sequential, Text Real Computer 31581 2017-12-13
IDA2016Challenge 76000 171 Yes Classification Multivariate Integer Computer 17849 2017-01-17
Dynamic Features of VirusShare Executables 107888 482 N/A Classification, Regression Multivariate, Time-Series Integer Computer 31474 2017-11-29
Z-Alizadeh Sani 303 56 N/A Classification N/A Integer, Real Life 23099 2017-11-17
extention of Z-Alizadeh sani dataset 303 59 N/A Classification N/A Integer, Real Life 12054 2017-11-17
Paper Reviews 405 10 Yes Classification, Regression Text Integer Computer 53974 2017-10-23
Daily Demand Forecasting Orders 60 13 N/A Regression Time-Series Integer Business 83216 2017-11-21
Gastrointestinal Lesions in Regular Colonoscopy 76 698 N/A Classification Multivariate Real Computer 18699 2016-10-15
TTC-3600: Benchmark dataset for Turkish text categorization 3600 4814 N/A Classification, Clustering Text Integer Computer 15376 2017-02-08
Anuran Calls (MFCCs) 7195 22 N/A Classification, Clustering Multivariate Real Life 50004 2017-02-24
Motion Capture Hand Postures 78095 38 Yes Classification, Clustering Multivariate Real Computer 31085 2017-01-27
Burst Header Packet (BHP) flooding attack on Optical Burst Switching (OBS) Network 1075 22 N/A Classification Text Integer Computer 31196 2017-08-28
Hybrid Indoor Positioning Dataset from WiFi RSSI, Bluetooth and magnetometer 1540 65 Yes Classification Multivariate, Sequential, Time-Series Real Computer 15627 2016-12-18
gene expression cancer RNA-Seq 801 20531 N/A Classification, Clustering Multivariate Real Life 63734 2016-06-09
Crowdsourced Mapping 10546 29 N/A Classification Multivariate N/A Physical 23804 2016-05-25
MEU-Mobile KSD 2856 71 N/A Classification Multivariate Integer, Real Computer 13364 2016-05-14
Eco-hotel 401 1 N/A N/A Text N/A Business 38875 2017-07-23
Las Vegas Strip 504 20 N/A Classification, Regression N/A Integer Business 97088 2017-07-23
Sales_Transactions_Dataset_Weekly 811 53 N/A Clustering Multivariate, Time-Series Integer, Real N/A 74026 2017-07-16
Parkinson Disease Spiral Drawings Using Digitized Graphics Tablet 77 7 N/A Classification, Regression, Clustering Multivariate Integer Computer 47281 2017-07-20
PM2.5 Data of Five Chinese Cities 52854 86 Yes Regression Multivariate, Time-Series Integer, Real Physical 86364 2017-07-18
Data for Software Engineering Teamwork Assessment in Education Setting 74 102 Yes Classification Sequential, Time-Series Integer, Real Computer 36002 2017-06-29
MoCap Hand Postures 78095 38 Yes Classification, Clustering Multivariate Integer, Real Computer 22619 2016-11-22
Stock portfolio performance 315 12 N/A Regression Multivariate Real Business 68843 2016-04-22
Devanagari Handwritten Character Dataset 92000 - N/A Classification N/A Integer Computer 29956 2016-09-01
Epileptic Seizure Recognition 11500 179 N/A Classification, Clustering Multivariate, Time-Series Integer, Real Life 118227 2017-05-24
Air Quality 9358 15 Yes Regression Multivariate, Time-Series Real Computer 466557 2016-03-23
FMA: A Dataset For Music Analysis 106574 518 N/A Classification, Clustering Multivariate, Time-Series Real Computer 88081 2017-05-24
Quality Assessment of Digital Colposcopies 287 69 N/A Classification Multivariate Real Life 20454 2017-03-08
KASANDR 17764280 2158859 N/A Causal-Discovery Multivariate Integer Life 35147 2017-05-16
Cervical cancer (Risk Factors) 858 36 Yes Classification Multivariate Integer, Real Life 128530 2017-03-03
Cargo 2000 Freight Tracking and Tracing 3942 98 Yes Classification, Regression Multivariate, Sequential Integer Business 45197 2016-11-03
Beijing PM2.5 Data 43824 13 Yes Regression Multivariate, Time-Series Integer, Real Physical 184768 2017-01-19
YouTube Spam Collection 1956 5 N/A Classification Text N/A Computer 85577 2017-03-26
Website Phishing 1353 10 N/A Classification Multivariate Integer Computer 87815 2016-11-02
DrivFace 606 6400 N/A Classification, Regression, Clustering Multivariate Real Computer 37604 2016-05-26
Geo-Magnetic field and WLAN dataset for indoor localisation from wristband and smartphone 153540 25 N/A Classification, Regression, Clustering Multivariate, Sequential, Time-Series Integer, Real Computer 33191 2017-01-10
KDC-4007 dataset Collection 4007 - N/A Classification, Regression Multivariate, Text Integer Computer 31993 2017-04-27
Miskolc IIS Hybrid IPS 1540 67 Yes Classification, Clustering, Causal-Discovery Text Integer Computer 13907 2016-07-04
Appliances energy prediction 19735 29 N/A Regression Multivariate, Time-Series Real Computer 131725 2017-02-15
Drug consumption (quantified) 1885 32 N/A Classification Multivariate Real Social 135027 2016-10-17
HTRU2 17898 9 N/A Classification, Clustering Multivariate Real Physical 65913 2017-02-14
NIPS Conference Papers 1987-2015 11463 5812 N/A Clustering Text Integer Computer 51536 2016-11-23
UbiqLog (smartphone lifelogging) 9782222 - N/A Causal-Discovery Multivariate N/A Computer 31334 2016-06-16
Facebook metrics 500 19 N/A Regression Multivariate Integer Business 171458 2016-08-05
Dota2 Games Results 102944 116 N/A Classification Multivariate N/A Game 103791 2016-08-14
Activity Recognition system based on Multisensor data fusion (AReM) 42240 6 N/A Classification Multivariate, Sequential, Time-Series Real Computer 65279 2016-05-18
Polish companies bankruptcy data 10503 64 Yes Classification Multivariate Real Business 110783 2016-04-11
Smartphone Dataset for Human Activity Recognition (HAR) in Ambient Assisted Living (AAL) 5744 561 N/A Classification Time-Series Real Computer 50348 2016-03-09
Facebook Comment Volume Dataset 40949 54 N/A Regression Multivariate Integer, Real N/A 97884 2016-03-11
Twin gas sensor arrays 640 480000 N/A Classification, Regression Multivariate, Time-Series, Domain-Theory Real Computer 48854 2016-05-19
Gas sensors for home activity monitoring 919438 11 N/A Classification Multivariate, Time-Series Real Computer 84457 2016-07-15
Air Quality 9358 15 Yes Regression Multivariate, Time-Series Real Computer 466558 2016-03-23
Occupancy Detection 20560 7 N/A Classification Multivariate, Time-Series Real Computer 147019 2016-02-29
News Aggregator 422937 5 N/A Classification, Clustering Multivariate N/A N/A 91673 2016-02-28
Detect Malacious Executable(AntiVirus) 373 513 Yes Classification Multivariate Real Computer 258091 2016-03-03
GPS Trajectories 163 15 Yes Classification, Regression Multivariate Real Computer 111310 2016-02-29
Online Retail 541909 8 N/A Classification, Clustering Multivariate, Sequential, Time-Series Integer, Real Business 519933 2015-11-06
default of credit card clients 30000 24 N/A Classification Multivariate Integer, Real Business 513329 2016-01-26
SIFT10M 11164866 128 N/A Causal-Discovery Multivariate Integer Computer 32916 2016-02-23
Open University Learning Analytics dataset - - Yes Classification, Regression, Clustering Multivariate, Sequential, Time-Series Integer Computer 65001 2015-12-21
Indoor User Movement Prediction from RSS data 13197 4 N/A Classification Multivariate, Sequential, Time-Series Real Computer 64965 2016-02-04
HEPMASS 10500000 28 N/A Classification Multivariate Real Physical 66210 2016-01-28
Educational Process Mining (EPM): A Learning Analytics Data Set 230318 13 N/A Classification, Regression, Clustering Multivariate, Sequential, Time-Series Integer Computer 102740 2015-09-24
Heterogeneity Activity Recognition 43930257 16 Yes Classification, Clustering Multivariate, Time-Series Real Computer 105129 2015-10-26
UJIIndoorLoc-Mag 40000 13 N/A Classification, Regression, Clustering Multivariate, Sequential, Time-Series Integer, Real Computer 44049 2015-09-10
Mice Protein Expression 1080 82 Yes Classification, Clustering Multivariate Real Life 81867 2015-08-04
Smartphone-Based Recognition of Human Activities and Postural Transitions 10929 561 N/A Classification Multivariate, Time-Series Real Life 179423 2015-07-29
Cuff-Less Blood Pressure Estimation 12000 3 Yes Classification, Regression Multivariate Real Life 118356 2015-07-27
Taxi Service Trajectory - Prediction Challenge, ECML PKDD 2015 1710671 9 Yes Clustering, Causal-Discovery Multivariate, Sequential, Time-Series, Domain-Theory Real Computer 83897 2015-07-11
Folio 637 20 Yes Classification, Clustering Multivariate N/A N/A 60203 2015-07-05
Machine Learning based ZZAlpha Ltd. Stock Recommendations 2012-2014 314080 - Yes Classification Sequential, Time-Series Real Business 53191 2015-06-06
Chronic_Kidney_Disease 400 25 Yes Classification Multivariate Real N/A 178826 2015-07-03
Online Video Characteristics and Transcoding Time Dataset 168286 11 N/A Regression Multivariate Integer, Real Computer 44327 2015-05-19
wiki4HE 913 53 Yes Regression, Clustering, Causal-Discovery Multivariate N/A Social 49479 2015-05-04
Forest type mapping 326 27 N/A Classification Multivariate N/A Life 60786 2015-05-25
Sentiment Labelled Sentences 3000 - N/A Classification Text N/A N/A 181894 2015-05-30
HIV-1 protease cleavage 6590 1 N/A Classification Multivariate Categorical Life 62458 2015-04-25
Online News Popularity 39797 61 N/A Classification, Regression Multivariate Integer, Real Business 305574 2015-05-31
Diabetic Retinopathy Debrecen Data Set 1151 20 N/A Classification Multivariate Integer, Real Life 111299 2014-11-03
Greenhouse Gas Observing Network 2921 5232 N/A Regression Multivariate, Time-Series Real Physical 58991 2015-04-16
Phishing Websites 2456 30 N/A Classification N/A Integer Computer Security 152907 2015-03-26
TV News Channel Commercial Detection Dataset 129685 12 N/A Classification, Clustering Multivariate Real Computer 74691 2015-03-27
Dataset for Sensorless Drive Diagnosis 58509 49 N/A Classification Multivariate Real Computer 75041 2015-02-24
Firm-Teacher_Clave-Direction_Classification 10800 20 N/A Classification Multivariate N/A N/A 29751 2015-04-24
microblogPCU 221579 20 Yes Classification, Causal-Discovery Multivariate, Univariate, Sequential, Text Integer, Real Computer 51181 2015-03-17
Gas sensor array under dynamic gas mixtures 4178504 19 N/A Classification, Regression Multivariate, Time-Series Real Computer 54789 2015-03-20
ElectricityLoadDiagrams20112014 370 140256 N/A Regression, Clustering Time-Series Real Computer 79325 2015-03-13
Student Performance 649 33 N/A Classification, Regression Multivariate Integer Social 793241 2014-11-27
MHEALTH Dataset 120 23 N/A Classification Multivariate, Time-Series Real Computer 104544 2014-12-07
NoisyOffice 216 216 N/A Classification, Regression Multivariate Real Computer 68774 2015-01-03
Grammatical Facial Expressions 27965 100 N/A Classification, Clustering Multivariate, Sequential Real Computer 73596 2014-10-06
Condition Based Maintenance of Naval Propulsion Plants 11934 16 N/A Regression Multivariate Real Computer 61846 2014-09-11
AAAI 2013 Accepted Papers 150 5 N/A Clustering Multivariate N/A Computer 57852 2014-07-30
sEMG for Basic Hand movements 3000 2500 N/A Classification Time-Series Real Life 70242 2014-11-18
Geographical Original of Music 1059 68 N/A Classification, Regression Multivariate Real N/A 89050 2014-10-18
Dow Jones Index 750 16 N/A Classification, Clustering Time-Series Integer, Real Business 219481 2014-10-23
Sentence Classification - - N/A Classification Text Integer N/A 77492 2014-11-05
UJIIndoorLoc 21048 529 N/A Classification, Regression Multivariate Integer, Real Computer 100431 2014-09-18
Gas sensor array exposed to turbulent gas mixtures 180 150000 N/A Classification, Regression Multivariate, Time-Series Real Computer 42489 2014-10-10
Gas sensor array under flow modulation 58 120432 N/A Classification, Regression Multivariate, Time-Series Real Computer 39619 2014-09-10
AAAI 2014 Accepted Papers 399 6 N/A Clustering Multivariate N/A Computer 55351 2014-07-30
Newspaper and magazine images segmentation dataset 101 - N/A Classification N/A N/A Computer 28499 2014-07-15
REALDISP Activity Recognition Dataset 1419 120 N/A Classification Multivariate, Time-Series Real Computer 46208 2014-07-25
BlogFeedback 60021 281 N/A Regression Multivariate Integer, Real Social 99679 2014-05-29
Perfume Data 560 2 N/A Classification, Clustering Univariate, Domain-Theory Integer Computer 145249 2014-07-22
Gesture Phase Segmentation 9900 50 N/A Classification, Clustering Multivariate, Sequential, Time-Series Real N/A 53693 2014-06-18
Parkinson Speech Dataset with Multiple Types of Sound Recordings 1040 26 N/A Classification, Regression Multivariate Integer, Real Life 103263 2014-06-12
Tennis Major Tournament Match Statistics 127 42 Yes Classification, Regression, Clustering Multivariate Integer, Real N/A 103187 2014-06-01
StoneFlakes 79 8 Yes Classification, Clustering, Causal-Discovery Multivariate Real N/A 64207 2014-05-20
Bach Choral Harmony 5665 17 N/A Classification Sequential N/A N/A 50985 2014-05-20
Diabetes 130-US hospitals for years 1999-2008 100000 55 Yes Classification, Clustering Multivariate Integer Life 325745 2014-05-03
Urban Land Cover 168 148 N/A Classification Multivariate N/A Physical 48366 2014-03-27
Combined Cycle Power Plant 9568 4 N/A Regression Multivariate Real Computer 179240 2014-03-26
Twitter Data set for Arabic Sentiment Analysis 2000 2 N/A Classification Text N/A Social 93019 2014-04-11
Wholesale customers 440 8 N/A Classification, Clustering Multivariate Integer Business 359627 2014-03-31
Airfoil Self-Noise 1503 6 N/A Regression Multivariate Real Physical 137260 2014-03-04
Tamilnadu Electricity Board Hourly Readings 45781 5 N/A Classification, Regression, Clustering Multivariate Real Life 76095 2013-12-22
Dresses_Attribute_Sales 501 13 Yes Classification, Clustering Text N/A Computer 112363 2014-02-19
Leaf 340 16 N/A Classification Multivariate Real Computer 130033 2014-02-24
Activity Recognition from Single Chest-Mounted Accelerometer - - N/A Classification, Clustering Univariate, Sequential, Time-Series Real N/A 113260 2014-03-02
User Identification From Walking Activity - - N/A Classification, Clustering Univariate, Sequential, Time-Series Real N/A 75035 2014-03-02
Wilt 4889 6 N/A Classification Multivariate N/A Life 61441 2014-03-13
Dataset for ADL Recognition with Wrist-worn Accelerometer - 3 N/A Classification, Clustering Multivariate, Time-Series N/A Computer 81031 2014-02-11
LSVT Voice Rehabilitation 126 309 N/A Classification Multivariate Real Life 35577 2014-02-19
Qualitative_Bankruptcy 250 7 N/A Classification Multivariate N/A Computer 83378 2014-02-09
HIGGS 11000000 28 N/A Classification N/A Real Physical 155836 2014-02-12
SUSY 5000000 18 N/A Classification N/A Real Physical 72064 2014-02-12
EMG dataset in Lower Limb 132 5 N/A N/A Multivariate, Time-Series Real Computer 50748 2014-02-05
Predict keywords activities in a online social media 51 35 N/A N/A Multivariate, Sequential, Time-Series Integer, Real Computer 46201 2013-12-12
Thoracic Surgery Data 470 17 N/A Classification Multivariate Integer, Real Life 103643 2013-11-13
Bike Sharing Dataset 17389 16 N/A Regression Univariate Integer, Real Social 515091 2013-12-20
SML2010 4137 24 Yes Regression Multivariate, Sequential, Time-Series, Text Real Computer 129466 2014-01-09
Weight Lifting Exercises monitored with Inertial Measurement Units 39242 152 Yes Classification Multivariate Real Physical 46277 2013-11-24
SkillCraft1 Master Table Dataset 3395 20 Yes Regression Multivariate Integer, Real Game 65678 2013-10-22
Activities of Daily Living (ADLs) Recognition Using Binary Sensors 2747 - N/A Classification, Clustering Multivariate, Sequential, Time-Series N/A Computer 92672 2013-10-28
Gas Sensor Array Drift Dataset at Different Concentrations 13910 129 N/A Classification, Regression, Clustering, Causa Multivariate, Time-Series Real Computer 72041 2013-10-23
YouTube Multiview Video Games Dataset 120000 1000000 Yes Classification, Clustering Multivariate, Text Integer, Real Computer 132139 2013-10-16
banknote authentication 1372 5 N/A Classification Multivariate Real Computer 268914 2013-04-16
USPTO Algorithm Challenge, run by NASA-Harvard Tournament Lab and TopCoder Problem: Pat 306 5 N/A Classification Domain-Theory Integer N/A 34333 2013-10-13
seismic-bumps 2584 19 N/A Classification Multivariate Real N/A 68044 2013-04-03
Physicochemical Properties of Protein Tertiary Structure 45730 9 N/A Regression Multivariate Real Life 57099 2013-03-31
EEG Eye State 14980 15 N/A Classification Multivariate, Sequential, Time-Series Integer, Real Life 125922 2013-06-10
ser Knowledge Modeling Data (Students' Knowledge Levels on DC Electrical Machines) 403 5 N/A Classification Multivariate Real Computer 38786 2013-06-20
Turkiye Student Evaluation 5820 33 N/A Classification, Clustering Multivariate N/A N/A 103252 2013-09-01
NYSK 10421 7 N/A Clustering Multivariate, Sequential, Text N/A Social 54403 2013-10-11
Reuters RCV1 RCV2 Multilingual, Multiview Text Categorization Test collection 111740 - N/A Classification Multivariate Real Life 61664 2013-09-06
User Knowledge Modeling 403 5 N/A Classification, Clustering Multivariate Integer Computer 129846 2013-06-26
Daily and Sports Activities 9120 5625 N/A Classification, Clustering Multivariate, Time-Series Real Computer 194007 2013-07-08
BLOGGER 100 6 N/A Classification Multivariate N/A Computer 83945 2013-07-06
QSAR biodegradation 1055 41 N/A Classification Multivariate Integer, Real N/A 49033 2013-06-21
MicroMass 931 1300 N/A Classification Multivariate Real Life 48208 2013-08-12
Climate Model Simulation Crashes 540 18 N/A Classification Multivariate Real Physical 88362 2013-06-18
Gas sensor arrays in open sampling settings 18000 1950000 N/A Classification Multivariate, Time-Series Real Computer 47483 2013-06-05
Wearable Computing: Classification of Body Postures and Movements (PUC-Rio) 165632 18 N/A Classification Sequential Integer, Real Computer 60340 2013-04-09
First-order theorem proving 6118 51 N/A Classification Multivariate Real Computer 44103 2013-04-17
ISTANBUL STOCK EXCHANGE 536 8 N/A Classification, Regression Multivariate, Univariate, Time-Series Real Business 139705 2013-06-01
Buzz in social media 140000 77 N/A Regression, Classification Time-Series, Multivariate Integer, Real Computer 161986 2013-05-27
3D Road Network (North Jutland, Denmark) 434874 4 N/A Regression, Clustering Sequential, Text Real Computer 192288 2013-04-16
Daphnet Freezing of Gait 237 9 N/A Classification Multivariate, Time-Series Real Life 72923 2013-03-07
Fertility 100 10 N/A Classification, Regression Multivariate Real Life 207730 2013-01-17
Yacht Hydrodynamics 308 7 N/A Regression Multivariate Real Physical 101697 2013-01-03
Energy efficiency 768 8 N/A Classification, Regression Multivariate Integer, Real Computer 318830 2012-11-30
One-hundred plant species leaves data set 1600 64 N/A Classification N/A Real Life 74289 2012-12-03
Human Activity Recognition Using Smartphones 10299 561 N/A Classification, Clustering Multivariate, Time-Series N/A Computer 1047147 2012-12-10
QtyT40I10D100K 3960456 4 N/A N/A Sequential Integer N/A 41949 2012-10-21
Legal Case Reports - - N/A Classification Text N/A N/A 91674 2012-10-19
Northix 115 200 N/A Classification Multivariate, Univariate, Text Integer, Real Computer 38916 2012-08-15
seeds 210 7 N/A Classification, Clustering Multivariate Real Life 318928 2012-09-29
Individual household electric power consumption 2075259 9 Yes Regression, Clustering Multivariate, Time-Series Real Physical 383340 2012-08-30
CNAE-9 1080 857 N/A Classification Multivariate, Text Integer Business 66846 2012-08-03
Restaurant & consumer data 138 47 Yes N/A Multivariate N/A Computer 146620 2012-08-04
PAMAP2 Physical Activity Monitoring 3850505 52 Yes Classification Multivariate, Time-Series Real Computer 97187 2012-08-06
Planning Relax 182 13 N/A Classification Univariate Real Computer 60850 2012-07-17
Skin Segmentation 245057 4 N/A Classification Univariate Real Computer 208817 2012-07-17
SMS Spam Collection 5574 - N/A Classification, Clustering Multivariate, Text, Domain-Theory Real Computer 324998 2012-06-22
Nomao 34465 120 Yes Classification Univariate Real Computer 53233 2012-07-04
OPPORTUNITY Activity Recognition 2551 242 Yes Classification Multivariate, Time-Series Real Computer 101978 2012-06-09
ILPD (Indian Liver Patient Dataset) 583 10 N/A Classification Multivariate Integer, Real Life 137913 2012-05-21
Gas Sensor Array Drift Dataset 13910 128 N/A Classification Multivariate Real Computer 145776 2012-04-25
YouTube Comedy Slam Preference Data 1138562 3 N/A Classification Text N/A Computer 93943 2012-04-10
Bank Marketing 45211 17 N/A Classification Multivariate Real Business 1306273 2012-02-14
KEGG Metabolic Reaction Network (Undirected) 65554 29 Yes Classification, Regression, Clustering Multivariate, Univariate, Text Integer, Real Life 55033 2011-11-28
KEGG Metabolic Relation Network (Directed) 53414 24 N/A Classification, Regression, Clustering Multivariate, Univariate, Text Integer, Real Life 57774 2011-11-28
DBWorld e-mails 64 4702 N/A Classification Text N/A Computer 59822 2011-11-06
Farm Ads 4143 54877 N/A Classification Text N/A Business 76113 2011-10-18
Reuter_50_50 2500 10000 N/A Classification, Clustering Multivariate, Text, Domain-Theory Real Computer 66615 2011-09-08
Amazon Access Samples 30000 20000 N/A Regression, Clustering, Causal-Discovery Time-Series, Domain-Theory N/A Business 210246 2011-09-13
Amazon Commerce reviews set 1500 10000 N/A Classification Multivariate, Text, Domain-Theory Real Physical 204156 2011-06-11
Vicon Physical Action Data Set 3000 27 N/A Classification Time-Series Real Physical 53742 2011-07-27
EMG Physical Action Data Set 10000 8 N/A Classification Time-Series Real Physical 87751 2011-07-27
Vertebral Column 310 6 N/A Classification Multivariate Real N/A 174098 2011-08-09
Record Linkage Comparison Patterns 5749132 12 Yes Classification Multivariate Real N/A 81784 2011-03-10
PubChem Bioassay Data - - N/A Classification Multivariate Integer, Real Life 39689 2011-03-29
Communities and Crime Unnormalized 2215 147 Yes Regression Multivariate Real Social 149002 2011-03-02
Online Handwritten Assamese Characters Dataset 8235 - N/A Classification Multivariate, Sequential Integer Computer 48232 2011-04-01
Relative location of CT slices on axial axis 53500 386 N/A Regression Domain-Theory Real Computer 58080 2011-07-07
OpinRank Review Dataset - - N/A N/A Text N/A Computer 52042 2011-07-26
PEMS-SF 440 138672 N/A Classification Multivariate, Time-Series Real Computer 71850 2011-05-22
YearPredictionMSD 515345 90 N/A Regression Multivariate Real N/A 182134 2011-02-07
MiniBooNE particle identification 130065 50 N/A Classification Multivariate Real Physical 53333 2010-12-13
Steel Plates Faults 1941 27 N/A Classification Multivariate Integer, Real Physical 84519 2010-10-26
AutoUniv - - N/A Classification Multivariate Categorical, Integer, Real N/A 60182 2010-11-03
Localization Data for Person Activity 164860 8 N/A Classification Univariate, Sequential, Time-Series Real Life 108052 2010-11-03
Spoken Arabic Digit 8800 13 No Classification Multivariate, Time-Series Real N/A 79347 2010-09-13
Wall-Following Robot Navigation Data 5456 24 N/A Classification Multivariate, Sequential Real Computer 63966 2010-08-04
Cardiotocography 2126 23 N/A Classification Multivariate Real Life 174228 2010-09-07
Breast Tissue 106 10 N/A Classification Multivariate Real Life 128419 2010-05-10
Opinosis Opinion ⁄ Review 51 - N/A N/A Text N/A Computer 50097 2010-07-06
Demospongiae 503 - Yes Classification Multivariate Integer Life 56461 2010-01-21
Parkinsons Telemonitoring 5875 26 N/A Regression Multivariate Integer, Real Life 154537 2009-10-29
p53 Mutants 16772 5409 Yes Classification Multivariate Real Life 80826 2010-02-09
URL Reputation 2396130 3231961 N/A Classification Multivariate, Time-Series Integer, Real Computer 140979 2009-10-15
Wine Quality 4898 12 N/A Classification, Regression Multivariate Real Business 1337127 2009-10-07
Acute Inflammations 120 6 No Classification Multivariate Categorical, Integer Life 180535 2009-02-11
Communities and Crime 1994 128 Yes Regression Multivariate Real Social 295628 2009-07-13
Concrete Slump Test 103 10 N/A Regression Multivariate Real Computer 102557 2009-04-30
Libras Movement 360 91 N/A Classification, Clustering Multivariate, Sequential Real N/A 86412 2009-08-17
Plants 22632 70 Yes Clustering Multivariate Categorical Life 196765 2008-12-31
SECOM 1567 591 Yes Classification, Causal-Discovery Multivariate Real Computer 126129 2008-11-19
Semeion Handwritten Digit 1593 256 N/A Classification Multivariate Integer Computer 129831 2008-11-11
UJI Pen Characters (Version 2) 11640 - N/A Classification Multivariate, Sequential Integer Computer 65902 2009-01-22
Blood Transfusion Service Center 748 5 N/A Classification Multivariate Real Business 338152 2008-10-03
Character Trajectories 2858 3 N/A Classification, Clustering Time-Series Real Computer 150142 2008-08-20
Parkinsons 197 23 N/A Classification Multivariate Real Life 275722 2008-06-26
Abscisic Acid Signaling Network 300 43 N/A Causal-Discovery Multivariate Integer Life 57315 2008-04-03
Ozone Level Detection 2536 73 Yes Classification Multivariate, Sequential, Time-Series Real Physical 142896 2008-04-21
Madelon 4400 500 N/A Classification Multivariate Real N/A 137996 2008-02-29
Gisette 13500 5000 N/A Classification Multivariate Integer Computer 120075 2008-02-29
Dorothea 1950 100000 N/A Classification Multivariate Integer Life 111432 2008-02-29
Arcene 900 10000 N/A Classification Multivariate Real Life 136091 2008-02-29
Dexter 2600 20000 N/A Classification Multivariate Integer N/A 105433 2008-02-29
Concrete Compressive Strength 1030 9 N/A Regression Multivariate Real Physical 216030 2007-08-03
Hill-Valley 606 101 N/A Classification Sequential Real N/A 72118 2008-03-20
Bag of Words 8000000 100000 N/A Clustering Text Integer N/A 316536 2008-03-12
Reuters Transcribed Subset 200 - N/A Classification Text N/A Business 57645 2008-03-08
Forest Fires 517 13 N/A Regression Multivariate Real Physical 932642 2008-02-29
Mammographic Mass 961 6 Yes Classification Multivariate Integer Life 177568 2007-10-29
UJI Pen Characters 1364 - No Classification Multivariate, Sequential Integer Computer 98566 2007-06-01
MAGIC Gamma Telescope 19020 11 No Classification Multivariate Real Physical 118222 2007-05-01
Poker Hand 1025010 11 No Classification Multivariate Categorical, Integer Game 618890 2007-01-01
Dodgers Loop Sensor 50400 3 Yes N/A Multivariate, Time-Series Categorical, Integer N/A 75700 2006-12-01
CalIt2 Building People Counts 10080 4 No N/A Multivariate, Time-Series Categorical, Integer N/A 55066 2006-12-01
Cloud 1024 10 N/A N/A Multivariate Real Physical 134426 1989-08-03
Protein Data - - N/A N/A N/A N/A Life 73120 N/A
Economic Sanctions - - N/A N/A Domain-Theory N/A Financial 60603 N/A
Connectionist Bench (Vowel Recognition - Deterding Data) 528 10 N/A Classification N/A Real N/A 75569 N/A
Connectionist Bench (Sonar, Mines vs. Rocks) 208 60 N/A Classification Multivariate Real Physical 189300 N/A
Connectionist Bench (Nettalk Corpus) 20008 4 N/A N/A Multivariate Categorical N/A 47560 N/A
Statlog (Shuttle) 58000 9 N/A Classification Multivariate Integer Physical 142458 N/A
Statlog (Vehicle Silhouettes) 946 18 N/A Classification Multivariate Integer N/A 131301 N/A
Statlog (Image Segmentation) 2310 19 No Classification Multivariate Real N/A 62467 1990-11-01
Statlog (Landsat Satellite) 6435 36 N/A Classification Multivariate Integer Physical 136504 1993-02-13
Statlog (Heart) 270 13 No Classification Multivariate Categorical, Real Life 229057 N/A
Statlog (German Credit Data) 1000 20 N/A Classification Multivariate Categorical, Integer Financial 650974 1994-11-17
Statlog (Australian Credit Approval) 690 14 Yes Classification Multivariate Categorical, Integer, Real Financial 173126 N/A
Volcanoes on Venus - JARtool experiment - - Yes Classification Image N/A Physical 49566 N/A
UNIX User Data - - N/A N/A Text, Sequential N/A Computer 56246 N/A
Syskill and Webert Web Page Ratings 332 5 N/A Classification Multivariate, Text Categorical Computer 68818 1998-10-20
Synthetic Control Chart Time Series 600 - No Classification, Clustering Time-Series Real N/A 81402 1999-06-08
Robot Execution Failures 463 90 N/A Classification Multivariate, Time-Series Integer Physical 104428 1999-04-23
Reuters-21578 Text Categorization Collection 21578 5 N/A Classification Text Categorical N/A 194864 1997-09-26
Pseudo Periodic Synthetic Time Series 100000 - N/A N/A Univariate, Time-Series N/A N/A 46544 1999-02-08
Pioneer-1 Mobile Robot Data - - N/A N/A Multivariate, Time-Series Categorical, Real Computer 39911 1999-01-28
NSF Research Award Abstracts 1990-2003 129000 - N/A N/A Text N/A N/A 48436 2003-11-18
MSNBC.com Anonymous Web Data 989818 - N/A N/A Sequential Categorical Computer 82875 N/A
Movie 10000 - Yes N/A Multivariate, Relational N/A N/A 215311 1999-07-07
M. Tuberculosis Genes - - N/A N/A Relational N/A Life 38653 2001-07-14
KDD Cup 1999 Data 4000000 42 N/A Classification Multivariate Categorical, Integer Computer 159398 1999-01-01
KDD Cup 1998 Data 191779 481 Yes Regression Multivariate Categorical, Integer N/A 79860 1998-07-20
Japanese Vowels 640 12 N/A Classification Multivariate, Time-Series Real N/A 112421 N/A
IPUMS Census Database 256932 61 N/A N/A Multivariate Categorical, Integer Social 42159 1999-11-09
Internet Usage Data 10104 72 No N/A Multivariate Categorical, Integer Computer 118800 1999-06-30
Insurance Company Benchmark (COIL 2000) 9000 86 No Regression, Description Multivariate Categorical, Integer Social 159433 2000-07-03
CMU Face Images 640 - Yes Classification Image Integer N/A 151100 1999-06-24
Entree Chicago Recommendation Data 50672 - Yes Recommender-Systems Transactional, Sequential Categorical N/A 95694 2000-03-09
El Nino 178080 12 Yes N/A Spatio-temporal Integer, Real Physical 101652 1999-06-30
EEG Database 122 4 Yes N/A Multivariate, Time-Series Categorical, Integer, Real Life 218161 1999-10-13
E. Coli Genes - - Yes N/A Relational N/A Life 48969 2001-07-14
Corel Image Features 68040 89 N/A N/A Multivariate Real N/A 88416 1999-07-01
Coil 1999 Competition Data 340 17 No N/A Multivariate Categorical, Real Physical 51611 1999-09-09
Census-Income (KDD) 299285 40 Yes Classification Multivariate Categorical, Integer Social 179537 2000-03-07
US Census Data (1990) 2458285 68 N/A Clustering Multivariate Categorical Social 160265 N/A
Australian Sign Language signs (High Quality) 2565 22 N/A Classification Multivariate, Time-Series Real N/A 114097 2002-02-26
Australian Sign Language signs 6650 15 N/A Classification Multivariate, Time-Series Categorical, Real N/A 102339 1999-04-20
Twenty Newsgroups 20000 - No N/A Text N/A N/A 117592 1999-09-09
Undocumented - - N/A N/A N/A N/A N/A 28382 N/A
Zoo 101 17 No Classification Multivariate Categorical, Integer Life 330659 1990-05-15
Yeast 1484 8 No Classification Multivariate Real Life 313082 1996-09-01
Wine 178 13 No Classification Multivariate Integer, Real Physical 1514474 1991-07-01
Waveform Database Generator (Version 2) 5000 40 No Classification Multivariate, Data-Generator Real Physical 61728 1988-11-10
Waveform Database Generator (Version 1) 5000 21 No Classification Multivariate, Data-Generator Real Physical 76393 1988-11-10
Water Treatment Plant 527 38 N/A Clustering Multivariate Integer, Real Physical 149382 1993-06-01
University 285 17 Yes Classification Multivariate Categorical, Integer N/A 226977 1988-07-01
Congressional Voting Records 435 16 Yes Classification Multivariate Categorical Social 228842 1987-04-27
Trains 10 32 N/A Classification Multivariate Categorical N/A 158990 1994-06-24
Thyroid Disease 7200 21 N/A Classification Multivariate, Domain-Theory Categorical, Real Life 247082 1987-01-01
Tic-Tac-Toe Endgame 958 9 No Classification Multivariate Categorical Game 248336 1991-08-19
Teaching Assistant Evaluation 151 5 No Classification Multivariate Categorical, Integer N/A 165636 1997-06-07
Student Loan Relational 1000 - N/A N/A Domain-Theory N/A Social 69606 1993-01-01
Sponge 76 45 Yes Clustering Multivariate Categorical, Integer Life 95883 N/A
Statlog Project - - N/A N/A N/A N/A N/A 48365 1992-10-01
SPECTF Heart 267 44 No Classification Multivariate Integer Life 96134 2001-10-01
SPECT Heart 267 22 No Classification Multivariate Categorical Life 207924 2001-10-01
Spambase 4601 57 Yes Classification Multivariate Integer, Real Computer 535647 1999-07-01
Low Resolution Spectrometer 531 102 N/A Classification Multivariate Integer, Real Physical 54111 1988-03-01
Challenger USA Space Shuttle O-Ring 23 4 No Regression Multivariate Integer Physical 144163 1993-08-05
Soybean (Small) 47 35 No Classification Multivariate Categorical Life 104703 1987-01-01
Solar Flare 1389 10 No Regression Multivariate Categorical Physical 153630 1989-03-01
Soybean (Large) 307 35 Yes Classification Multivariate Categorical Life 130638 1988-07-11
Shuttle Landing Control 15 6 No Classification Multivariate Categorical Physical 106443 1988-11-01
Servo 167 4 No Regression Multivariate Categorical, Integer Computer 109313 1993-05-01
MONK's Problems 432 7 No Classification Multivariate Categorical N/A 213641 1992-10-01
Molecular Biology (Splice-junction Gene Sequences) 3190 61 No Classification Sequential, Domain-Theory Categorical Life 103645 1992-01-01
Molecular Biology (Protein Secondary Structure) 128 - N/A Classification Sequential Categorical Life 57140 N/A
Molecular Biology (Promoter Gene Sequences) 106 58 No Classification Sequential, Domain-Theory Categorical Life 79828 1990-06-30
Mobile Robots - - N/A N/A Domain-Theory Categorical, Integer, Real Computer 75742 1995-07-15
Meta-data 528 22 Yes Classification Multivariate Categorical, Integer, Real N/A 76314 1996-03-01
Mechanical Analysis 209 8 No Classification Multivariate Categorical, Integer, Real Computer 102007 1990-06-01
Lung Cancer 32 56 Yes Classification Multivariate Integer Life 312817 1992-05-01
Lymphography 148 18 No Classification Multivariate Categorical Life 92309 1988-11-01
Logic Theorist - - N/A N/A Domain-Theory N/A Computer 34702 N/A
Liver Disorders 345 7 No N/A Multivariate Categorical, Integer, Real Life 183890 1990-05-15
Letter Recognition 20000 16 No Classification Multivariate Integer Computer 400773 1991-01-01
Lenses 24 4 No Classification Multivariate Categorical N/A 178391 1990-08-01
LED Display Domain - 7 No Classification Multivariate, Data-Generator Categorical Computer 70925 1988-11-10
Labor Relations 57 16 No N/A Multivariate Categorical, Integer, Real Social 80641 1988-11-01
Kinship 104 12 No Relational-Learning Relational Categorical Social 78046 1990-07-01
ISOLET 7797 617 No Classification Multivariate Real Computer 126195 1994-09-12
Iris 150 4 No Classification Multivariate Real Life 3616637 1988-07-01
Ionosphere 351 34 No Classification Multivariate Integer, Real Physical 238726 1989-01-01
Internet Advertisements 3279 1558 Yes Classification Multivariate Categorical, Integer, Real Computer 361856 1998-07-01
Image Segmentation 2310 19 No Classification Multivariate Real N/A 203626 1990-11-01
ICU - - No N/A Multivariate, Time-Series Real Life 96581 N/A
Horse Colic 368 27 Yes Classification Multivariate Categorical, Integer, Real Life 153196 1989-08-06
Hepatitis 155 19 Yes Classification Multivariate Categorical, Integer, Real Life 265614 1988-11-01
Hayes-Roth 160 5 No Classification Multivariate Categorical Social 103803 1989-03-01
Heart Disease 303 75 Yes Classification Multivariate Categorical, Integer, Real Life 1340373 1988-07-01
Haberman's Survival 306 3 No Classification Multivariate Integer Life 217896 1999-03-04
Function Finding 352 - No Function-Learning N/A Real Physical 55046 1990-09-01
Flags 194 30 No Classification Multivariate Categorical, Integer N/A 311383 1990-05-15
Glass Identification 214 10 No Classification Multivariate Real Physical 359366 1987-09-01
Ecoli 336 8 No Classification Multivariate Real Life 239214 1996-09-01
Echocardiogram 132 12 Yes Classification Multivariate Categorical, Integer, Real Life 193496 1989-02-28
EBL Domain Theories - - N/A N/A N/A N/A Computer 24667 N/A
Document Understanding - - No N/A N/A N/A N/A 45318 1994-11-01
DGP2 - The Second Data Generation Program - - N/A N/A Data-Generator Real N/A 32305 N/A
Diabetes - 20 N/A N/A Multivariate, Time-Series Categorical, Integer Life 514018 N/A
Dermatology 366 33 Yes Classification Multivariate Categorical, Integer Life 229400 1998-01-01
Cylinder Bands 512 39 Yes Classification Multivariate Categorical, Integer, Real Physical 79503 1995-08-01
Contraceptive Method Choice 1473 9 No Classification Multivariate Categorical, Integer Life 199832 1997-07-07
Computer Hardware 209 9 No Regression Multivariate Integer Computer 318014 1987-10-01
Japanese Credit Screening 125 - N/A Classification Multivariate, Domain-Theory Categorical, Real, Integer Financial 110577 1992-03-19
Covertype 581012 54 No Classification Multivariate Categorical, Integer Life 295901 1998-08-01
Connect-4 67557 42 No Classification Multivariate, Spatial Categorical Game 165667 1995-02-04
Bach Chorales 100 6 No N/A Univariate, Time-Series Categorical, Integer N/A 134870 N/A
Chess (Domain Theories) - - N/A N/A Domain-Theory N/A Game 53389 N/A
Credit Approval 690 15 Yes Classification Multivariate Categorical, Integer, Real Financial 442888 N/A
Chess (King-Rook vs. King) 28056 6 No Classification Multivariate Categorical, Integer Game 130396 1994-06-01
Chess (King-Rook vs. King-Pawn) 3196 36 No Classification Multivariate Categorical Game 119362 1989-08-01
Chess (King-Rook vs. King-Knight) - 22 No Classification Multivariate, Data-Generator Categorical, Integer Game 73923 1988-10-03
Census Income 48842 14 Yes Classification Multivariate Categorical, Integer Social 479344 1996-05-01
Car Evaluation 1728 6 No Classification Multivariate Categorical N/A 1258987 1997-06-01
Pittsburgh Bridges 108 13 Yes Classification Multivariate Categorical, Integer N/A 105100 1990-08-01
Breast Cancer Wisconsin (Diagnostic) 569 32 No Classification Multivariate Real Life 1353723 1995-11-01
Breast Cancer Wisconsin (Prognostic) 198 34 Yes Classification, Regression Multivariate Real Life 221242 1995-12-01
Breast Cancer Wisconsin (Original) 699 10 Yes Classification Multivariate Integer Life 638410 1992-07-15
Balloons 16 4 No Classification Multivariate Categorical Social 297339 N/A
Breast Cancer 286 9 Yes Classification Multivariate Categorical Life 504383 1988-07-11
Balance Scale 625 4 No Classification Multivariate Categorical Social 262693 1994-04-22
Badges 294 1 No Classification Univariate, Text N/A N/A 119453 1994-09-01
Automobile 205 26 Yes Regression Multivariate Categorical, Integer, Real N/A 553050 1987-05-19
Auto MPG 398 8 Yes Regression Multivariate Categorical, Real N/A 589961 1993-07-07
Audiology (Original) 226 - Yes Classification Multivariate Categorical Life 113262 1987-12-03
Audiology (Standardized) 226 69 Yes Classification Multivariate Categorical Life 107932 1992-08-18
Artificial Characters 6000 7 No Classification Multivariate Categorical, Integer, Real Computer 251917 1992-07-01
Arrhythmia 452 279 Yes Classification Multivariate Categorical, Integer, Real Life 327090 1998-01-01
Anonymous Microsoft Web Data 37711 294 N/A Recommender-Systems N/A Categorical Computer 184639 1998-11-01
Annealing 798 38 Yes Classification Multivariate Categorical, Integer, Real Physical 170320 N/A
Adult 48842 14 Yes Classification Multivariate Categorical, Integer Social 1963523 1996-05-01