Here is the list of free data sets for machine learning and deep learning publicly available:
- Machine learning problems datasets
- UC Irvine Machine Learning Repository: A repository of 560 datasets suitable for traditional machine learning algorithm problems such as classification and regression
- Public available dataset through public APIs: A list of 650+ datasets available via public API
- Penn machine learning dataset: The data sets cover a broad range of applications, and include binary/multi-class classification problems and regression problems, as well as combinations of categorical, ordinal, and continuous features. The good part if that the datasets is available in **tabular form **that makes it very useful for training models with traditional machine learning algorithms
- Datasets linked to the papers: A set of 3000+ datasets linked with the white papers; My favorite one from https://www.paperswithcode.com
- OpenML.org dataset: Mostly tabular datasets (3200+) suitable for traditional machine learning algorithms
- Computer vision problems datasets
- Visual data: A collection of 526 datasets for solving computer vision problems
- Roboflow computer vision datasets: A list of computer vision datasets in many popular formats (including CreateML JSON, COCO JSON, Pascal VOC XML, YOLO v3, and Tensorflow TFRecords)
#machine learning #ai #free #deep learning #datasets #training data