Appendix B: Datasets

Google Speech Commands Dataset
- Description: A set of one-second .wav audio files, each containing a single spoken English word.
- Link to the Dataset
VisualWakeWords Dataset
- Description: A dataset tailored for TinyML vision applications, consisting of binary-labeled images that indicate whether or not a person is present in the image.
- Link to the Dataset
EMNIST Dataset
- Description: A dataset of 28x28 pixel images of handwritten characters and digits. It extends the MNIST dataset by adding letters.
- Link to the Dataset
UCI Machine Learning Repository: Human Activity Recognition Using Smartphones
- Description: A dataset of recordings from 30 study participants performing activities of daily living (ADL) while carrying a waist-mounted smartphone with embedded inertial sensors.
- Link to the Dataset
PlantVillage Dataset
- Description: A dataset of images of healthy and diseased crop leaves, categorized by crop type and disease type, which could be used in a TinyML agricultural project.
- Link to the Dataset
Gesture Recognition using 3D Motion Sensing (3D Gesture Database)
- Description: This dataset contains 3D gesture data recorded using a Leap Motion Controller, which might be useful for gesture recognition projects.
- Link to the Dataset
Multilingual Spoken Words Corpus
- Description: A dataset containing recordings of common spoken words in various languages, useful for speech recognition projects targeting multiple languages.
- Link to the Dataset
Wake Vision
- Description: A dataset containing over 6 million images for binary person classification. It also includes a fine-grained benchmark suite for evaluating the fairness and robustness of models.
- Link to the Dataset
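
Before training on an audio dataset such as Google Speech Commands, described above as one-second .wav files, it can help to confirm that each clip really has the expected length. The following is a minimal sketch using only Python's standard `wave` module. The helper names `wav_duration_seconds` and `make_test_tone` are hypothetical, and the 16 kHz mono 16-bit format of the synthetic clip is an assumption made for illustration; check the actual format of the files you download.

```python
import io
import math
import struct
import wave


def wav_duration_seconds(source) -> float:
    """Return the duration in seconds of a WAV file (path or file-like object)."""
    with wave.open(source, "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def make_test_tone(sample_rate: int = 16000, seconds: float = 1.0) -> io.BytesIO:
    """Build a synthetic mono 16-bit sine tone in memory.

    This is only a stand-in for a real clip; the sample rate is an assumption.
    """
    n_frames = int(sample_rate * seconds)
    samples = (
        int(0.3 * 32767 * math.sin(2 * math.pi * 440 * i / sample_rate))
        for i in range(n_frames)
    )
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(1)        # mono
        wav.setsampwidth(2)        # 16-bit samples
        wav.setframerate(sample_rate)
        wav.writeframes(b"".join(struct.pack("<h", s) for s in samples))
    buffer.seek(0)                 # rewind so the buffer can be read back
    return buffer


if __name__ == "__main__":
    clip = make_test_tone()
    print(f"duration: {wav_duration_seconds(clip):.2f} s")
```

To check real data, pass the path of a downloaded .wav file to `wav_duration_seconds` instead of the synthetic buffer, and flag any clip whose duration differs from what your model's input pipeline expects.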

Remember to verify each dataset’s license or terms of use to ensure it can be used for your intended purpose.