Where and how to get the data
- Existing datasets at Kaggle
- Curate your own data
- Scientific data
- Web crawling & using APIs (to get social media data etc)
- Using wget or panda (read from URL) to get webpages or remote files;
- Spotify API (All requests to spotify Web API require authentication) (spotipy; Spotify Million Playlist Dataset Challenge)
- Google API (e.g., to get youtube playlist)
google API explorer
- Google trend API (pytrends -- unofficial)