Simple data curation practices using Numpy (Udacity Deep Learning Assignment 1)
First, download the compressed dataset "notMNIST_large.tar.gz" and "notMNIST_small.tar.gz" from http://yaroslavvb.com/upload/notMNIST/ OR download from others site using full code provided below. url = 'https://commondatastorage.googleapis.com/books1000/' last_percent_reported = None data_root = os.getcwd() #path to save file def download_progress_hook ( count , blockSize , totalSize ): """A hook to report the progress of a download. This is mostly intended for users with slow internet connections. Reports every 5% change in download progress. """ global last_percent_reported percent = int ( count * blockSize * 100 / totalSize ) if last_percent_reported != percent : if percent % 5 == 0 : sys . stdout . write ( " %s%% " % percent ) sys . stdout . flush () else : sys . stdout . write ( "." ) sys . stdout . flush () ...