In my last article, I discussed the steps to download NASA data from GES DISC. The data files downloaded are in the HDF5 format. HDF5 is a file format, a technology, that enables the management of very large data collections. Thus, it is quite popular for storing information. For getting NASA’s data, please check the below article first:

Getting NASA data for your next geo-project

Accessing GES DISC data files

towardsdatascience.com

Whenever I work with datasets, I’m most comfortable with CSV files. Thus, once I got the HDF5 files, I decided to look for ways to change them to CSV files. I found the package h5py in Python, which enables the reading in of HDF5 files. Thus, this article articulates the steps to use h5py and convert HDF5 to CSV. You can follow along by referring to the complete notebook at the link below.

kb22/NASA-data-exploration

Permalink Dismiss GitHub is home to over 50 million developers working together to host and review code, manage…

github.com

Import libraries

For this work, we’ll require two libraries. The first library is h5py which has the option to read and work with HDF5 files (documentation). The second package we need is numpy to work with arrays. Finally, we will import pandas so we can create a dataframe and later save it as a CSV file.

#data #data-science #editors-pick #machine-learning #technology #data analysis

Working with HDF5 files and creating CSV files
4.30 GEEK