Building a Database from Federal School Test Data

Building a Database from Federal School Test Data

In this tutorial, we'll learn Building a Database from Federal School Test Data. Organizing National Center for Education mclStatistics’ (NCES) data in a relational database.


As parents of two young children, my wife and I have researched the quality of schools in several US cities. This research has led to a few popular websites which present a variety of metrics on the quality of individual schools. Two of the most popular websites that present information on school quality are and As a data professional, I wondered where these sites obtain their data. Both of these sites use data provided by the US Department of Education’s National Center for Education Statistics (NCES) ( also uses quite a bit of data from state education departments, which is more detailed than the federal data).

In this series of articles, I will share my process to download and restructure a number of these datasets. First, I will use Python to restructure the data to load it into a Mysql database, then I will do some ad hoc research projects using some of Python’s popular data science libraries.

The main goal of this first article is to cover how I successfully used Pandas to load 10 years of school-level test score results obtained from NCES into a MySQL database. By loading this data into a relational database format, I aim to make a wide array of analysis tasks more efficient as I take a deeper dive into the data in future articles. Hopefully, this will be helpful to those who are interested in exploring Pandas’ SQL capabilities.

sql python-pandas building a database from federal school test data database federal school test data

What is Geek Coin

What is GeekCash, Geek Token

Best Visual Studio Code Themes of 2021

Bootstrap 5 Tutorial - Bootstrap 5 Crash Course for Beginners

Nest.JS Tutorial for Beginners

Hello Vue 3: A First Look at Vue 3 and the Composition API

Introduction to Structured Query Language SQL pdf

SQL stands for Structured Query Language. SQL is a scripting language expected to store, control, and inquiry information put away in social databases. The main manifestation of SQL showed up in 1974, when a gathering in IBM built up the principal model of a social database. The primary business social database was discharged by Relational Software later turning out to be Oracle.

Data Manipulation: SQL vs. Pandas

Data Manipulation: SQL vs. Pandas. Which tool to use in your next data science project.

top 30 Python Tips and Tricks for Beginners

In this post, we'll learn top 30 Python Tips and Tricks for Beginners

Basic Data Types in Python | Python Web Development For Beginners

In the programming world, Data types play an important role. Each Variable is stored in different data types and responsible for various functions. Python had two different objects, and They are mutable and immutable objects.

Data Quality Testing Skills Needed For Data Integration Projects

Data Quality Testing Skills Needed For Data Integration Projects. Data integration projects fail for many reasons. Risks can be mitigated when well-trained testers deliver support. Here are some recommended testing skills.