Skip to content

In this challenge, as the Chief Data Scientist for your city's school district, you'll use Python's Pandas library to analyze standardized test results. The goal is to identify trends in math and reading scores, helping the school board make data-driven decisions regarding budgets and priorities.

Notifications You must be signed in to change notification settings

SakinaJaffri/School_District_Analysis_with_Pandas

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

School District Analysis with Pandas

Project Overview

In this challenge, you take on the role of Chief Data Scientist for your city's school district. Using Python's Pandas library, you will analyze district-wide standardized test results to identify trends in school performance. The data includes math and reading scores, school information, and other relevant factors. This analysis will assist the school board in making strategic decisions about budgets and priorities.

Education

Key Tasks and Calculations

District Summary

  • Total number of unique schools.
  • Total number of students.
  • Total school district budget.
  • Average math and reading scores.
  • Percentage of students passing math and reading.
  • Percentage of students passing both subjects.

School Summary

  • School type and total student count.
  • Per capita spending.
  • Average test scores.
  • Schools with passing rates of 70% or higher in math and reading.
  • Overall passing percentages for each school.

Additional Analysis

  • Top-Performing Schools: Ranked by overall passing percentage.
  • Lowest-Performing Schools: Ranked by overall passing percentage.
  • Math and Reading Scores by Grade: Comparison of test scores by grade level.
  • Scores by School Spending: Analysis of school performance based on spending levels.
  • Scores by School Size: Comparison of school performance based on school size.
  • Scores by School Type: Analysis of performance based on charter and district school types.

Key Findings

  • Schools with lower budgets performed better in both math and reading, showing higher passing rates.
  • Smaller and medium-sized schools consistently outperformed larger schools.
  • Charter schools outperformed District schools, with all top-performing schools being Charter and all lowest-performing schools being District.
  • Academic performance was generally consistent across grade levels.
  • Students performed better in reading than math, though overall passing rates were low, indicating subject performance variation.

Tools Used

  • Python: Scripting language for data analysis.
  • Pandas: Library used for data manipulation and aggregation.
  • Jupyter Notebook: Interactive development environment for running Python scripts.

How to Run

  1. Clone this repository.
  2. Ensure Python and Pandas are installed in your environment.
  3. Run the analysis scripts in Jupyter Notebook or a Python environment to view results.

Written Report

The written report attached provides a detailed summary of the analysis and draws conclusions based on key calculations. For a more detailed breakdown of the data, please refer to the attached analysis report in the repository.

Contributors

  • Sakina Jaffri - Data analysis, Pandas scripting, and reporting.

About

In this challenge, as the Chief Data Scientist for your city's school district, you'll use Python's Pandas library to analyze standardized test results. The goal is to identify trends in math and reading scores, helping the school board make data-driven decisions regarding budgets and priorities.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published