Skip to content

Commit

Permalink
Added coments
Browse files Browse the repository at this point in the history
  • Loading branch information
shamimansari1988 committed Nov 22, 2024
1 parent 2025f1f commit 7ab6af1
Showing 1 changed file with 1 addition and 0 deletions.
Original file line number Diff line number Diff line change
Expand Up @@ -1059,6 +1059,7 @@ def process(self):
(os.path.join(self.cleaned_csv_file_path,
"USA_Population_Count_by_Race_before_2000.csv"))
generator_df = generator_df[generator_df['geo_ID'].str.len() > 1]
#Duplicate geo_ID instances were detected within the 1970-1979 data subset. To ensure data integrity, the older entries were eliminated, leaving only the most recent updates.
generator_df = generator_df.drop_duplicates(
subset=['geo_ID', 'Year'], keep='last')
generator_df.to_csv(os.path.join(
Expand Down

0 comments on commit 7ab6af1

Please sign in to comment.