Skip to content

Juvenile Crime Analysis using data exploration, data analysis and machine learning.

Notifications You must be signed in to change notification settings

KartikGS/Juvenile_Crime_Analysis_In_India

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Juvenile Crime Analysis

Using data exploration, data analysis and machine learning.

Objective

Main objective of the analysis is prediction and interpretation of juvenile crime in India and deciding efficient allocation of resources in the right areas to minimise the crime rate.

Dataset

Juvenile Crime in India

Dataset we have considered:Github

Crime in India (Kaggle)

Following link contains processed data of various crime in separate csv file. From which we have considered csv which contains information about juvenile crime:Kaggle

Crime in India (NCRB)

Following link contains the original unprocessed data of various crime in separate csv file:NCRB

Metadata

The dataset contains different sub datasets, each of these sub datasets holds the aggregate value of crime recorded from 2001 to 2010 and are split on various categories. On sub-dataset divided based on:

  • Age-group and type of crime
  • Background
  • Recidivism The kaggle dataset was preprocessed, various combinations of datasets was used to train the model and address various issues.

Problem Statements And Results

Given state, crime(mapping required, not considered for now), age, gender, predicting the possibility of SLL(Naive bias using above data + ARIMA to predict current year using previous year data)

SLL by parima

Alt text

Naive Bayes Result

Boys

Alt text

Girls

Alt text

Feature Importance(Using Linear Regression and Random Forest)

Relation between punishment and recidivism (Linear Regression)

Alt text Alt text

Relation between punishment and recidivism (Random Forest)

Alt text Alt text

Background of juveniles committing crime

Alt text Alt text

Which State requires better rehabilitation facilities.(Done using clustering)

Alt text Alt text

Given State, who to be on a look out for?(Arima+Naive bias, 3 class classification)

Recidivism by parima
Alt text

Naive Bayes Result
Alt text

Future Scope

The next step is to streamline the data analysis process by improving the collection method of data or aggregation of various data. Better interpretations can be made with more abstract data. A lot of key information is lost in the current format of data aggregation.

About

Juvenile Crime Analysis using data exploration, data analysis and machine learning.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published