ML-Projects

Data Science and Analysis

Physical Behaviour as a classification problem using smartphone data

Overview

For this example, data from smarthpone sensors (placed on the hip) will be used to build a model for predicting locomotion and transportation activities. To that end, the SHL dataset will be used to predict the following activities: 'Still', 'Walking','Run', 'Bike', 'Car', 'Bus', 'Train', and 'Subway'.

In order to build and evaluate an activity recognition system, a sequence of steps is required. These steps include data acquisition, data processing, data segmentation, features extraction, data imputation, features selection, classification, and validation. The following lines of code are performed with Python.

Methodology

Data Acquisition: Based on literature, it is highly recommended to use sensor data related to accelerometer, gyroscpose and GPS location for predicting activities related to locomotion and transportation. Here, the focus will be on accelerometer, gyroscope, magnetometer, and GPS data obtained from the SHL dataset.
Data Processing: Here data are cleaned and prepared for the next phases. Furthermore, GPS data features (i.e., speed and distance) are calculated based on the Haversine Distance of the GPS coordinates.
Data Segmentation: here the sensor data are divided into different frames. Based on literature, it is recommended a window segment of ~5sec with 50% overlap. Here, a sliding window of size 6 seconds with half overlap is chosen to meet the criteria for sensor fusion (by merging motion data and GPS).
Features Extraction: different time and frequency domain features are extracted for each sensor recording. Based on literature, I decided to compute the following features. For the accelerometer I computed mean, std, magnitude, fftfreq (peak frequencies based on fast fourier), signal power, entropy. For the gyroscope I computed mean, std, and magnitude. For the magnetomer I computed std, magnitude. For the GPS I computed the mean values. A description of these features can be found on the following section (see Reference 1). Furthermore, missing GPS data are imputed.
Features Selection: here non-informative features are excluded. At first highly correlated features (with an absolute correlation >=0.9) are removed. Then, a feature selection model (based on sklearn feature importance score) is also used to remove features with the lowest information gain.
Classification: The Random Forest is used to train the prediction model. Since the focus is not on how to optimise the classificaiton model, I will not focus on optimising the hyperparameters or on evaluating other algorithms.
Validation: The StratifiedShuffleSplit split from sklearn is used to split the data into 10 different train/test sets (10-fold cross- validation). Different metrics are used, such as precision, recall, f1-score, and confusion matrix.

Background Information & Related Work

For this solution, different studies in literature were considered. Mainly, I focused on the followings:

Physical Activity Recognition using Wearable Accelerometers in Controlled and Free-Living Environments (my MSc thesis graduation project about activity recognition)
A systematic review of smartphone-based human activity recognition methods for health research
The University of Sussex-Huawei Locomotion and Transportation Dataset for Multimodal Analytics With Mobile Devices

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
Classification_Locomotion.ipynb		Classification_Locomotion.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ML-Projects

Physical Behaviour as a classification problem using smartphone data

Overview

Methodology

Background Information & Related Work

About

Releases

Packages

Languages

kkonsolakis/ML-Projects

Folders and files

Latest commit

History

Repository files navigation

ML-Projects

Physical Behaviour as a classification problem using smartphone data

Overview

Methodology

Background Information & Related Work

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages