-
the data for the project [data]https://d396qusza40orc.cloudfront.net/getdata%2Fprojectfiles%2FUCI%20HAR%20Dataset.zip
-
A full description is available at the site where the data was obtained:[description]http://archive.ics.uci.edu/ml/datasets/Human+Activity+Recognition+Using+Smartphones
-
First download the data set from the link and then set a working directory to UCI HAR Dataset
-
needs dplyr , data.table , and tidyverse packages
-
uses read.table() to import the data sets.
-
uses names() to describe the activity names of variables feature, activities
- uses rbind() to merge Y_test and X_test, and store in **merged_activity **
- it uses merge() and cbind() to create total_Merged_Data.
- uses grep("mean\(\)|std\(\)", features$feature_Label) to extract variables with mean and std on the column names
- gsub("\(|\)", "", names(Merged_with_Mean_Std)) removes () from variable names
-
uses tbl_df function from dplyr package
-
uses group_by function to group the data set by activity_Label, subject_Id
-
Then, for each separted data set it summarizes all variables using mean
-
Finally, exports and save a data set with a name tidy_mean_std.txt