Skip to content

Commit

Permalink
DS4B Class
Browse files Browse the repository at this point in the history
  • Loading branch information
AndrewBorst committed Jul 8, 2020
1 parent 8b172ad commit e46262f
Show file tree
Hide file tree
Showing 33 changed files with 31,993 additions and 276 deletions.
Binary file not shown.
42 changes: 42 additions & 0 deletions DS4B_101_R_Business_Analysis/00_data/bike_sales/README.Rmd
Original file line number Diff line number Diff line change
@@ -0,0 +1,42 @@
---
title: "Bike Sales Excel Database"
output: github_document
---

```{r setup, include=FALSE}
knitr::opts_chunk$set(echo = TRUE)
```

The __Bike Sales Database__ represents a __bicycle manufacturer__, including tables for products (bikes), customers (bike shops), and transactions (orders).

## Data Model

```{r, echo=F}
knitr::include_graphics("img/bike_sales_data_model.png")
```


## About

The Bike Sales data files consist of 3 excel spreadsheets that represent:

- `bikes` table which includes bicycle models, descriptions, and unit prices that are produced by the manufacturer.

- `bikeshops` table which includes customers that the bicycle manufacturer has sold to.

- `orderlines` table which includes transactional data such as order ID, order line, date, customer, product, and quantity sold.

## Tables

```{r, message=F}
library(tidyverse)
library(readxl)
library(fs)
dir_info("./data_raw") %>%
filter(str_detect(path, "xlsx")) %>%
pull(path) %>%
map(read_xlsx) %>%
set_names(c("bikes", "bikeshops", "orderlines"))
```

86 changes: 86 additions & 0 deletions DS4B_101_R_Business_Analysis/00_data/bike_sales/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,86 @@
Bike Sales Excel Database
================

The **Bike Sales Database** represents a **bicycle manufacturer**,
including tables for products (bikes), customers (bike shops), and
transactions (orders).

## Data Model

![](img/bike_sales_data_model.png)<!-- -->

## About

The Bike Sales data files consist of 3 excel spreadsheets that
represent:

- `bikes` table which includes bicycle models, descriptions, and unit
prices that are produced by the manufacturer.

- `bikeshops` table which includes customers that the bicycle
manufacturer has sold to.

- `orderlines` table which includes transactional data such as order
ID, order line, date, customer, product, and quantity sold.

## Tables

``` r
library(tidyverse)
library(readxl)
library(fs)

dir_info("./data_raw") %>%
filter(str_detect(path, "xlsx")) %>%
pull(path) %>%
map(read_xlsx) %>%
set_names(c("bikes", "bikeshops", "orderlines"))
```

## $bikes
## # A tibble: 97 x 4
## bike.id model description price
## <dbl> <chr> <chr> <dbl>
## 1 1 Supersix Evo Black Inc. Road - Elite Road - Carbon 12790
## 2 2 Supersix Evo Hi-Mod Team Road - Elite Road - Carbon 10660
## 3 3 Supersix Evo Hi-Mod Dura Ace 1 Road - Elite Road - Carbon 7990
## 4 4 Supersix Evo Hi-Mod Dura Ace 2 Road - Elite Road - Carbon 5330
## 5 5 Supersix Evo Hi-Mod Utegra Road - Elite Road - Carbon 4260
## 6 6 Supersix Evo Red Road - Elite Road - Carbon 3940
## 7 7 Supersix Evo Ultegra 3 Road - Elite Road - Carbon 3200
## 8 8 Supersix Evo Ultegra 4 Road - Elite Road - Carbon 2660
## 9 9 Supersix Evo 105 Road - Elite Road - Carbon 2240
## 10 10 Supersix Evo Tiagra Road - Elite Road - Carbon 1840
## # ... with 87 more rows
##
## $bikeshops
## # A tibble: 30 x 3
## bikeshop.id bikeshop.name location
## <dbl> <chr> <chr>
## 1 1 Pittsburgh Mountain Machines Pittsburgh, PA
## 2 2 Ithaca Mountain Climbers Ithaca, NY
## 3 3 Columbus Race Equipment Columbus, OH
## 4 4 Detroit Cycles Detroit, MI
## 5 5 Cincinnati Speed Cincinnati, OH
## 6 6 Louisville Race Equipment Louisville, KY
## 7 7 Nashville Cruisers Nashville, TN
## 8 8 Denver Bike Shop Denver, CO
## 9 9 Minneapolis Bike Shop Minneapolis, MN
## 10 10 Kansas City 29ers Kansas City, KS
## # ... with 20 more rows
##
## $orderlines
## # A tibble: 15,644 x 7
## X__1 order.id order.line order.date customer.id product.id
## <chr> <dbl> <dbl> <dttm> <dbl> <dbl>
## 1 1 1 1 2011-01-07 00:00:00 2 48
## 2 2 1 2 2011-01-07 00:00:00 2 52
## 3 3 2 1 2011-01-10 00:00:00 10 76
## 4 4 2 2 2011-01-10 00:00:00 10 52
## 5 5 3 1 2011-01-10 00:00:00 6 2
## 6 6 3 2 2011-01-10 00:00:00 6 50
## 7 7 3 3 2011-01-10 00:00:00 6 1
## 8 8 3 4 2011-01-10 00:00:00 6 4
## 9 9 3 5 2011-01-10 00:00:00 6 34
## 10 10 4 1 2011-01-11 00:00:00 22 26
## # ... with 15,634 more rows, and 1 more variable: quantity <dbl>
Binary file not shown.
Binary file not shown.
Binary file not shown.
Loading

0 comments on commit e46262f

Please sign in to comment.