- Web scraping data from the webpage https://www.watchcollecting.com
  The code is in rolexVer009.py
  The ChromeDriver path needs to be set in this line:
  service = Service('/usr/local/bin/chromedriver')
  The data are saved to the file dataAutoVer009.csv
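As a minimal sketch of keeping that path in one place, the snippet below reads it from an environment variable and falls back to the location shown above. The variable name CHROMEDRIVER_PATH and both helper names are assumptions for illustration; they are not taken from rolexVer009.py.

```python
import os

# Default location used in this README.
DEFAULT_CHROMEDRIVER = "/usr/local/bin/chromedriver"


def chromedriver_path() -> str:
    """Return the ChromeDriver path, preferring CHROMEDRIVER_PATH (hypothetical name)."""
    return os.environ.get("CHROMEDRIVER_PATH", DEFAULT_CHROMEDRIVER)


def build_driver():
    """Start Chrome through Selenium 4 using the resolved driver path."""
    # Imported here so the path helper still works before Selenium is installed.
    from selenium import webdriver
    from selenium.webdriver.chrome.service import Service

    service = Service(chromedriver_path())
    return webdriver.Chrome(service=service)
```

With this approach the scraping script would call build_driver() instead of hard-coding the path, and a different machine would only need to set the environment variable.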
- Web scraping data from the webpage https://www.ofx.com/en-us/forex-news/historical-exchange-rates/
  The code is in currencypaul002.py
  At the start, the data are read from the file data\dataAutoVer009.csv
  At the end, the updated data are exported to the file dataWithCurrencyVer002.csv
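The read, convert, export flow of this step can be sketched roughly as follows. This is not the code in currencypaul002.py: the column names (price, currency, sale_date), the target currency, and the shape of the rates dictionary are all assumptions, and the rates themselves would come from the scraped exchange-rate page.

```python
import csv


def add_converted_price(in_path, out_path, rates, target="USD"):
    """Copy a CSV, appending a price column converted with per-date rates.

    rates maps (currency, date) -> units of `target` per one unit of currency.
    The column names used here are assumed, not taken from the project files.
    """
    new_col = f"price_{target.lower()}"
    with open(in_path, newline="", encoding="utf-8") as src:
        reader = csv.DictReader(src)
        fieldnames = list(reader.fieldnames or []) + [new_col]
        rows = []
        for row in reader:
            key = (row["currency"], row["sale_date"])
            rate = 1.0 if row["currency"] == target else rates.get(key)
            # Leave the cell empty when no rate is known instead of guessing.
            row[new_col] = "" if rate is None else f"{float(row['price']) * rate:.2f}"
            rows.append(row)
    with open(out_path, "w", newline="", encoding="utf-8") as dst:
        writer = csv.DictWriter(dst, fieldnames=fieldnames)
        writer.writeheader()
        writer.writerows(rows)
    return len(rows)
```

Rows without a known rate keep an empty converted price, so they are easy to spot in the cleaning step.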
- The code for data cleaning and preparation is in cleaningRolexVer011WholeCode.py
  At the start, the data are read from the file dataWithCurrencyVer002.csv
  At the end, the updated data are exported to the file reformatedAndOneHotEncodedDataVer011.csv
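For readers unfamiliar with one-hot encoding, here is a small dependency-free sketch of the idea. It is an illustration only and not the cleaning script's implementation; if the script uses pandas, pandas.get_dummies does the same job. The column and value names in any call are up to the caller.

```python
def one_hot_encode(rows, column):
    """Replace `column` in each dict row with 0/1 indicator columns.

    Indicator names follow the pattern "<column>_<value>", sorted for a stable order.
    """
    categories = sorted({row[column] for row in rows})
    encoded = []
    for row in rows:
        # Keep every other column unchanged.
        new_row = {k: v for k, v in row.items() if k != column}
        for cat in categories:
            new_row[f"{column}_{cat}"] = 1 if row[column] == cat else 0
        encoded.append(new_row)
    return encoded
```

Each distinct value of the chosen column becomes its own column, which is why the encoded file has many more columns than the input.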
- The model is created in the Jupyter Notebook Watches_Model.ipynb
  The trained model is exported to watches_lgbm_initial_model.pkl
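A pickled model like this can be written and read back with the standard pickle module. The helpers below are a sketch with hypothetical names; the notebook may export the model differently. If the file holds a LightGBM model, as its name suggests, the lightgbm package must also be installed wherever the file is loaded.

```python
import pickle


def save_model(model, path="watches_lgbm_initial_model.pkl"):
    """Serialize a trained model object to disk."""
    with open(path, "wb") as fh:
        pickle.dump(model, fh)


def load_model(path="watches_lgbm_initial_model.pkl"):
    """Load a pickled model object from disk."""
    # Only unpickle files you created yourself: pickle can execute code on load.
    with open(path, "rb") as fh:
        return pickle.load(fh)
```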
- The code for the application is in:
  server.py
- Make sure everything runs.
- Finish the Flask server. Using the data received from the form, populate an array (its length should match the number of columns in the data set) to pass into the model for prediction. Once completed, test that the prediction works and that the result is output on the next page. (Tips are provided in the comments in the file.)
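One possible way to populate that array is sketched below. It assumes the model was trained on one-hot encoded columns named "<field>_<value>" and that form field names match the column names; both are assumptions, and the function name is hypothetical. The comments in server.py remain the authoritative guide.

```python
def form_to_features(form, columns):
    """Build a single-row feature list ordered like the training columns.

    Numeric fields are matched by name; a categorical field "<name>" with value
    "<value>" switches on the one-hot column "<name>_<value>".
    """
    row = [0.0] * len(columns)
    index = {name: i for i, name in enumerate(columns)}
    for field, value in form.items():
        one_hot = f"{field}_{value}"
        if one_hot in index:
            row[index[one_hot]] = 1.0
        elif field in index:
            row[index[field]] = float(value)
        # Fields that match no column are ignored.
    return [row]  # one row, len(columns) values
```

Inside a Flask view this could be called as form_to_features(request.form, columns), where columns is the list of column names the model was trained on, and the result passed to the model's predict method.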
- Automate data retrieval so that web scraping only collects new entries on the website (i.e. new watches that were sold).
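One way to approach this is to remember which listings are already in the CSV and append only unseen ones. The sketch below assumes each scraped row carries a unique identifier in a column named url; that column name and the function names are hypothetical.

```python
import csv
import os


def load_seen_ids(csv_path, id_column="url"):
    """Return the set of listing identifiers already stored in the CSV."""
    if not os.path.exists(csv_path):
        return set()
    with open(csv_path, newline="", encoding="utf-8") as fh:
        return {row[id_column] for row in csv.DictReader(fh)}


def append_new_rows(csv_path, scraped_rows, id_column="url"):
    """Append only rows whose identifier is not yet in the CSV; return how many were added.

    Assumes scraped rows use the same columns, in the same order, as the existing file.
    """
    seen = load_seen_ids(csv_path, id_column)
    new_rows = []
    for row in scraped_rows:
        if row[id_column] not in seen:
            seen.add(row[id_column])  # also guards against duplicates within one run
            new_rows.append(row)
    if not new_rows:
        return 0
    write_header = not os.path.exists(csv_path) or os.path.getsize(csv_path) == 0
    with open(csv_path, "a", newline="", encoding="utf-8") as fh:
        writer = csv.DictWriter(fh, fieldnames=list(new_rows[0].keys()))
        if write_header:
            writer.writeheader()
        writer.writerows(new_rows)
    return len(new_rows)
```

The scraper could also stop paging through results as soon as it meets an identifier that is already known, provided the site lists the newest sales first.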
- Automate the model so that, once enough new data has been retrieved, the model is refit with the best parameters and pickled for the Flask server.
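A simple trigger for this could compare the current row count with the count at the last fit. In the sketch below the threshold of 100 rows is an arbitrary placeholder, and fit_fn stands for whatever training routine (for example a parameter search) the notebook ends up using; all names are hypothetical.

```python
import os
import pickle


def maybe_refit(n_rows_now, n_rows_at_last_fit, fit_fn, model_path, min_new_rows=100):
    """Refit and re-pickle the model once enough new rows have accumulated.

    fit_fn is a caller-supplied function that trains and returns a model.
    Returns True if a refit ran, False otherwise.
    """
    if n_rows_now - n_rows_at_last_fit < min_new_rows:
        return False
    model = fit_fn()
    # Write to a temporary file first so the server never loads a half-written pickle.
    tmp_path = model_path + ".tmp"
    with open(tmp_path, "wb") as fh:
        pickle.dump(model, fh)
    os.replace(tmp_path, model_path)
    return True
```

The caller would need to store the row count from the last successful fit, for instance in a small text file next to the pickle.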
- Download and install the latest Chrome and check your Chrome version
- Download the ChromeDriver zip file matching your Chrome version and Apple chip from
  https://chromedriver.storage.googleapis.com/index.html
  and extract the chromedriver
- Copy the chromedriver and paste it into
  /usr/local/bin
  (If this folder doesn’t exist, create it)
- Open a terminal inside the bin folder and run the following commands to remove the quarantine flag so that macOS allows chromedriver to run:
  cd /usr/local/bin
  xattr -d com.apple.quarantine chromedriver
- Install Selenium:
pip3 install selenium
- When running the code on macOS, the .py files containing the web scraping code should include:
  service = Service('/usr/local/bin/chromedriver')
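After completing the steps above, a quick check like the following can confirm that the driver is where the scripts expect it. This is an optional sketch with a hypothetical function name; it only inspects the file and does not start Chrome.

```python
import os


def check_chromedriver(path="/usr/local/bin/chromedriver"):
    """Return a list of problems found with the ChromeDriver binary (empty if none)."""
    problems = []
    if not os.path.isfile(path):
        problems.append(f"not found: {path}")
    elif not os.access(path, os.X_OK):
        problems.append(f"not executable: {path} (try: chmod +x {path})")
    return problems
```

An empty list means the file exists and is executable; it does not prove that the driver version matches the installed Chrome.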