This project includes delivering a cloud based web application which will run with minimum downtime, supporting multiple users to access the application simultaneously in a secure manner. Further, the project aims to deliver a shopping website wherein users can purchase clothing items and accessories.
AWS services Integrated:
- AWS Cognito (Authentication system)
- AWS EC2 (For application deployment)
- AWS EC2 Auto Scaling groups
- AWS Elastic Load Balancer
500083951_CAD_PROJECT_DEMO.mp4
- Authentication System
- Browsing products on the website
- Viewing Product details
- Adding products to cart
- Getting the total amount for the products selected in cart
- Successfully placing order of the selected items in the cart
python server.py
A multithreaded HTTP web server which will run multiple and individual threads for the server and client side, will start a new thread each time a client accesses the website. After starting the HTTP web server, HTML,CSS and JS files are rendered defined in requestHandler.py The server is stopped when a critical error occurs.
python multithreading.py n
Here, n is the number of worker threads you want to spawn for conformance and stress tests on GET,PUT and DELETE HTTP requests.
-
Change the URL mentioned in the index.html "https://homestyle.auth.us-east-1.amazoncognito.com/login?response_type=code&client_id=5icc2b3ks47a43hopdmbdoeqer&redirect_uri=http://localhost:8000/home.html"
-
Configure homestyle-cognito.yaml Here,change the parameters: UserPoolName,CallBackURLs,UserPoolId,Domain
AWS cognito api is fetched which presents the the sign up page to the user.
The application is deployed on AWS EC2 instance using Ubuntu canonical AMI. The instance is created inside a VPC, which will provide security and isolation to the application. Security rules like TCP connection on port 8000, SSH were defined. We’ll connect to the instance using Putty and copy the files from our local desktop to the remote instance by establishing a remote connection using WinSCP SMTP protocol. Once the files are transferred, install requirements.txt on the instance terminal and launch the application using: python server.py.
Now copy the public IPv4 address from the instance details and add port 8000 at the end of the address and run it on browser. You can see that the application is now successfully running on the AWS Cloud.
Once the user is successfully authenticated using AWS Cognito, they will be navigated to the main page of the website.
Further Elastic Load Balancer and EC2 Auto Scaling groups are linked to the the EC2 instance providing scalability and reliability.
-
EC2 Auto-Scaling Group Create a launch template for the auto scaling group and configure 6 availability zones wherein the instances of the auto-scaling group will be deployed.Now once the auto scaling group is created, we can browse to its activity history and see that as the desired capacity is set to 2, two new instances are launched in different availability zones of the same configuration as that of the originally created instance. This can be checked by navigating to the Instance dashboard of EC2 , wherein we can see that three instances are now up and running.
We are analysing Amazon review data using Pyspark which can provide a scalable, flexible, and fast way to gain valuable insights into customer behaviour and sentiment. These insights can help inform marketing and product development strategies, ultimately leading to improved business performance and customer satisfaction.
Steps performed for incorporating Big Data Analytics in this project are:
-
I chose dataset of Digital Music purchase reviews of Amazon.
-
Then Extract the Amazon review data of digital music purchase from the URL of object uploaded in AWS S3 bucket.
-
Transform the extracted the data type of several columns in the "digital_music_reviews_df" dataframe to the specified data types. Transform Data was then transformed(cleaned) before tables were created using our PgAdmin SQL schema for reference.
-
Load the data frames to RDS database. Each table that was created above was then loaded onto the PgAdmin using AWS's RDS connection I created.
We first transfer the python files for mapper.py and reducer.py from our local machine to the remote host. Also download an ebook .txt file to test out the functionality of the python files written. Now set the python path in each of the python files at the top. If you are not sure of the python path type in the command: which python3 (this will give the path).
cat /home/ubuntu/Beam-Private-George-O---Geor-[ebooksread.com].txt | /home/ubuntu/mapper.py
docker build -t <image_name> .
docker run -it <image_id>