PetMatch

Petmatch seeks to provide users with curated animal recommendations in the hope that it will lead to higher adoptions rates and lower return rates to shelters. This is accomplished through the use of recommendation models that serve recommendations based on both item-based similarities and user-based collaborative filtering. We also built a serverless full-stack pipeline and a Tinder-like UI that matches adoptable pets with potential adopters.

Team

Denise Blady
Theresa Thoraldson
Matthew Robinson

Local Docker

To run all required containers locally
- pull down master branch
- run BASE_PATH='<Path to this repo!>' docker compose up
- For example: BASE_PATH='/Users/theresa/Desktop/source/PetMatch' docker compose up
- Alternative: Create a .env file at the root of the repository which contains an environment variable BASE_PATH and a string value leading the repository's root within your local file system.

Run Petmatch Docker

From the root of repository, run docker-compose up
Troubleshooting - docker-compose up --build --force-recreate -d or BASE_PATH='YOUR BASE PATH' docker compose up

Prerequisites

Docker
Docker Compose
Python3.8>=
Port Access Availability on local ports: 8086, 8001, 8002, 8003, 3000, 8501, 8087

Project Organization

├── LICENSE
├── README.md          <- The top-level README for developers using this project.
├── data
│   ├── external       <- Data from third party sources.
│   ├── rankings       <- User rankings (likes,dislikes)
│   └── raw            <- The original, immutable data dump.
│
├── docs               <- A default Sphinx project; see sphinx-doc.org for details
│
├── models             <- Trained and serialized models, model predictions, or model summaries
│
├── notebooks          <- Jupyter notebooks. Naming convention is a number (for ordering),
│                         the creator's initials, and a short `-` delimited description, e.g.
│                         `1.0-jqp-initial-data-exploration`.
│
├── references         <- Data dictionaries, manuals, and all other explanatory materials.
│
├── reports            <- Generated analysis as HTML, PDF, LaTeX, etc.
│   └── figures        <- Generated graphics and figures to be used in reporting
│
├── requirements.txt   <- The requirements file for reproducing the analysis environment, e.g.
│                         generated with `pip freeze > requirements.txt`
│
├── setup.py           <- makes project pip installable (pip install -e .) so src can be imported
├── src                <- Source code for use in this project.
│   ├── __init__.py    <- Makes src a Python module
│   │
│   ├── data           
|   |   └── dynamo     <- Local shared Dynamodb
|   |   └── shared-local-instance.db <- Request this file from PetMatch team member
|   ├── frontend       <- React and AWS Amplify code, frontend Dockerfile
|   ├── pet_match_stack
|   |   └── api        <- 
|   |   |   └── app    <- FastAPI application code for main backend routes
|   |   |   └── model_api <- FastAPI application code for ML models interactions routes
|   |   |   └── fastapi.Dockerfile <- FastAPI application Dockerfile
|   |   |
|   |   └── db-setup <- DynamoDB local setup scripts and Dockerfile
|   |   └── infra    <- Terraform infrastructure
|   |   └── lambdas  <- AWS Lambda code 
|   |   └── tests    <- Testing sweet
|   | 
│   └── visualization  <- Contains scripts and Dockerfiles to run the Petmatch Streamlit application this contains basic version of Petmatch
|       └── pages <- Streamlit multi-page app pages
|       └── rankings <- Destination for saved user Rankings (likes of animals)
|       └── get_user_prefs.py <- script to collect user preferences
|       └── Petmatch_Start.py <- Streamlit application homepage 
│
└── docker-compose.yml   <- Docker Compose configuration that sets up and configures your local stack

Limitations
Model Limitations:

User-based Collaborative Filtering models have a 'cold-start' problem, which we have mitigated by initially collecting animal rankings by beta users. That said this model will need to be continually retrained on a semi-regular schedule to continually improve it.
Item-based Content-based similarity models are an exhaustive model that currently has a cap of ~56K animals before memory is exhausted. Research is still being underdone to more efficiently train this model to up the limit of animals it can recommend. These models can also fill the gap until collaborative filtering models are ready.

Data Limitations:

Data pulls automatically cut off animal descriptions once they reach a certain limit
PetFinder DB API can be limiting for data pulls until we can prove we have a deployable app, must work with stashed local versions of the DB based on pulls until we get higher DB access rights
Distance field based on location at time of pull

ML Pipeline Limitations:

DynamoDB tables requires pulls at scale, which requires some data currently must be loaded in via a .pkl format rather than queried from the Petmatch stack backend

Project Pitch Slides: /reports/PetMatch_presentationSlides_20230123.pdf
Project Software Architecture Diagram: /reports/figures/PetMatchSystemDiagram_v2.png

Future Work:

Continue collecting user rankings
Add timestamp to user rankings for time-sensitive recommendations
Incorporate distance more effectively into models
Incorporate auto-retrain of models in Sagemaker

Project based on the cookiecutter data science project template. #cookiecutterdatascience

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PetMatch

Team

Local Docker

Run Petmatch Docker

Prerequisites

Project Organization

About

Releases

Packages

Contributors 3

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 214 Commits
.dvc		.dvc
data		data
docs		docs
models		models
notebooks		notebooks
references		references
reports		reports
src		src
.dvcignore		.dvcignore
.gitignore		.gitignore
LICENSE		LICENSE
PLANNING.md		PLANNING.md
README.md		README.md
amplify.yml		amplify.yml
docker-compose.yml		docker-compose.yml
dvc-local-remote.dvc		dvc-local-remote.dvc
env.sample		env.sample
requirements.txt		requirements.txt
setup.py		setup.py
test_environment.py		test_environment.py

License

tthoraldson/PetMatch

Folders and files

Latest commit

History

Repository files navigation

PetMatch

Team

Local Docker

Run Petmatch Docker

Prerequisites

Project Organization

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages