Keeping it Fresh: Predict Restaurant Inspections

Goal of the Competition

The goal for this competition is to use data from social media to narrow the search for health code violations in Boston. Competitors will have access to historical hygiene violation records from the City of Boston — a leader in open government data — and Yelp's consumer reviews. The challenge: Figure out the words, phrases, ratings, and patterns that predict violations, to help public health inspectors do their job better.

What's in this Repository

This repository contains code volunteered from leading competitors in the Keeping it Fresh: Predict Restaurant Inspections on DrivenData. Code for all winning solutions are open source under the MIT License.

Winning code for other DrivenData competitions is available in the competition-winners repository.

Winning Submissions

Place	Team or User	Score	Summary of Model
1	LilianaMedina	0.8901	I averaged the predictions of a random forest and a gradient boosted model.
2	qwang	0.8931	My approach was focused almost strictly on feature engineering. I used scikit-learn’s implementation of random forests in Python.
3	furiouseskimo	0.9113	I built four models for each target (random forest, extra random trees, gradient boosting machine, l2 logistic regression) and blended the predictions from these models to get my final submission.

Winner's Interview: "Announcing the Results of our Keeping It Fresh Competition"

Benchmark Blog Post: "From raw Yelp reviews to a model of hygiene violations (in 3 easy steps)"

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
1st-place		1st-place
2nd-place		2nd-place
3rd-place		3rd-place
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Keeping it Fresh: Predict Restaurant Inspections

Goal of the Competition

What's in this Repository

Winning Submissions

About

Releases

Packages

Contributors 3

Languages

License

drivendataorg/keeping-it-fresh

Folders and files

Latest commit

History

Repository files navigation

Keeping it Fresh: Predict Restaurant Inspections

Goal of the Competition

What's in this Repository

Winning Submissions

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages