Skip to content
forked from matth/juicer

Juicer is a web API for extracting text, meta data and named entities from HTML "article" type pages.

Notifications You must be signed in to change notification settings

skyshard/juicer

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

78 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Juicer

Juicer is a web API for extracting text, meta data and named entities from HTML "article" type pages.

For more info visit: http://juicer.herokuapp.com/

Running the app (Developers)

Locally

  • Run sbt test to test the app (sbt must be sbt 0.11, not 0.7)
  • Run sbt stage to stage the app
  • Run juicer-web/target/start to run the server
  • Now open http://localhost:8080 in a browser

On Heroku

  • Clone the repo
  • Install the Heroku tools; be sure heroku is on your path
  • Type these commands inside the application's git clone:
    • heroku create --stack cedar
    • heroku config:add BUILDPACK_URL=https://github.com/heroku/heroku-buildpack-scala.git#perm-gen
    • git push heroku master
    • heroku open

About

Juicer is a web API for extracting text, meta data and named entities from HTML "article" type pages.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • HTML 86.9%
  • Scala 12.4%
  • Other 0.7%