# Word-Web-Scraper

A Python script that scrapes word counts for every page on a site. It uses urllib, regular expressions, and NLTK's word tokenizer.
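
The approach described above could look roughly like the sketch below. It is not the repository's actual code: the function names (`tokenize`, `count_words`, `extract_links`, `crawl`), the `max_pages` limit, and the same-host link filter are all assumptions made for illustration. It also assumes that NLTK may be missing or lack its tokenizer data, and falls back to a plain regular expression in that case.

```python
import re
from collections import Counter
from urllib.parse import urljoin, urlparse
from urllib.request import urlopen

try:
    from nltk.tokenize import word_tokenize
except ImportError:  # NLTK not installed; use the regex fallback below
    word_tokenize = None

# Drop <script>/<style> blocks entirely, then any remaining tag.
TAG_RE = re.compile(r"<(script|style)\b.*?</\1>|<[^>]+>", re.S | re.I)
HREF_RE = re.compile(r'href=["\'](.*?)["\']', re.I)


def tokenize(text):
    """Split text into tokens with NLTK if usable, else a plain regex."""
    if word_tokenize is not None:
        try:
            return word_tokenize(text)
        except LookupError:  # NLTK tokenizer data has not been downloaded
            pass
    return re.findall(r"[A-Za-z]+", text)


def count_words(html):
    """Return a Counter of lowercased alphabetic words in an HTML string."""
    text = TAG_RE.sub(" ", html)
    return Counter(tok.lower() for tok in tokenize(text) if tok.isalpha())


def extract_links(html, base_url):
    """Return absolute links on the same host as base_url (assumed filter)."""
    host = urlparse(base_url).netloc
    links = {urljoin(base_url, h).split("#")[0] for h in HREF_RE.findall(html)}
    return {link for link in links if urlparse(link).netloc == host}


def crawl(start_url, max_pages=20):
    """Breadth-first crawl; maps each fetched URL to its total word count."""
    seen, queue, counts = set(), [start_url], {}
    while queue and len(seen) < max_pages:
        url = queue.pop(0)
        if url in seen:
            continue
        seen.add(url)
        try:
            html = urlopen(url, timeout=10).read().decode("utf-8", "replace")
        except OSError:  # covers urllib's URLError and HTTPError
            continue
        counts[url] = sum(count_words(html).values())
        queue.extend(extract_links(html, url) - seen)
    return counts
```

Calling `crawl("https://example.com/")` would return a dictionary of per-page totals, while `count_words` can be used on its own to get per-word frequencies for a single page. Regular expressions are a rough way to strip markup; a real HTML parser would be more robust on malformed pages.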