Skip to content

Latest commit

 

History

History
29 lines (17 loc) · 545 Bytes

README.md

File metadata and controls

29 lines (17 loc) · 545 Bytes

ptt-scrapy

A powerful scrapy spider make all-out effort to collect information from PTT into database.

Requisites

  • Python 3
  • MongoDB

Usage

  • Make sure your mongod is running in system.

  • Run with scrapy command:

    scrapy crawl ptt
  • (Option) You can export these data into json, e.g.:

    mongoexport --db ptt --collection mobilecomm --out mobilecomm.json

Acknowledgement