Skip to content
This repository has been archived by the owner on Jul 21, 2020. It is now read-only.
/ movie-crawler Public archive

🕷利用Python爬取豆瓣热映与即将上映的电影信息及预告片,项目部署到服务器并通过crontab每日定时执行。

Notifications You must be signed in to change notification settings

lhz960904/movie-crawler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

34 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Python爬取豆瓣热映与即将上映的电影信息及预告片

利用python去抓取豆瓣页面doubanID数组,并通过请求电影API、模拟访问详情页获取所需要的信息,存入MongoDB数据库。用于movie-api使用。项目部署到服务器并通过crontab每日定时执行。

爬取URL

爬取流程图

流程图

定时任务命令

# 每天凌晨5点执行python3 crawler.py。并将控制台错误输入到error.txt文件里
0 5 * * * /usr/bin/python3 /home/python/movie-crawler/crawler.py 2> /home/python/error.txt  &

log截图

log截图

About

🕷利用Python爬取豆瓣热映与即将上映的电影信息及预告片,项目部署到服务器并通过crontab每日定时执行。

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages