QZone Crawler

基于Scrapy的QQ空间照片/相册爬虫。

Dependence

python 2.7
scrapy==1.3.2
qqlib==1.0.0

Usage

git clone [email protected]:Lodour/QZone-Crawler.git
cd QZone-Crawler
virtualenv env --python=python2.7
source ./env/bin/activate
pip install scrapy==1.3.2 qqlib==1.0.0
cd qzone
scrapy crawl photo [-a qq=<Your QQ>] [-a target=<Target QQ>]
图片保存在QZone-Crawler/qzone/downloads文件夹内

Hint

API可能会失效，一般而言最近一次Commit的时候是有效的
项目仍在编写中，目前仅完成了爬取原创说说中的照片
如果非原创的说说图片也需要爬取

请将QZone-Crawler/qzone/qzone/spiders/photo.py中的assert not 'rt_certified' in msg注释掉

没有对virtualenv的配置进行测试，如有问题请自行解决
~~及时清空终端命令历史~~不指定参数时，爬虫开启后会提示输入

License

MIT License

Todo

相册爬虫
空间API应该放在单独的包里，以便于管理和更新
密码不应该在参数中明文输入
优化对Cookies的处理，应该自动保持而不被取消

Update

2017-2-27

使用getpass进行密码输入

2017-2-26

Initial commit

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
qzone		qzone
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

QZone Crawler

Dependence

Usage

Hint

License

Todo

Update

2017-2-27

2017-2-26

About

Uh oh!

Releases

Packages

Languages

License

Lodour/QZone-Crawler

Folders and files

Latest commit

History

Repository files navigation

QZone Crawler

Dependence

Usage

Hint

License

Todo

Update

2017-2-27

2017-2-26

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages