Spider¶

一个简单的爬虫库

预期提供：

更复杂的功能，如爬取特定网站形成结构化数据，反爬虫等内容独立成库

Quick Glance¶

`longling.spider.lib.get_html_code`(url)	get encoded html code from specified url
`longling.spider.download_data.download_file`(url)	cli alias: `download`, download data from specified url

longling.spider.lib.get_html.get_html_code(url)[源代码]¶: get encoded html code from specified url

longling.spider.download_data.download_file(url, save_path=None, override=True, decomp=True, reporthook=None)[源代码]¶

cli alias: download, download data from specified url

参数:	url -- save_path -- override -- decomp -- reporthook --