{
"info": {
"author": "pengr",
"author_email": "pengrui55555@163.com",
"bugtrack_url": null,
"classifiers": [
"Environment :: Console",
"Intended Audience :: Developers",
"Natural Language :: Chinese (Simplified)",
"Operating System :: OS Independent",
"Programming Language :: Python :: 3",
"Programming Language :: Python :: 3.5",
"Programming Language :: Python :: 3 :: Only",
"Topic :: Communications :: Email",
"Topic :: Documentation :: Sphinx",
"Topic :: Internet",
"Topic :: Software Development :: Libraries :: Python Modules",
"Topic :: Software Development :: Version Control :: Git"
],
"description": "simple Spider\n=============\n\n| |python -> 3.4+|\n| |coverage -> 37%|\n| |build -> passing|\n\n::\n\n _ _ _____ _ _ \n (_) | | / ____| (_) | | \n ___ _ _ __ ___ _ __ | | ___ | (___ _ __ _ __| | ___ _ __ \n / __| | '_ ` _ \\| '_ \\| |/ _ \\ \\___ \\| '_ \\| |/ _` |/ _ \\ '__|\n \\__ \\ | | | | | | |_) | | __/ ____) | |_) | | (_| | __/ | \n |___/_|_| |_| |_| .__/|_|\\___| |_____/| .__/|_|\\__,_|\\___|_| \n | | | | \n |_| |_| \n\n`\u4e2d\u6587 <./Readme-zh.md>`__\n\nOverview\n--------\n\nA simple web crawling\nframework.\\ `Document `__\n\nGetting Started\n---------------\n\n``pip install simple-spiders``\n\nYou should construct project.py to suit your needs\n\n::\n\n from crawler.spider import Spider\n from crawler.writter import DataWriter\n\n spider = Spider(\n 'https://movie.douban.com/subject/26810318/comments?start=0&limit=20&sort=new_score&status=P')\n spider.start_crawl()\n\n``python project.py``\n\n``Ctrl-C to stop``\n\nReferenced Libraries\n--------------------\n\n- Using `requests `__ as\n htmlDownloader\n- Using `lxml `__ as default htmlParser\n- Using `csv `__ to export\n files as CSV\n- Using `xlwt `__ to\n export files as Excel\n- Using `xlsxwriter `__ to\n export files as Excel (xlsx)\n\nUsage\n-----\n\nProject structure\n-----------------\n\n::\n\n - crawler/\n - __init__.py\n - test/\n - htmlDownloder_test\n - htmlParser_test\n - requestManager_test\n - writter_test\n - logger_test\n - spider_test\n \n - htmlDownloder\n - htmlParser\n - requestManager\n - writter\n - logger\n - spider\n\n - main.py\n\nLicense\n-------\n\nThis project is published as open source under the [|license|\\ ] agreement.\nPlease keep modified versions open source and credit the\noriginal author. Thank you for your respect.\n\nIf you need to use this project for commercial purposes, please\ncontact me (`@pengr `__) separately to\nobtain commercial authorization.\n\n.. |python -> 3.4+| image:: ./images/python-3.4+-green.svg\n.. |coverage -> 37%| image:: https://img.shields.io/badge/coverage-37%25-yellowgreen.svg\n.. |build -> passing| image:: ./images/build-passing-orange.svg\n.. |license| image:: ./images/license-LGPL--3.0-orange.svg\n",
"description_content_type": "",
"docs_url": null,
"download_url": "",
"downloads": {
"last_day": -1,
"last_month": -1,
"last_week": -1
},
"home_page": "https://github.com/duiliuliu/simple-spiders",
"keywords": "",
"license": "MIT",
"maintainer": "",
"maintainer_email": "",
"name": "simple-spiders",
"package_url": "https://pypi.org/project/simple-spiders/",
"platform": "",
"project_url": "https://pypi.org/project/simple-spiders/",
"project_urls": {
"Homepage": "https://github.com/duiliuliu/simple-spiders"
},
"release_url": "https://pypi.org/project/simple-spiders/0.1.5/",
"requires_dist": null,
"requires_python": "",
"summary": "A simple web crawling framework.",
"version": "0.1.5"
},
"last_serial": 4928907,
"releases": {
"0.1.1": [
{
"comment_text": "",
"digests": {
"md5": "c26fceaad7cd451235420b35bed11ec7",
"sha256": "8f2376c999730f04b9bc8e447fe65f10838156b40ebd876994dcfdf6af382fec"
},
"downloads": -1,
"filename": "simple-spiders-0.1.1.tar.gz",
"has_sig": false,
"md5_digest": "c26fceaad7cd451235420b35bed11ec7",
"packagetype": "sdist",
"python_version": "source",
"requires_python": null,
"size": 10950,
"upload_time": "2018-09-11T03:04:05",
"url": "https://files.pythonhosted.org/packages/9a/55/d2ec941586f2e25d77a980e63186d6e0338a58380e2d32bec7cd78917321/simple-spiders-0.1.1.tar.gz"
}
],
"0.1.2": [
{
"comment_text": "",
"digests": {
"md5": "2a72fc1547307f634c798cbcd46c9617",
"sha256": "e10bfd63de9308fca7477ff0f2a320b69a7b237a17271a34003359479a2c4789"
},
"downloads": -1,
"filename": "simple-spiders-0.1.2.tar.gz",
"has_sig": false,
"md5_digest": "2a72fc1547307f634c798cbcd46c9617",
"packagetype": "sdist",
"python_version": "source",
"requires_python": null,
"size": 10940,
"upload_time": "2018-09-11T05:31:57",
"url": "https://files.pythonhosted.org/packages/58/87/80606ea85aa8162d7e9c2334a868987d559085a3aa97432257bff75a46a5/simple-spiders-0.1.2.tar.gz"
}
],
"0.1.3": [
{
"comment_text": "",
"digests": {
"md5": "a398ae6bf02b7afa26f3bb7cc38d37dc",
"sha256": "03b417f568e65751c02e895924aa4feb0180e61c7f92672470835ad264ec9a82"
},
"downloads": -1,
"filename": "simple-spiders-0.1.3.tar.gz",
"has_sig": false,
"md5_digest": "a398ae6bf02b7afa26f3bb7cc38d37dc",
"packagetype": "sdist",
"python_version": "source",
"requires_python": null,
"size": 10984,
"upload_time": "2018-09-14T00:46:49",
"url": "https://files.pythonhosted.org/packages/c8/12/24e1d9868706ddceb0e478e2975bcb71e690fbc666404bf754cc665f7dc4/simple-spiders-0.1.3.tar.gz"
}
],
"0.1.4": [
{
"comment_text": "",
"digests": {
"md5": "e7c99977c435b15e9e2153adb4895b58",
"sha256": "5a893b3af6a888e4c70c8650178a5778657d19a4301cf06acc6c36690a769941"
},
"downloads": -1,
"filename": "simple-spiders-0.1.4.tar.gz",
"has_sig": false,
"md5_digest": "e7c99977c435b15e9e2153adb4895b58",
"packagetype": "sdist",
"python_version": "source",
"requires_python": null,
"size": 11321,
"upload_time": "2018-09-14T01:42:53",
"url": "https://files.pythonhosted.org/packages/5a/5c/a8e48778841cd3f119cb8bbef1fa8873ccb138947ff1fd6f0d0dc95b6f0f/simple-spiders-0.1.4.tar.gz"
}
],
"0.1.5": [
{
"comment_text": "",
"digests": {
"md5": "9e25e1f529051c26f6f709676e0511b7",
"sha256": "7a97d0fe9fb38f6b794cadf5ddf0f9f5a68e2115b631bd7718a70350a44c646f"
},
"downloads": -1,
"filename": "simple-spiders-0.1.5.tar.gz",
"has_sig": false,
"md5_digest": "9e25e1f529051c26f6f709676e0511b7",
"packagetype": "sdist",
"python_version": "source",
"requires_python": null,
"size": 11382,
"upload_time": "2019-03-12T07:52:29",
"url": "https://files.pythonhosted.org/packages/50/66/5063469f4d0f3eadbfc5b25511feafb2b9e07c0da891cd0be9b90f3701cc/simple-spiders-0.1.5.tar.gz"
}
]
},
"urls": [
{
"comment_text": "",
"digests": {
"md5": "9e25e1f529051c26f6f709676e0511b7",
"sha256": "7a97d0fe9fb38f6b794cadf5ddf0f9f5a68e2115b631bd7718a70350a44c646f"
},
"downloads": -1,
"filename": "simple-spiders-0.1.5.tar.gz",
"has_sig": false,
"md5_digest": "9e25e1f529051c26f6f709676e0511b7",
"packagetype": "sdist",
"python_version": "source",
"requires_python": null,
"size": 11382,
"upload_time": "2019-03-12T07:52:29",
"url": "https://files.pythonhosted.org/packages/50/66/5063469f4d0f3eadbfc5b25511feafb2b9e07c0da891cd0be9b90f3701cc/simple-spiders-0.1.5.tar.gz"
}
]
}