{ "info": { "author": "SuanCaiYu", "author_email": "suancaiyu0413@gmail.com", "bugtrack_url": null, "classifiers": [ "License :: OSI Approved :: MIT License", "Programming Language :: Python", "Programming Language :: Python :: 3", "Programming Language :: Python :: 3.6", "Programming Language :: Python :: Implementation :: CPython", "Programming Language :: Python :: Implementation :: PyPy" ], "description": "\n# scrapy-ssdb-spider\n\n- A close imitation of scrapy-redis\n- A distributed Scrapy solution built on SSDB queues\n\n## Dependencies\n\n- Python 3.6 (tested environment)\n- SSDB 1.9.7\n- scrapy\n- pyssdb\n\n## Usage\n\nshell:\n```\ngit clone https://github.com/PickledFish/scrapy-ssdb-spider\ncd scrapy-ssdb-spider\npython3 setup.py install\n```\n\nOr\n\n```\npip install scrapy-ssdb-spider\n```\n\nIn your Scrapy project:\n\n```python\n# settings\n# SSDB server\nSSDB_HOST = '127.0.0.1'\nSSDB_PORT = 8888\n# SSDB password (optional)\n#SSDB_PWD = 'your password'\n# Scheduler\nSCHEDULER = 'scrapy_ssdb_spider.scheduler.Scheduler'\n# Duplicate filter class\nDUPEFILTER_CLASS = 'scrapy_ssdb_spider.dupefilter.SSDBDupeFilter'\n# Scheduler queue key (optional)\n#SCHEDULER_QUEUE_KEY = ''\n# Scheduler queue class (optional)\n#SCHEDULER_QUEUE_CLASS = ''\n# Dupefilter queue key\n#SCHEDULER_DUPEFILTER_KEY = ''\n\n# About the two settings below: what if I start spider A first and start spider B half an hour later?\n# Does the queue get cleared? I never figured that out, but scrapy-redis has this feature, so I added it too. By default the queues are not cleared.\n# 
Clear the dupefilter and scheduler queues before the spider starts (boolean)\n#SCHEDULER_OPEN_CLEAR_QUEUE = \n# Clear the dupefilter and scheduler queues after the spider finishes (boolean)\n#SCHEDULER_CLOSE_CLEAR_QUEUE = \n\n```\n\n```python\n\n# Write the spider\nfrom scrapy_ssdb_spider.spiders import SsdbSpider\n\nclass TestSpider(SsdbSpider):\n    # Seed queue key\n    ssdb_key = 'start_key'\n\n    def parse(self, response):\n        pass\n\n```\n\n- Everything closely resembles scrapy_redis, even the code\n- You should have no trouble getting started; feedback is welcome\n\n\n## Differences\n\nAlthough the code follows scrapy-redis throughout, some features are not implemented:\n\n- The SSDB-based Pipeline is not implemented\n- ~~No setting for clearing the queues when the spider starts or finishes~~\n- The rest I have forgotten\n\n\n", "description_content_type": "text/markdown", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/PickledFish/scrapy-ssdb-spider", "keywords": "", "license": "MIT", "maintainer": "", "maintainer_email": "", "name": "scrapy-ssdb-spider", "package_url": "https://pypi.org/project/scrapy-ssdb-spider/", "platform": "", "project_url": "https://pypi.org/project/scrapy-ssdb-spider/", "project_urls": { "Homepage": "https://github.com/PickledFish/scrapy-ssdb-spider" }, "release_url": "https://pypi.org/project/scrapy-ssdb-spider/0.1.1/", "requires_dist": [ "scrapy", "pyssdb" ], "requires_python": ">=3.6.0", "summary": "Ssdb-based components for Scrapy.", "version": "0.1.1" }, "last_serial": 5510929, "releases": 
{ "0.1.0": [ { "comment_text": "", "digests": { "md5": "88814510672604ae35f185839ed3c205", "sha256": "fe417eac801976e026ef028b7e86de75690116c3c58a743cfa6f47d25fee6678" }, "downloads": -1, "filename": "scrapy_ssdb_spider-0.1.0-py3-none-any.whl", "has_sig": false, "md5_digest": "88814510672604ae35f185839ed3c205", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": ">=3.6.0", "size": 8343, "upload_time": "2019-07-10T09:03:08", "url": "https://files.pythonhosted.org/packages/f1/89/b4fc6f0627ceae57a27e6b418eacdab0b8711ababa242a6a75ae9d116570/scrapy_ssdb_spider-0.1.0-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "f0c234f960e16caf1c4a73d7aa5446c2", "sha256": "4f0472bb6ce11159bc533c059308dbe2e86c9c90c1860d874a05870d95d83129" }, "downloads": -1, "filename": "scrapy_ssdb_spider-0.1.0.tar.gz", "has_sig": false, "md5_digest": "f0c234f960e16caf1c4a73d7aa5446c2", "packagetype": "sdist", "python_version": "source", "requires_python": ">=3.6.0", "size": 7329, "upload_time": "2019-07-10T09:03:12", "url": "https://files.pythonhosted.org/packages/bc/3a/f46042e0209886a9d07bc6184ed4fc5ccbb3b957b2b463328c4d39d97e73/scrapy_ssdb_spider-0.1.0.tar.gz" } ], "0.1.1": [ { "comment_text": "", "digests": { "md5": "c30b7375489cd02e5b93e9af84b7b683", "sha256": "05e060876bcdf95a7fefca1a37d359e61c1cc37a0b49f5c232e84d24b03d3065" }, "downloads": -1, "filename": "scrapy_ssdb_spider-0.1.1-py3-none-any.whl", "has_sig": false, "md5_digest": "c30b7375489cd02e5b93e9af84b7b683", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": ">=3.6.0", "size": 8363, "upload_time": "2019-07-10T09:13:15", "url": "https://files.pythonhosted.org/packages/27/13/fa2751c2d40f59320086bb5fb015c5535ab5e32f34b8e8439f79fd534d55/scrapy_ssdb_spider-0.1.1-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "490aced4a1d40fe499c6c6575b8ac656", "sha256": "be4c0a5bdde5b7c562b4a22de8c767f5494e66d3bc4ececc7dc8cc4e09b9cc26" }, "downloads": -1, "filename": 
"scrapy_ssdb_spider-0.1.1.tar.gz", "has_sig": false, "md5_digest": "490aced4a1d40fe499c6c6575b8ac656", "packagetype": "sdist", "python_version": "source", "requires_python": ">=3.6.0", "size": 7359, "upload_time": "2019-07-10T09:13:17", "url": "https://files.pythonhosted.org/packages/83/da/93c5ba5a3f22ffc53a57fc66611b3e4ebabb7212874d0ee10d57f649bac9/scrapy_ssdb_spider-0.1.1.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "c30b7375489cd02e5b93e9af84b7b683", "sha256": "05e060876bcdf95a7fefca1a37d359e61c1cc37a0b49f5c232e84d24b03d3065" }, "downloads": -1, "filename": "scrapy_ssdb_spider-0.1.1-py3-none-any.whl", "has_sig": false, "md5_digest": "c30b7375489cd02e5b93e9af84b7b683", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": ">=3.6.0", "size": 8363, "upload_time": "2019-07-10T09:13:15", "url": "https://files.pythonhosted.org/packages/27/13/fa2751c2d40f59320086bb5fb015c5535ab5e32f34b8e8439f79fd534d55/scrapy_ssdb_spider-0.1.1-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "490aced4a1d40fe499c6c6575b8ac656", "sha256": "be4c0a5bdde5b7c562b4a22de8c767f5494e66d3bc4ececc7dc8cc4e09b9cc26" }, "downloads": -1, "filename": "scrapy_ssdb_spider-0.1.1.tar.gz", "has_sig": false, "md5_digest": "490aced4a1d40fe499c6c6575b8ac656", "packagetype": "sdist", "python_version": "source", "requires_python": ">=3.6.0", "size": 7359, "upload_time": "2019-07-10T09:13:17", "url": "https://files.pythonhosted.org/packages/83/da/93c5ba5a3f22ffc53a57fc66611b3e4ebabb7212874d0ee10d57f649bac9/scrapy_ssdb_spider-0.1.1.tar.gz" } ] }