{ "info": { "author": "Alex Pereverzyev", "author_email": "pereverzev.alex@gmail.com", "bugtrack_url": null, "classifiers": [ "Development Status :: 4 - Beta", "Environment :: Console", "Environment :: Web Environment", "Intended Audience :: Developers", "License :: OSI Approved :: BSD License", "Operating System :: OS Independent", "Programming Language :: Python", "Programming Language :: Python :: 2.7", "Topic :: Internet :: WWW/HTTP", "Topic :: Software Development :: Interpreters", "Topic :: Software Development :: Libraries :: Python Modules" ], "description": "======\nSpidy\n======\n\n.. image:: https://badge.fury.io/py/Spidy.png\n :target: http://badge.fury.io/py/Spidy\n\n.. image:: https://travis-ci.org/AlexPereverzyev/spidy.svg\n :target: https://travis-ci.org/AlexPereverzyev/spidy\n\nOverview\n========\n\nSpidy is an open source scripting language for Web scraping. It lets you build\ntools and applications that provide Web scraping features to a wide audience of\nusers.\n\nDespite being a scripting language, Spidy attempts to standardize Web scraping\ntasks around fundamental tools such as Web-get and XPath selectors. While URLs\ncan reference an arbitrary document on the Web, Spidy goes one step further\nand can reference an arbitrary piece of data with just a few lines of script,\nwhich is easy to create, distribute and understand.\n\nHere are the major features the package offers:\n\n* Flexibility of a scripting language\n* XPath selectors to extract data\n* Unified document model for HTML and JSON formats\n* Templates for better output formatting\n* Robust error handling and logging\n\nRequirements\n============\n\nSpidy is written in Python and relies on the Python Standard Library only.\n\n* Python 2.7\n* Mac OS X, Windows, Linux, BSD\n\nInstall\n=======\n\nInstalling from the Python Package Index::\n\n pip install spidy\n\t\nFor Windows installation instructions, please see the documentation in the ``docs`` \ndirectory.\n \n'Hello, World!' 
in Spidy\n========================\n\nLoading a document or Web resource, for example the trending repos page on GitHub::\n\n get 'https://github.com/explore' as html\n \nSelecting and returning trending repo links using XPath with a class selector::\n\n return &'//*[@class=\"repo-name css-truncate css-truncate-target\"]'\n \nAnd all together using the Spidy API::\n\n import spidy\n print spidy.do('''\n get 'https://github.com/explore' as html\n return &'//*[@class=\"repo-name css-truncate css-truncate-target\"]'\n ''')\n \nwill output a list of relative links to stdout. Check out the documentation for more\nexamples.\n\nDocumentation\n=============\n\nDocumentation is available in the ``docs`` directory. Script examples are located\nin the ``examples`` directory.\n\nFeedback\n========\n\nSpidy is non-profit, meaning further development needs a good reason to go on.\nWe are happy to hear positive feedback, but we also can't wait to learn what's\nbad, so please send your thoughts and bug reports to **spidy.feedback@gmail.com**.\nThank you and happy scraping!", "description_content_type": null, "docs_url": "https://pythonhosted.org/Spidy/", "download_url": "UNKNOWN", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/alexpereverzyev/spidy", "keywords": null, "license": "BSD", "maintainer": null, "maintainer_email": null, "name": "Spidy", "package_url": "https://pypi.org/project/Spidy/", "platform": "Mac OS,Linux,Windows", "project_url": "https://pypi.org/project/Spidy/", "project_urls": { "Download": "UNKNOWN", "Homepage": "https://github.com/alexpereverzyev/spidy" }, "release_url": "https://pypi.org/project/Spidy/0.2/", "requires_dist": null, "requires_python": null, "summary": "Spidy - Web scraping simplified!", "version": "0.2" }, "last_serial": 1271692, "releases": { "0.1": [ { "comment_text": "", "digests": { "md5": "44570c2bb7010906e41adce400e2578b", "sha256": "1c56c06b82e700a03c415cf2c52e15b98c6d0b3dd73fce881e0fba51e5dad8df" }, 
"downloads": -1, "filename": "Spidy-0.1.tar.gz", "has_sig": false, "md5_digest": "44570c2bb7010906e41adce400e2578b", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 50330, "upload_time": "2014-07-04T10:37:35", "url": "https://files.pythonhosted.org/packages/45/cd/e2de84c04e775015ead4b317534d49fe138e2c30338009b6e943ea4e4c49/Spidy-0.1.tar.gz" } ], "0.2": [ { "comment_text": "", "digests": { "md5": "c217ffec32cc0a9dc19b01a7fd45514c", "sha256": "2b283ede056850381fa7d7430a58de94c9759aa2d43bd1ec629d16043f2d5ab9" }, "downloads": -1, "filename": "Spidy-0.2.tar.gz", "has_sig": false, "md5_digest": "c217ffec32cc0a9dc19b01a7fd45514c", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 54713, "upload_time": "2014-09-27T09:14:10", "url": "https://files.pythonhosted.org/packages/c2/77/40ee8c82a50556b72725fa763b7a87fb20d1c7d46723fab493950b5ee67c/Spidy-0.2.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "c217ffec32cc0a9dc19b01a7fd45514c", "sha256": "2b283ede056850381fa7d7430a58de94c9759aa2d43bd1ec629d16043f2d5ab9" }, "downloads": -1, "filename": "Spidy-0.2.tar.gz", "has_sig": false, "md5_digest": "c217ffec32cc0a9dc19b01a7fd45514c", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 54713, "upload_time": "2014-09-27T09:14:10", "url": "https://files.pythonhosted.org/packages/c2/77/40ee8c82a50556b72725fa763b7a87fb20d1c7d46723fab493950b5ee67c/Spidy-0.2.tar.gz" } ] }