{ "info": { "author": "scrapehero", "author_email": "pypi@scrapehero.com", "bugtrack_url": null, "classifiers": [ "Development Status :: 2 - Pre-Alpha", "Intended Audience :: Developers", "License :: OSI Approved :: MIT License", "Natural Language :: English", "Programming Language :: Python :: 2", "Programming Language :: Python :: 2.7", "Programming Language :: Python :: 3", "Programming Language :: Python :: 3.4", "Programming Language :: Python :: 3.5", "Programming Language :: Python :: 3.6", "Programming Language :: Python :: 3.7" ], "description": "===========\nselectorlib\n===========\n\n\n.. image:: https://img.shields.io/pypi/v/selectorlib.svg\n :target: https://pypi.python.org/pypi/selectorlib\n\n.. image:: https://img.shields.io/travis/scrapehero/selectorlib.svg\n :target: https://travis-ci.org/scrapehero/selectorlib\n\n.. image:: https://readthedocs.org/projects/selectorlib/badge/?version=latest\n :target: https://selectorlib.readthedocs.io/en/latest/?badge=latest\n :alt: Documentation Status\n\n\n.. image:: https://pyup.io/repos/github/scrapehero/selectorlib/shield.svg\n :target: https://pyup.io/repos/github/scrapehero/selectorlib/\n :alt: Updates\n\n\n\nA library to read a YML file with Xpath or CSS Selectors and extract data from HTML pages using them\n\n* Free software: MIT license\n* Documentation: https://selectorlib.readthedocs.io.\n\n\nExample\n--------\n\n>>> from selectorlib import Extractor\n>>> yaml_string = \"\"\"\n title:\n css: \"h1\"\n type: Text\n link:\n css: \"h2 a\"\n type: Link\n \"\"\"\n>>> extractor = Extractor.from_yaml_string(yaml_string)\n>>> html = \"\"\"\n