{ "info": { "author": "Muhammad Azhar", "author_email": "azhar@contentstudio.io", "bugtrack_url": null, "classifiers": [], "description": "A web content parser using Python lxml\n\n\nCompatibility\n-------------\n\nThe library is compatible with Python3. Python2 is currently not supported.\n\n\nUsage\n-----\n\nInstall the package using pip.\n\n```\npip install webparser-py\n```\n\n**Convert to Document**\n\nAccept the html content document, convert it to the doc element, if we want to convert relative links to absolute links, \nwe pass the domain url to the absolute links.\n\n**convert_to_doc()**\n\n```\nfrom webparser.parser import convert_to_doc\n\ndoc = convert_to_doc('HTML content', 'http://yourwebsite.com')\n\n```\n\n**class FeedParser()**\n\nFeed parser class is used for parsing the feed through the response content or using a URL.\n\n\n```\nfrom webparser.parser import FeedParser\n\nfeed = FeedParser() # optional feed URL can be provided.\nparsed_links = feed.parse(url='http://viralnova.com/feed') # url will override constructor feed URL. \n```\n\n**has_rss_feed()**\n\nCheck if the website/URL has a RSS feed link present.\n\n - Check the document with Mimetype of links using XPATH.\n - Fuzzy URL search e.g /feed at the end of the website URL. (Attempted if no links for the RSS URL found)\n\n```\nfrom webparser.parser import has_rss_feed\nrss_links = has_rss_feed(doc=html_content, url=website_url)\n```\n\n\n\n", "description_content_type": "", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://contentstudio.io", "keywords": "", "license": "MIT License", "maintainer": "Muhammad Azhar", "maintainer_email": "azhar@contentstudio.io", "name": "webparser-py", "package_url": "https://pypi.org/project/webparser-py/", "platform": "", "project_url": "https://pypi.org/project/webparser-py/", "project_urls": { "Homepage": "https://contentstudio.io" }, "release_url": "https://pypi.org/project/webparser-py/0.3/", "requires_dist": [ "lxml", "feedparser", "requests", "urllib3; extra == 'full'", "certifi; extra == 'full'" ], "requires_python": "", "summary": "A web parser wrapper on top of lxml and selectolax", "version": "0.3" }, "last_serial": 4082057, "releases": { "0.2": [ { "comment_text": "", "digests": { "md5": "1d90394cdfbccd09cf3ccf4f8dee2733", "sha256": "4f82806f665636b8046b09ce14f283950116e4a269990c030f453ba5e3ac87da" }, "downloads": -1, "filename": "webparser_py-0.2-py3-none-any.whl", "has_sig": false, "md5_digest": "1d90394cdfbccd09cf3ccf4f8dee2733", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 5330, "upload_time": "2018-07-19T13:20:09", "url": "https://files.pythonhosted.org/packages/0e/c8/46156d2e0300381ec2af73b8eccb9f4455be9a5207494b872b8c2316f9d4/webparser_py-0.2-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "75948ea0b1815c4765a0db37bd585ad6", "sha256": "f95bf054b4a19ee89bfc4db9f7f4bd7945e5378dd926da73d2e9ea339e0011bc" }, "downloads": -1, "filename": "webparser-py-0.2.tar.gz", "has_sig": false, "md5_digest": "75948ea0b1815c4765a0db37bd585ad6", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 4599, "upload_time": "2018-07-19T13:20:11", "url": "https://files.pythonhosted.org/packages/33/22/fc2d4b00d450acb75b47c76e869af21e4fb7518492d832030f55310b928a/webparser-py-0.2.tar.gz" } ], "0.3": [ { "comment_text": "", "digests": { "md5": "e1f2ed4430484895f9db76a65f8ac48b", "sha256": "93fb244a4a2a12639e667d473cdae6561110ad1a442ea96f6d2fb1f6f4b1ef11" }, "downloads": -1, "filename": "webparser_py-0.3-py3-none-any.whl", "has_sig": false, "md5_digest": "e1f2ed4430484895f9db76a65f8ac48b", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 5330, "upload_time": "2018-07-19T13:21:36", "url": "https://files.pythonhosted.org/packages/5f/ce/d840ea1b729a4abc789379d45595361d5a11bb6c24449852df01ba3bc910/webparser_py-0.3-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "7c098143fddb3735fe09af4ec42c7849", "sha256": "d3b510b7152d55480dd4a0a679415a63e8a4d1333f7692b9fa66010061a3d14c" }, "downloads": -1, "filename": "webparser-py-0.3.tar.gz", "has_sig": false, "md5_digest": "7c098143fddb3735fe09af4ec42c7849", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 4603, "upload_time": "2018-07-19T13:21:37", "url": "https://files.pythonhosted.org/packages/9f/94/4c25bce9ef18054b7e97511c8487264e43f8196ae0a62cada0ac3d438691/webparser-py-0.3.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "e1f2ed4430484895f9db76a65f8ac48b", "sha256": "93fb244a4a2a12639e667d473cdae6561110ad1a442ea96f6d2fb1f6f4b1ef11" }, "downloads": -1, "filename": "webparser_py-0.3-py3-none-any.whl", "has_sig": false, "md5_digest": "e1f2ed4430484895f9db76a65f8ac48b", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 5330, "upload_time": "2018-07-19T13:21:36", "url": "https://files.pythonhosted.org/packages/5f/ce/d840ea1b729a4abc789379d45595361d5a11bb6c24449852df01ba3bc910/webparser_py-0.3-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "7c098143fddb3735fe09af4ec42c7849", "sha256": "d3b510b7152d55480dd4a0a679415a63e8a4d1333f7692b9fa66010061a3d14c" }, "downloads": -1, "filename": "webparser-py-0.3.tar.gz", "has_sig": false, "md5_digest": "7c098143fddb3735fe09af4ec42c7849", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 4603, "upload_time": "2018-07-19T13:21:37", "url": "https://files.pythonhosted.org/packages/9f/94/4c25bce9ef18054b7e97511c8487264e43f8196ae0a62cada0ac3d438691/webparser-py-0.3.tar.gz" } ] }