{ "info": { "author": "Guojian Li", "author_email": "guojianlee@gmail.com", "bugtrack_url": null, "classifiers": [ "Development Status :: 3 - Alpha", "Intended Audience :: Developers", "License :: OSI Approved :: BSD License", "Programming Language :: Python :: 3" ], "description": "\nIn.parse\n=========\n0.1.0\n\n\nOpen Collaborative AI Driven Parser builder for Web Scraping, Data Extraction and Crawling,Knowledge Graph\n\n\nInstalling\n----------\n\nInstall and update using ``pip``:\n\n`pip install -U inparse`\n\n\nParser Generator \n-----------------\n\nhttp://inparse.com\n\n\n\n\nMotivation\n----------\n1. Most painful thing of Web Data Extraction is to write parser rule. the Inparse try to \ngenerate the parser by AI according the training web pages.\n2. Commercial Universal Parser work good in Statistics, but failed in my case .And blackbox to user. \nInparser create parser for special website ,web page category\u3002And be correctable and improvable online by yourself.\n3. Open and free to create parser .Parser rule can be cached locally without remote server \nif you have concern. \n4. You will not be charged by usage. Run parser in your own CPU. \n\n\n\n\nExample\n===============================\n\n\n\n\n\n```python\nfrom inparse import Inparse\np=Inparse('b45beddc', #parser no is generator by inparse.com parser builder.\n 'd50cb533f69b6a78892afbd093f95fc1') #access token can be found in your user page .\n\n\nd=p.parse_url('https://qz.com/india/1413291/trulymadly-ceo-on-how-dating-apps-like-bumble-india-must-localise/')\nInparse.pretty_print(d)\n\n```\n\n\n**Or parse in raw html**\n\n```python\n\n from inparse import Inparse\n import requests\n p=Inparse('b45beddc', #parser no is generator by inparse.com parser builder.\n 'd50cb533f69b6a78892afbd093f95fc1') #access token can be found in your user page .\n\n html=requests.get('https://qz.com/india/1413291/trulymadly-ceo-on-how-dating-apps-like-bumble-india-must-localise/').text\n d=p.parse(html)\n Inparse.pretty_print(d)\n\n```\n\nBelow is output of Article data extraction \n\n```\n\n { 'article_body': '
',\n 'author': 'Kuwar Singh',\n 'publish_date': None,\n 'title': 'Young Indians are using dating apps for so much more than just '\n 'dating',\n 'top_image': [ 'https://cms.qz.com/wp-content/uploads/2018/10/AP_900509923043-e1538971405267.jpg?quality=75&strip=all&w=410&h=231']\n }\n\n```\n\n\n\nMore about Inparse\n===============\n\n\n\nContributing\n------------\nYou are welcome to port this SDK to Java, Go ,or any other programming languages.\n\nDonate\n------\n\n\nLinks\n-----\n\n\n", "description_content_type": "", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/inparse/inparse", "keywords": "", "license": "BSD", "maintainer": "", "maintainer_email": "", "name": "inparse", "package_url": "https://pypi.org/project/inparse/", "platform": "", "project_url": "https://pypi.org/project/inparse/", "project_urls": { "Homepage": "https://github.com/inparse/inparse" }, "release_url": "https://pypi.org/project/inparse/0.1.1/", "requires_dist": [ "requests", "beautifulsoup4", "lxml" ], "requires_python": ">=3.6.5", "summary": "Collaborative AI for Web Scraping, Data Extraction and Crawling,Knowledge Graph", "version": "0.1.1" }, "last_serial": 4525767, "releases": { "0.1.0": [ { "comment_text": "", "digests": { "md5": "ee8980840e42639fd82bd90d250f55cb", "sha256": "cf7185961330e4a14db7ab40a6195d933dfa2e0901dfe8d9c462b4fb9b9a776b" }, "downloads": -1, "filename": "inparse-0.1.0-py3-none-any.whl", "has_sig": false, "md5_digest": "ee8980840e42639fd82bd90d250f55cb", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": ">=3.6.5", "size": 3769, "upload_time": "2018-10-12T14:16:08", "url": "https://files.pythonhosted.org/packages/18/d9/d895ae1bd619fbdab3d76cb7bf1927803676f60c127e446bfc283b938cf2/inparse-0.1.0-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "8ac1dca74bbf93a691d41f814caadc63", "sha256": "79b4daf7de3a7f2a39b5b17f53e733f214bd8ab72174fabef54c2fcd24d531de" }, "downloads": -1, "filename": "inparse-0.1.0.tar.gz", "has_sig": false, "md5_digest": "8ac1dca74bbf93a691d41f814caadc63", "packagetype": "sdist", "python_version": "source", "requires_python": ">=3.6.5", "size": 3594, "upload_time": "2018-10-12T14:16:09", "url": "https://files.pythonhosted.org/packages/cb/e8/eb1d828ee4fcb9fbfce701799092de9da4232b992f7e6bd8f4f233ea9d5c/inparse-0.1.0.tar.gz" } ], "0.1.1": [ { "comment_text": "", "digests": { "md5": "7ecccd305685a21ce183b08471d2e8df", "sha256": "04f7b617b646a1841ef48e3347dac1a2a36002acb3507cb40d90c2df044e2aef" }, "downloads": -1, "filename": "inparse-0.1.1-py3-none-any.whl", "has_sig": false, "md5_digest": "7ecccd305685a21ce183b08471d2e8df", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": ">=3.6.5", "size": 4727, "upload_time": "2018-11-25T10:47:06", "url": "https://files.pythonhosted.org/packages/8d/07/0e83ab7dfaa6517363291a8a3cd16f0ecb15e87ca754c3d09179c0baacd1/inparse-0.1.1-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "d4c4b77fe70530e78cdb05b73a125164", "sha256": "0683ff08db7391972f13252de74dbd3001cdeeaf1e2e6b0d68bf1e017e24af07" }, "downloads": -1, "filename": "inparse-0.1.1.tar.gz", "has_sig": false, "md5_digest": "d4c4b77fe70530e78cdb05b73a125164", "packagetype": "sdist", "python_version": "source", "requires_python": ">=3.6.5", "size": 4650, "upload_time": "2018-11-25T10:47:07", "url": "https://files.pythonhosted.org/packages/0e/7e/75133fa266a3bfa659f9e16bb17d02cf02bd9bd738032de74fedf50fa11d/inparse-0.1.1.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "7ecccd305685a21ce183b08471d2e8df", "sha256": "04f7b617b646a1841ef48e3347dac1a2a36002acb3507cb40d90c2df044e2aef" }, "downloads": -1, "filename": "inparse-0.1.1-py3-none-any.whl", "has_sig": false, "md5_digest": "7ecccd305685a21ce183b08471d2e8df", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": ">=3.6.5", "size": 4727, "upload_time": "2018-11-25T10:47:06", "url": "https://files.pythonhosted.org/packages/8d/07/0e83ab7dfaa6517363291a8a3cd16f0ecb15e87ca754c3d09179c0baacd1/inparse-0.1.1-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "d4c4b77fe70530e78cdb05b73a125164", "sha256": "0683ff08db7391972f13252de74dbd3001cdeeaf1e2e6b0d68bf1e017e24af07" }, "downloads": -1, "filename": "inparse-0.1.1.tar.gz", "has_sig": false, "md5_digest": "d4c4b77fe70530e78cdb05b73a125164", "packagetype": "sdist", "python_version": "source", "requires_python": ">=3.6.5", "size": 4650, "upload_time": "2018-11-25T10:47:07", "url": "https://files.pythonhosted.org/packages/0e/7e/75133fa266a3bfa659f9e16bb17d02cf02bd9bd738032de74fedf50fa11d/inparse-0.1.1.tar.gz" } ] }