{ "info": { "author": "Arran Hobson Sayers", "author_email": "ahobsonsayers@gmail.com", "bugtrack_url": null, "classifiers": [ "License :: OSI Approved :: GNU Affero General Public License v3", "Operating System :: OS Independent", "Programming Language :: Python :: 3 :: Only", "Topic :: Text Processing :: Markup :: HTML" ], "description": "# html-table-parser-python3\n\nThis module consists of just one small class. Its purpose is to parse HTML\ntables without the help of external modules. Everything it uses is part of Python 3.\n\n## Installation\n\n    pip install html-table-parser-python3\n\n## How to use\n\nExample usage:\n\n    import urllib.request\n    from pprint import pprint\n    from html_table_parser import HTMLTableParser\n\n\n    def url_get_contents(url):\n        \"\"\" Open a website and read its binary contents (HTTP response body) \"\"\"\n        req = urllib.request.Request(url=url)\n        f = urllib.request.urlopen(req)\n        return f.read()\n\n\n    def main():\n        url = 'http://www.twitter.com'\n        xhtml = url_get_contents(url).decode('utf-8')\n\n        p = HTMLTableParser()\n        p.feed(xhtml)\n        pprint(p.tables)\n\n\n    if __name__ == '__main__':\n        main()\n\nThe parser produces a nested list of tables, each containing rows, each\ncontaining cells as strings. Tags in cells are stripped and their text content\nis joined. The console output for parsing all tables on the Twitter home page\nlooks like this:\n\n```\n>>> \n[[['', 'Anmelden']],\n [['Land', 'Code', 'F\u00fcr Kunden von'],\n  ['Vereinigte Staaten', '40404', '(beliebig)'],\n  ['Kanada', '21212', '(beliebig)'],\n  ...\n  ['3424486444', 'Vodafone'],\n  ['Zeige SMS-Kurzwahlen f\u00fcr andere L\u00e4nder']]]\n```\n\n## Credit\n\nAll credit goes to Josua Schmid (schmijos). This is all his work; I just uploaded it to PyPI. 
Original repository can be found at:\n\nhttps://github.com/schmijos/html-table-parser-python3\n\n\n## License\n\nGNU GPL v3\n\n\n", "description_content_type": "text/markdown", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/ahobsonsayers/html-table-parser-python3", "keywords": "", "license": "AGPLv3", "maintainer": "", "maintainer_email": "", "name": "html-table-parser-python3", "package_url": "https://pypi.org/project/html-table-parser-python3/", "platform": "", "project_url": "https://pypi.org/project/html-table-parser-python3/", "project_urls": { "Homepage": "https://github.com/ahobsonsayers/html-table-parser-python3" }, "release_url": "https://pypi.org/project/html-table-parser-python3/0.1.5/", "requires_dist": null, "requires_python": "", "summary": "A small and simple HTML table parser not requiring any external dependency.", "version": "0.1.5" }, "last_serial": 5404022, "releases": { "0.1.1": [ { "comment_text": "", "digests": { "md5": "44bbfdc0edfa4ab480cfa45eaaf74822", "sha256": "92362ededa646c7d21ec868f01b3790c5a67da461d34db227ee000740aadff55" }, "downloads": -1, "filename": "html_table_parser_python3-0.1.1-py3-none-any.whl", "has_sig": false, "md5_digest": "44bbfdc0edfa4ab480cfa45eaaf74822", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 2793, "upload_time": "2018-07-30T19:25:45", "url": "https://files.pythonhosted.org/packages/98/54/5a91833c1cf07896f9caa824c1c8eb7d7522cc17e284b14a42b0ce1a0483/html_table_parser_python3-0.1.1-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "30abb6a3ad651fe3834724660faaae98", "sha256": "d46b80345e01387e9ff10301ad89bb57f5dd15ae45ad3718383fc26dbf01f403" }, "downloads": -1, "filename": "html_table_parser_python3-0.1.1.tar.gz", "has_sig": false, "md5_digest": "30abb6a3ad651fe3834724660faaae98", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 
2734, "upload_time": "2018-07-30T19:25:46", "url": "https://files.pythonhosted.org/packages/42/97/ea870950f09861941ca2e96eb3fd90b361f66d31a44fbd04cc32b554a8a8/html_table_parser_python3-0.1.1.tar.gz" } ], "0.1.2": [ { "comment_text": "", "digests": { "md5": "512acb9894ad1f2af372264146b4e76e", "sha256": "dbe51e9117c5cd05ee5a2127eddcef06539ba1698b02f61cd5e8b4e6e6afbc9a" }, "downloads": -1, "filename": "html_table_parser_python3-0.1.2-py3-none-any.whl", "has_sig": false, "md5_digest": "512acb9894ad1f2af372264146b4e76e", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 3597, "upload_time": "2018-07-30T19:53:53", "url": "https://files.pythonhosted.org/packages/67/d7/72d0d77cd4b6305aa1240a116ace804ed373d4e796c03e0f09ad3e6101fe/html_table_parser_python3-0.1.2-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "3ac7160ef9fb4a0c43604a7931c69d78", "sha256": "d84cef7af1b00ff75a6a5ece83a85b0f297b9ac4d8322cd4eff94596fd798a9d" }, "downloads": -1, "filename": "html-table-parser-python3-0.1.2.tar.gz", "has_sig": false, "md5_digest": "3ac7160ef9fb4a0c43604a7931c69d78", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 3068, "upload_time": "2018-07-30T19:53:54", "url": "https://files.pythonhosted.org/packages/3b/43/ef384b1315d4e792b8c4f1fe2ba4e0d265cf3bc901e48fff8ffc940d7326/html-table-parser-python3-0.1.2.tar.gz" } ], "0.1.3": [ { "comment_text": "", "digests": { "md5": "f87966ff30fd5d0ce2d315917c3e0ed5", "sha256": "6528bc49abf399320f92a230cbd351fe663f97b74e52095ef7f905feb49bb684" }, "downloads": -1, "filename": "html_table_parser_python3-0.1.3-py3-none-any.whl", "has_sig": false, "md5_digest": "f87966ff30fd5d0ce2d315917c3e0ed5", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 3590, "upload_time": "2018-07-30T22:55:24", "url": 
"https://files.pythonhosted.org/packages/e3/67/966e5b64c87b275c94dc9e1bb34ca41db926cca534a12d33aa37aa83b3e8/html_table_parser_python3-0.1.3-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "d0e746e53e5ce1e609905ea3c0351bb5", "sha256": "621b7beec3d168dba87e9da064096fde2a7d62f92b7de3449f0c0727a88ee417" }, "downloads": -1, "filename": "html-table-parser-python3-0.1.3.tar.gz", "has_sig": false, "md5_digest": "d0e746e53e5ce1e609905ea3c0351bb5", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 3072, "upload_time": "2018-07-30T22:55:25", "url": "https://files.pythonhosted.org/packages/1f/8f/e5990b18b02f2e0febc87dc2bdba49c6b93ce8ae520598f9db69f3ff0ab2/html-table-parser-python3-0.1.3.tar.gz" } ], "0.1.5": [ { "comment_text": "", "digests": { "md5": "f5e9f6049cf361f1c3dc86939b310bcb", "sha256": "3ed1ea7844dd2470dcc660e28943e67c494f5460755bd60cbc4e65370634f854" }, "downloads": -1, "filename": "html_table_parser_python3-0.1.5-py3-none-any.whl", "has_sig": false, "md5_digest": "f5e9f6049cf361f1c3dc86939b310bcb", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 3549, "upload_time": "2018-07-31T18:56:54", "url": "https://files.pythonhosted.org/packages/91/c2/2cf839cec7227e5f1f03fed871c449ad38a47fe73975c3ca57edb9b89207/html_table_parser_python3-0.1.5-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "7fd51f856b57fdfe6b89250c026673fa", "sha256": "5a3629271e199e1022c5b6e27e5c07c13b6b2e45826479098deaac5b554c58bd" }, "downloads": -1, "filename": "html-table-parser-python3-0.1.5.tar.gz", "has_sig": false, "md5_digest": "7fd51f856b57fdfe6b89250c026673fa", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 3079, "upload_time": "2018-07-31T18:56:55", "url": "https://files.pythonhosted.org/packages/d0/05/5b86d1f1c70a590e2fecd8af87c9dfd12def1bf406f243d3d3618356bfc2/html-table-parser-python3-0.1.5.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { 
"md5": "f5e9f6049cf361f1c3dc86939b310bcb", "sha256": "3ed1ea7844dd2470dcc660e28943e67c494f5460755bd60cbc4e65370634f854" }, "downloads": -1, "filename": "html_table_parser_python3-0.1.5-py3-none-any.whl", "has_sig": false, "md5_digest": "f5e9f6049cf361f1c3dc86939b310bcb", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 3549, "upload_time": "2018-07-31T18:56:54", "url": "https://files.pythonhosted.org/packages/91/c2/2cf839cec7227e5f1f03fed871c449ad38a47fe73975c3ca57edb9b89207/html_table_parser_python3-0.1.5-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "7fd51f856b57fdfe6b89250c026673fa", "sha256": "5a3629271e199e1022c5b6e27e5c07c13b6b2e45826479098deaac5b554c58bd" }, "downloads": -1, "filename": "html-table-parser-python3-0.1.5.tar.gz", "has_sig": false, "md5_digest": "7fd51f856b57fdfe6b89250c026673fa", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 3079, "upload_time": "2018-07-31T18:56:55", "url": "https://files.pythonhosted.org/packages/d0/05/5b86d1f1c70a590e2fecd8af87c9dfd12def1bf406f243d3d3618356bfc2/html-table-parser-python3-0.1.5.tar.gz" } ] }