{ "info": { "author": "HyperFX Tech Team", "author_email": "coffee@hyperfx.tech", "bugtrack_url": null, "classifiers": [ "License :: OSI Approved :: MIT License", "Operating System :: OS Independent", "Programming Language :: Python :: 3" ], "description": "# newsfx\n> D\u1ef1 \u00e1n \u0111ang trong qu\u00e1 tr\u00ecnh th\u1ef1c hi\u1ec7n\n\n## B\u1eaft \u0111\u1ea7u nhanh\n### C\u00e0i \u0111\u1eb7t\n> Ch\u1ec9 support Python 3.6+\n```\npip install newsfx\n```\n\n### Th\u1ef1c hi\u1ec7n\n```python\nfrom newsfx import NewsFX\nrun = NewsFX('https://vnexpress.net/thoi-su/nguoi-dan-un-un-tro-lai-sai-gon-ha-noi-sau-ky-nghi-le-3917122.html')\nrun.parser()\nprint(run.get_title) # Ng\u01b0\u1eddi d\u00e2n \u00f9n \u00f9n tr\u1edf l\u1ea1i S\u00e0i G\u00f2n, H\u00e0 N\u1ed9i sau k\u1ef3 ngh\u1ec9 l\u1ec5\n```\n\n### l\u1ea5y h\u00ecnh \n\n```python \n#l\u1ea5y link c\u1ee7a h\u00ecnh \nprint(run.get_top_image_link) #https://link_dan_toi_file.jpg\n\n# save h\u00ecnh \nrun.save_top_image_link(name='ten_file_anh.jpg')\n```\n\n## Trang tin h\u1ed7 tr\u1ee3\n\n| news site | title | published_date | summary | content | author | top_image |\n|--------------------|:-----:|----------------|---------|---------|--------|-----------|\n| VnExpress | \u2714\ufe0f |\u2714\ufe0f |\u2714\ufe0f |\u2714\ufe0f |\u2714\ufe0f | \u2714\ufe0f |\n| Tu\u00f4\u0309i Tre\u0309 Online | \u2714\ufe0f |\u2714\ufe0f |\u2714\ufe0f |\u2714\ufe0f |\u2714\ufe0f | \ufe0f\ufe0f\ufe0f\ufe0f\ufe0f\ufe0f\u2714\ufe0f |\n| Thanh Ni\u00ean | \u2714\ufe0f |\u2714\ufe0f |\u2714\ufe0f |\u2714\ufe0f |\u2714\ufe0f | \u2714\ufe0f |\n| Ti\u1ec1n Phong | | | | | | |\n| Lao \u0110\u1ed9ng | | | | | | |\n| B\u00e1o m\u1edbi | | | | | | |\n| Ng\u01b0\u1eddi Lao \u0110\u1ed9ng | | | | | | |\n| Nh\u00e2n D\u00e2n | | | | | | |\n| \u0110\u1eddi S\u1ed1ng Ph\u00e1p Lu\u1eadt | | | | | | |\n| Vietnamnet | | | | | | |\n| Zing News | | | | | | |\n| D\u00e2n Tr\u00ed | | | | | | |\n| Nh\u1ecbp S\u1ed1ng S\u1ed1 | | | | | | |\n| Tri Th\u1ee9c Tr\u1ebb | | | | | | |\n| Vietnam Plus | | | | | | |\n\n\n## TODO\n- [ ] T\u1ef1 \u0111\u1ed9ng nh\u1eadn d\u1ea1ng url \u0111\u1ea7u v\u00e0o\n- [ ] \u0110\u1ecbnh d\u1ea1ng k\u1ebft qu\u1ea3 tr\u1ea3 v\u1ec1 trong dictionary\n\n| T\u00ean \t| Ki\u1ec3u tr\u1ea3 v\u1ec1 \t| M\u00f4 t\u1ea3 \t| H\u1ed7 tr\u1ee3 \t|\n|------------\t|-------------\t|---------------------------------------\t|:------:\t|\n| title \t| string \t| Ti\u00eau \u0111\u1ec1 b\u00e0i vi\u1ebft \t| \u2714\ufe0f \t|\n| html \t| string \t| Code html b\u00e0i vi\u1ebft \t| \u2714\ufe0f \t|\n| text \t| string \t| N\u1ed9i dung b\u00e0i vi\u1ebft ch\u01b0a \u0111\u01b0\u1ee3c x\u1eed l\u00fd \t| \u2714\ufe0f \t|\n| clean_text \t| string \t| N\u1ed9i dung b\u00e0i vi\u1ebft \u0111\u00e3 \u0111\u01b0\u1ee3c x\u1eed l\u00fd \t| \t|\n| author \t| list \t| T\u00e1c gi\u1ea3 b\u00e0i vi\u1ebft \t| \u2714\ufe0f \t|\n| published \t| date \t| Ng\u00e0y \u0111\u0103ng b\u00e0i vi\u1ebft \t| \u2714\ufe0f \t|\n| top_image \t| string \t| H\u00ecnh \u1ea3nh \u0111\u1eb7c tr\u01b0ng c\u1ee7a b\u00e0i vi\u1ebft \t| \u2714\ufe0f \t|\n| images \t| list \t| Danh s\u00e1ch h\u00ecnh \u1ea3nh c\u00f3 trong b\u00e0i vi\u1ebft \t| \u2714\ufe0f \t|\n| keywords \t| list \t| T\u1eeb kh\u00f3a b\u00e0i vi\u1ebft (c\u00f3 s\u1eb5n t\u1eeb b\u00e0i vi\u1ebft) \t| \t|\n\n\n", "description_content_type": "text/markdown", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/hyperfxtech/newsfx", "keywords": "", "license": "", "maintainer": "", "maintainer_email": "", "name": "newsfx", "package_url": "https://pypi.org/project/newsfx/", "platform": "", "project_url": "https://pypi.org/project/newsfx/", "project_urls": { "Homepage": "https://github.com/hyperfxtech/newsfx" }, "release_url": "https://pypi.org/project/newsfx/0.0.9/", "requires_dist": null, "requires_python": "", "summary": "Scraper news article in Viet Nam", "version": "0.0.9" }, "last_serial": 5346120, "releases": { "0.0.1": [ { "comment_text": "", "digests": { "md5": "bb04294712eed085fa4aef95119f94c1", "sha256": "6dd0376030464bb51f1c9303fbf7eed3f734a360e4f84e963492053c9570da6d" }, "downloads": -1, "filename": "newsfx-0.0.1-py3-none-any.whl", "has_sig": false, "md5_digest": "bb04294712eed085fa4aef95119f94c1", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 2021, "upload_time": "2019-05-07T13:51:36", "url": "https://files.pythonhosted.org/packages/0f/b2/7799d32641b39987e319b4cb6263d7283c091e34f8b05325badf485be7a2/newsfx-0.0.1-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "9417e89cfb35447fc155895c92e3206a", "sha256": "e40e166e66583904bbdc3417f9b069bcaac8c210cc4d2dab31e3417c186eca7f" }, "downloads": -1, "filename": "newsfx-0.0.1.tar.gz", "has_sig": false, "md5_digest": "9417e89cfb35447fc155895c92e3206a", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 2068, "upload_time": "2019-05-07T13:51:38", "url": "https://files.pythonhosted.org/packages/3b/e4/57f7c5736d71be787a3d00a011a66343bb8a4e3bfbc8c84be17fe54e0f01/newsfx-0.0.1.tar.gz" } ], "0.0.2": [ { "comment_text": "", "digests": { "md5": "ecd6954059e87ba09e9a51b11fb4a92b", "sha256": "a54ca6e67924c452735685ebf1a8cd6f7d85baf9a07996fef947eeddbc09b05e" }, "downloads": -1, "filename": "newsfx-0.0.2-py3-none-any.whl", "has_sig": false, "md5_digest": "ecd6954059e87ba09e9a51b11fb4a92b", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 2720, "upload_time": "2019-05-07T14:11:21", "url": "https://files.pythonhosted.org/packages/fd/9f/a5fcbf44e1b1867f6ff14983afa56a5ba45400c5738d5d8d5a9f6c842a08/newsfx-0.0.2-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "07fdc805cc3c62ea986379b4724cf632", "sha256": "c7a09e171f7ca15122e60108543a43f011c3622b87ec4da356d5ba778bf53843" }, "downloads": -1, "filename": "newsfx-0.0.2.tar.gz", "has_sig": false, "md5_digest": "07fdc805cc3c62ea986379b4724cf632", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 2718, "upload_time": "2019-05-07T14:11:22", "url": "https://files.pythonhosted.org/packages/b9/8f/ed0d43dbed9b3ded538367330c1f92a5590c4b450cbdfa9f5779d7af774e/newsfx-0.0.2.tar.gz" } ], "0.0.3": [ { "comment_text": "", "digests": { "md5": "c753ae78ee0f63807621289c5b43c434", "sha256": "18ecf0236bceefa7a4d8b53ad6b6860d97323e262b8d4ba10a1807503a6588bd" }, "downloads": -1, "filename": "newsfx-0.0.3.tar.gz", "has_sig": false, "md5_digest": "c753ae78ee0f63807621289c5b43c434", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 2163, "upload_time": "2019-05-17T06:02:54", "url": "https://files.pythonhosted.org/packages/18/61/e70e0f24d6459a71cc581f12c550817b00601936c6b7dcb0d5763bee37a6/newsfx-0.0.3.tar.gz" } ], "0.0.5": [ { "comment_text": "", "digests": { "md5": "839ea0b1b095a26f96ee625820734d8e", "sha256": "a2c33ab7fe81f7f3be91a2c3a664c44a9bbddfa493d7838d0110347e65b4e698" }, "downloads": -1, "filename": "newsfx-0.0.5.tar.gz", "has_sig": false, "md5_digest": "839ea0b1b095a26f96ee625820734d8e", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 2271, "upload_time": "2019-05-19T06:24:47", "url": "https://files.pythonhosted.org/packages/44/39/11d5c3552622eb8dd0a553ca50b2c6e6463292f185eac916c0b27a1a1527/newsfx-0.0.5.tar.gz" } ], "0.0.7": [ { "comment_text": "", "digests": { "md5": "3de1fe569088e14bfebb6da89201a113", "sha256": "4088e4c7285a4af03573258f33c345420e7a0713179a950179c3b9abd2e38d07" }, "downloads": -1, "filename": "newsfx-0.0.7.tar.gz", "has_sig": false, "md5_digest": "3de1fe569088e14bfebb6da89201a113", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 2283, "upload_time": "2019-05-19T06:27:36", "url": "https://files.pythonhosted.org/packages/ae/26/430671adc1bcd405c6f90ec565fd5e870afe0a3123ee7fcd67f4c850b350/newsfx-0.0.7.tar.gz" } ], "0.0.9": [ { "comment_text": "", "digests": { "md5": "af8d9cc3783a0f527567475ade778481", "sha256": "7dbfa0334d11c68853f114c31c284d6e96a568fb72cbd1e06e78d32d023ee543" }, "downloads": -1, "filename": "newsfx-0.0.9-py3-none-any.whl", "has_sig": false, "md5_digest": "af8d9cc3783a0f527567475ade778481", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 2216, "upload_time": "2019-06-01T10:30:38", "url": "https://files.pythonhosted.org/packages/12/c4/21646015f048e2e863c3cc91ee569c0c2e3c7325a24e12b7862c27b6e615/newsfx-0.0.9-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "4c6b3dae39ae41a180e4e5c453130eec", "sha256": "d89170cb563b07df6e36a1813a8295bf26fc7437fb76f32eda03867082adadba" }, "downloads": -1, "filename": "newsfx-0.0.9.tar.gz", "has_sig": false, "md5_digest": "4c6b3dae39ae41a180e4e5c453130eec", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 2288, "upload_time": "2019-06-01T10:30:42", "url": "https://files.pythonhosted.org/packages/a7/44/9d846f8acd79ead587970ce63889f3342c74dcd483c3e7789cc7fdad3cdd/newsfx-0.0.9.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "af8d9cc3783a0f527567475ade778481", "sha256": "7dbfa0334d11c68853f114c31c284d6e96a568fb72cbd1e06e78d32d023ee543" }, "downloads": -1, "filename": "newsfx-0.0.9-py3-none-any.whl", "has_sig": false, "md5_digest": "af8d9cc3783a0f527567475ade778481", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 2216, "upload_time": "2019-06-01T10:30:38", "url": "https://files.pythonhosted.org/packages/12/c4/21646015f048e2e863c3cc91ee569c0c2e3c7325a24e12b7862c27b6e615/newsfx-0.0.9-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "4c6b3dae39ae41a180e4e5c453130eec", "sha256": "d89170cb563b07df6e36a1813a8295bf26fc7437fb76f32eda03867082adadba" }, "downloads": -1, "filename": "newsfx-0.0.9.tar.gz", "has_sig": false, "md5_digest": "4c6b3dae39ae41a180e4e5c453130eec", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 2288, "upload_time": "2019-06-01T10:30:42", "url": "https://files.pythonhosted.org/packages/a7/44/9d846f8acd79ead587970ce63889f3342c74dcd483c3e7789cc7fdad3cdd/newsfx-0.0.9.tar.gz" } ] }