{ "info": { "author": "Frank Sadan Naranjo Noda, Hian Ca\u00f1izares D\u00edas", "author_email": "fsadannn@gmail.com, hiancdtrsnm@gmail.com", "bugtrack_url": null, "classifiers": [ "Development Status :: 5 - Production/Stable", "License :: OSI Approved :: MIT License", "Programming Language :: Python :: 3.6", "Topic :: Software Development :: Libraries :: Python Modules" ], "description": "\"PyPI \"PyPI \"PyPI\" \"Travis \"Codecov\"\n# CubaCrawler\n\nEsta biblioteca apunta a obtener informaci\u00f3n de los sitios de\nnoticias cubanas (Ahora mismo solo funcionan [Cubadebate](http://www.cubadebate.cu/) y\n[Granma](http://www.granma.cu/)).\n\n## Como se usa\n\n```python\nfrom CubaCrawler import Crawler\n\ndata = Crawler()\n\ndata.request(\"http://www.cubadebate.cu/noticias/2018/09/26/fundada-la-primera-empresa-biotecnologica-cubano-estadounidense-innovative-immunotherapy-alliance-sa/#.W6uvSBQpDeM\")\nprint(data.text)\nprint(data.comment)\n```\n\nEn caso de tener que pasar por un proxy se puede usar de la siguiente manera\n\n```python\nfrom CubaCrawler import Crawler\n\nconfig = {\n \"proxy\": \"http://user:password@proxy.host:port\"\n }\ndata = Crawler(config)\n\ndata.request(\"http://www.cubadebate.cu/noticias/2018/09/26/fundada-la-primera-empresa-biotecnologica-cubano-estadounidense-innovative-immunotherapy-alliance-sa/#.W6uvSBQpDeM\")\nprint(data.text)\nprint(data.comment)\n```\n\nO, se pude llamar explicitamente a un Scrapper\n\n```python\nfrom CubaCrawler import Cubadebate\n\nurl = \"http://www.cubadebate.cu/noticias/2018/09/26/fundada-la-primera-empresa-biotecnologica-cubano-estadounidense-innovative-immunotherapy-alliance-sa/#.W6uvSBQpDeM\"\ncubadebate = Cubadebate(url)\n\nprint(cubadebate.text)\nprint(cubadebate.comment)\n```\n\nEsta biblioteca es desarrollada por GIA (Grupo de Inteligencia Artificial), cualquier contribuci\u00f3n o referencia es agradecida.\n\nthanks,\n\nFrank Sadan Naranjo Noda \n\nHian Ca\u00f1izares D\u00edaz \n\n", "description_content_type": "text/markdown", "docs_url": null, "download_url": "https://github.com/fsadannn/CubaCrawler/tarball/1.0.0", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/fsadannn/CubaCrawler", "keywords": "", "license": "MIT", "maintainer": "", "maintainer_email": "", "name": "CubaCrawler", "package_url": "https://pypi.org/project/CubaCrawler/", "platform": "", "project_url": "https://pypi.org/project/CubaCrawler/", "project_urls": { "Download": "https://github.com/fsadannn/CubaCrawler/tarball/1.0.0", "Homepage": "https://github.com/fsadannn/CubaCrawler" }, "release_url": "https://pypi.org/project/CubaCrawler/1.0.0/", "requires_dist": [ "toml", "beautifulsoup4", "bs4", "lxml", "requests" ], "requires_python": "", "summary": "This library aims to obtain information from the sites of Cuban news.", "version": "1.0.0", "yanked": false, "yanked_reason": null }, "last_serial": 6205174, "releases": { "1.0.0": [ { "comment_text": "", "digests": { "md5": "b7330724b909397cd9ceeae7fb434833", "sha256": "ebbfc7d6c263d8609f2a4f53fb63f0ad0ba21fbe724e4e8ffddfb80dd7b9e473" }, "downloads": -1, "filename": "CubaCrawler-1.0.0-py3-none-any.whl", "has_sig": false, "md5_digest": "b7330724b909397cd9ceeae7fb434833", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 8415, "upload_time": "2019-10-25T03:25:13", "upload_time_iso_8601": "2019-10-25T03:25:13.511111Z", "url": "https://files.pythonhosted.org/packages/1b/5e/587bedd6f0826c0a385c408381a9effe9b1f28f6e83220f8cb061814d9e3/CubaCrawler-1.0.0-py3-none-any.whl", "yanked": false, "yanked_reason": null }, { "comment_text": "", "digests": { "md5": "cbb44d2a4136cbd4def583ef2e2e45e2", "sha256": "b5c46ca3c717c259ae20f8281fe935b65fe1ad697d4b285757b2e67f97a79fa4" }, "downloads": -1, "filename": "CubaCrawler-1.0.0.tar.gz", "has_sig": false, "md5_digest": "cbb44d2a4136cbd4def583ef2e2e45e2", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5899, "upload_time": "2019-10-25T03:25:16", "upload_time_iso_8601": "2019-10-25T03:25:16.912823Z", "url": "https://files.pythonhosted.org/packages/50/ef/6c3f3359f1f7e164ed921e948447d6d5d2064caad19400a4f18747c1581a/CubaCrawler-1.0.0.tar.gz", "yanked": false, "yanked_reason": null } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "b7330724b909397cd9ceeae7fb434833", "sha256": "ebbfc7d6c263d8609f2a4f53fb63f0ad0ba21fbe724e4e8ffddfb80dd7b9e473" }, "downloads": -1, "filename": "CubaCrawler-1.0.0-py3-none-any.whl", "has_sig": false, "md5_digest": "b7330724b909397cd9ceeae7fb434833", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 8415, "upload_time": "2019-10-25T03:25:13", "upload_time_iso_8601": "2019-10-25T03:25:13.511111Z", "url": "https://files.pythonhosted.org/packages/1b/5e/587bedd6f0826c0a385c408381a9effe9b1f28f6e83220f8cb061814d9e3/CubaCrawler-1.0.0-py3-none-any.whl", "yanked": false, "yanked_reason": null }, { "comment_text": "", "digests": { "md5": "cbb44d2a4136cbd4def583ef2e2e45e2", "sha256": "b5c46ca3c717c259ae20f8281fe935b65fe1ad697d4b285757b2e67f97a79fa4" }, "downloads": -1, "filename": "CubaCrawler-1.0.0.tar.gz", "has_sig": false, "md5_digest": "cbb44d2a4136cbd4def583ef2e2e45e2", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5899, "upload_time": "2019-10-25T03:25:16", "upload_time_iso_8601": "2019-10-25T03:25:16.912823Z", "url": "https://files.pythonhosted.org/packages/50/ef/6c3f3359f1f7e164ed921e948447d6d5d2064caad19400a4f18747c1581a/CubaCrawler-1.0.0.tar.gz", "yanked": false, "yanked_reason": null } ], "vulnerabilities": [] }