{ "info": { "author": "", "author_email": "", "bugtrack_url": null, "classifiers": [ "Development Status :: 2 - Pre-Alpha", "Intended Audience :: Developers", "Natural Language :: English", "Programming Language :: Python :: 3", "Programming Language :: Python :: 3.3", "Programming Language :: Python :: 3.4", "Programming Language :: Python :: 3.5" ], "description": "========================================\nYet Another Keyword Extractor (Yake)\n========================================\n\nATTENTION\n-------------\n\nTHIS VERSION IS DEPRECATED AND NO LONGER MAINTAINED. \n\nNEW REPOSITORY\n-------------\n\nThe code has been moved to https://github.com/LIAAD/yake.\nPlease use the code available at our repository at github.\n\nINSTALL \n-------------\n\n\tpip install git+https://github.com/LIAAD/yake\n\n\nDEPRECATED VERSION\n------------------\n\nUnsupervised Approach for Automatic Keyword Extraction using Text Features\n\n* Documentation: https://pypi.python.org/pypi/yake.\n\nMain Features\n-------------\n\n* Unsupervised approach\n* Multi-Language Support \n* Single document\n\nRationale\n-------------\n\nExtracting keywords from texts has become a challenge for individuals and organizations as the information grows in complexity and size. The need to automate this task so that texts can be processed in a timely and adequate manner has led to the emergence of automatic keyword extraction tools. Despite the advances, there is a clear lack of multilingual online tools to automatically extract keywords from single documents. Yake! is a novel feature-based system for multi-lingual keyword extraction, which supports texts of different sizes, domain or languages. Unlike other approaches, Yake! does not rely on dictionaries nor thesauri, neither is trained against any corpora. Instead, it follows an unsupervised approach which builds upon features extracted from the text, making it thus applicable to documents written in different languages without the need for further knowledge. This can be beneficial for a large number of tasks and a plethora of situations where the access to training corpora is either limited or restricted.\n\n\nPlease cite the following works when using YAKE\n------------\n\nCampos, R., Mangaravite, V., Pasquali, A., Jorge, A., Nunes, C., & Jatowt, A. (2018).\nA Text Feature Based Automatic Keyword Extraction Method for Single Documents\nProceedings of the 40th European Conference on Information Retrieval (ECIR'18), Grenoble, France. March 26 \u2013 29.\n\nCampos, R., Mangaravite, V., Pasquali, A., Jorge, A., Nunes, C., & Jatowt, A. (2018).\nYAKE! Collection-independent Automatic Keyword Extractor\nProceedings of the 40th European Conference on Information Retrieval (ECIR'18), Grenoble, France. March 26 \u2013 29\n\n\nRequirements\n-------------\nPython3\n\n\nInstallation\n-------------\n\nTo install Yake on your terminal ::\n\n\tpip install yake\n\nTo upgrade using pip::\n\n\tpip install yake \u2013upgrade\n\nUsage\n---------\n\nHow to use it on your favorite command line::\n\n\tyake --input_file [text file] --language en --ngram_size 3\n\n\nHow to use it on Python::\n\n\timport yake\n\n\ttext_content = \"\"\"\n\t\tSources tell us that Google is acquiring Kaggle, a platform that hosts data science and machine learning\n\t\tcompetitions. Details about the transaction remain somewhat vague , but given that Google is hosting\n\t\tits Cloud Next conference in San Francisco this week, the official announcement could come as early\n\t\tas tomorrow. Reached by phone, Kaggle co-founder CEO Anthony Goldbloom declined to deny that the\n\t\tacquisition is happening. Google itself declined 'to comment on rumors'.\n\t\"\"\"\n\n\t# assuming default parameters\n\tsimple_kwextractor = yake.KeywordExtractor()\n\tkeywords = simple_kwextractor.extract_keywords(text_content)\n\n\tfor kw in keywords:\n\t\tprint(kw)\n\n\t# specifying parameters\n\tcustom_kwextractor = yake.KeywordExtractor(lan=\"en\", n=3, dedupLim=0.8, windowsSize=2, top=20)\n\tkeywords = custom_kwextractor.extract_keywords(text_content)\n\n\tfor kw in keywords:\n\t\tprint(kw)\n\n\nUpload new version to pip \n-----\n\nRun::\n\n\t> make dist\n\t> python setup.py sdist upload -r https://upload.pypi.org/legacy/\n\nSpecify credentials at ~/.pypirc::\n\n\t[distutils]\n\tindex-servers =\n\t pypi\n\n\t[pypi]\n\trepository=https://upload.pypi.org/legacy/\n\tusername=\n\tpassword=", "description_content_type": "", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://pypi.python.org/pypi/yake", "keywords": "yake", "license": "", "maintainer": "", "maintainer_email": "", "name": "yake", "package_url": "https://pypi.org/project/yake/", "platform": "", "project_url": "https://pypi.org/project/yake/", "project_urls": { "Homepage": "https://pypi.python.org/pypi/yake" }, "release_url": "https://pypi.org/project/yake/0.3.7/", "requires_dist": null, "requires_python": "", "summary": "Keyword extraction Python package", "version": "0.3.7" }, "last_serial": 4494946, "releases": { "0.2.0": [ { "comment_text": "", "digests": { "md5": "1e9c8cbfd9e4f8179360d318759cfd59", "sha256": "7f1d4bde24808b4b7d9346827b6cfe224f1852c76e877f33722a6471321305d3" }, "downloads": -1, "filename": "yake-0.2.0.tar.gz", "has_sig": false, "md5_digest": "1e9c8cbfd9e4f8179360d318759cfd59", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 61493, "upload_time": "2017-10-25T16:55:47", "url": "https://files.pythonhosted.org/packages/e8/68/c94d377fa0f2ea030f4efe3e26b466daa314e64f8ff30282573b3ba43212/yake-0.2.0.tar.gz" } ], "0.2.1": [ { "comment_text": "", "digests": { "md5": "85e18a2bb17562f44ebe509e586dbbfd", "sha256": "dc40fa86e5eed9d77624b923b07308f0b2ab3bee3d11466dbb96508c431e584c" }, "downloads": -1, "filename": "yake-0.2.1.tar.gz", "has_sig": false, "md5_digest": "85e18a2bb17562f44ebe509e586dbbfd", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 61116, "upload_time": "2017-10-29T21:10:48", "url": "https://files.pythonhosted.org/packages/b8/80/034ae6764a19d5bc5369df62bb310e8cfaf3fe81a0e2c30b509a8657409f/yake-0.2.1.tar.gz" } ], "0.2.2": [ { "comment_text": "", "digests": { "md5": "3339c62be53ad7f232dd4b11d51d479d", "sha256": "a326a301ae0cf958ac0af897b5e6a0ace87b9b218adf13a8bd030b8309c5ed21" }, "downloads": -1, "filename": "yake-0.2.2.tar.gz", "has_sig": false, "md5_digest": "3339c62be53ad7f232dd4b11d51d479d", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 61102, "upload_time": "2017-10-30T17:56:18", "url": "https://files.pythonhosted.org/packages/0c/54/5aa746a8d8743813788e28e2f4169a6d50852ecd8a96e28970da89d073f3/yake-0.2.2.tar.gz" } ], "0.2.4": [ { "comment_text": "", "digests": { "md5": "30b377e74610540a6a70c4cebc9cf622", "sha256": "ed13035e7791a4b7ff6f896619f70277826ad8ba6298964ddabb7273e23f3911" }, "downloads": -1, "filename": "yake-0.2.4.tar.gz", "has_sig": false, "md5_digest": "30b377e74610540a6a70c4cebc9cf622", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 52121, "upload_time": "2017-11-27T16:52:42", "url": "https://files.pythonhosted.org/packages/12/3c/7b42fc806789999f419091285c7ddafbdb40a4b0fbfb781107d47e549664/yake-0.2.4.tar.gz" } ], "0.3.0": [ { "comment_text": "", "digests": { "md5": "367d461ca2ecbde6e728ae93687f138b", "sha256": "6f282647c5cc2042dea039cbc47dcfb23514074098e0f986f89a7e66a1d2fec3" }, "downloads": -1, "filename": "yake-0.3.0.tar.gz", "has_sig": false, "md5_digest": "367d461ca2ecbde6e728ae93687f138b", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 52995, "upload_time": "2018-01-13T13:20:58", "url": "https://files.pythonhosted.org/packages/5f/bb/86a6c535747f3954ec27ad55b83de25541da837afc19446e994452a0b024/yake-0.3.0.tar.gz" } ], "0.3.2": [ { "comment_text": "", "digests": { "md5": "42fa98e14532ff20135a92d0b6638b61", "sha256": "c84911d89f8dedd1533066cb05096b0ac96d4b36c2f71ba5afec4fdb036c53b3" }, "downloads": -1, "filename": "yake-0.3.2.tar.gz", "has_sig": false, "md5_digest": "42fa98e14532ff20135a92d0b6638b61", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 54522, "upload_time": "2018-03-28T14:23:30", "url": "https://files.pythonhosted.org/packages/96/d9/bb268e57ffe66540a747b346b4302a4ff2874fab3535cf1d49e569db61a1/yake-0.3.2.tar.gz" } ], "0.3.7": [ { "comment_text": "", "digests": { "md5": "b706a18cf8934273e5b18f4466bab4be", "sha256": "b680b36db554ec6585339d025f79756c0bd2b8896f07b1de96fab1e12470aca9" }, "downloads": -1, "filename": "yake-0.3.7.tar.gz", "has_sig": false, "md5_digest": "b706a18cf8934273e5b18f4466bab4be", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 53652, "upload_time": "2018-11-16T20:18:57", "url": "https://files.pythonhosted.org/packages/b6/3c/ed20047202793a08786978be68ec44c7ffbcb814ed05b985b38037f31a56/yake-0.3.7.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "b706a18cf8934273e5b18f4466bab4be", "sha256": "b680b36db554ec6585339d025f79756c0bd2b8896f07b1de96fab1e12470aca9" }, "downloads": -1, "filename": "yake-0.3.7.tar.gz", "has_sig": false, "md5_digest": "b706a18cf8934273e5b18f4466bab4be", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 53652, "upload_time": "2018-11-16T20:18:57", "url": "https://files.pythonhosted.org/packages/b6/3c/ed20047202793a08786978be68ec44c7ffbcb814ed05b985b38037f31a56/yake-0.3.7.tar.gz" } ] }