{ "info": { "author": "Gary Lai", "author_email": "gary.lai@emory.edu", "bugtrack_url": null, "classifiers": [ "Development Status :: 3 - Alpha", "Intended Audience :: Developers", "Intended Audience :: Science/Research", "License :: OSI Approved :: MIT License", "Operating System :: MacOS", "Operating System :: Microsoft :: Windows", "Operating System :: POSIX", "Operating System :: Unix", "Programming Language :: Python :: 2.7", "Programming Language :: Python :: 3.4", "Programming Language :: Python :: 3.5", "Programming Language :: Python :: 3.6", "Topic :: Scientific/Engineering", "Topic :: Software Development" ], "description": "fastText\n========\n\n`fastText `__ is a library for efficient learning\nof word representations and sentence classification.\n\nRequirements\n------------\n\n`fastText `__ builds on modern Mac OS and Linux\ndistributions. Since it uses C++11 features, it requires a compiler with\ngood C++11 support. These include :\n\n- (gcc-4.8 or newer) or (clang-3.3 or newer)\n\nYou will need\n\n- `Python `__ version 2.7 or >=3.4\n- `NumPy `__ &\n `SciPy `__\n- `pybind11 `__\n\nBuilding fastText\n-----------------\n\nThe easiest way to get the latest version of `fastText is to use\npip `__.\n\n::\n\n $ pip install fasttext\n\nIf you want to use the latest unstable release you will need to build\nfrom source using setup.py.\n\nNow you can import this library with\n\n::\n\n import fastText\n\nExamples\n--------\n\nIn general it is assumed that the reader already has good knowledge of\nfastText. 
For this, consult the main\n`README `__\nand in particular `the tutorials on our\nwebsite `__.\n\nWe recommend you look at the `examples within the doc\nfolder `__.\n\nAs with any package you can get help on any Python function using the\nhelp function.\n\nFor example\n\n::\n\n >>> import fastText\n >>> help(fastText.FastText)\n\n Help on module fastText.FastText in fastText:\n\n NAME\n fastText.FastText\n\n DESCRIPTION\n # Copyright (c) 2017-present, Facebook, Inc.\n # All rights reserved.\n #\n # This source code is licensed under the BSD-style license found in the\n # LICENSE file in the root directory of this source tree. An additional grant\n # of patent rights can be found in the PATENTS file in the same directory.\n\n FUNCTIONS\n load_model(path)\n Load a model given a filepath and return a model object.\n\n tokenize(text)\n Given a string of text, tokenize it and return a list of tokens\n [...]\n\nIMPORTANT: Preprocessing data / encoding conventions\n-----------------------------------------------------\n\nIn general it is important to properly preprocess your data. In\nparticular our example scripts in the `root\nfolder `__ do this.\n\nfastText assumes UTF-8 encoded text. All text must be `unicode for\nPython2 `__\nand `str for\nPython3 `__.\nThe passed text will be `encoded as UTF-8 by\npybind11 `__\nbefore being passed to the fastText C++ library. This means it is important to\nuse UTF-8 encoded text when building a model. On Unix-like systems you\ncan convert text using `iconv `__.\n\nfastText will tokenize (split text into pieces) based on the following\nASCII characters (bytes). In particular, it is not aware of UTF-8\nwhitespace. We advise the user to convert UTF-8 whitespace / word\nboundaries into one of the following symbols as appropriate.\n\n- space\n- tab\n- vertical tab\n- carriage return\n- formfeed\n- the null character\n\nThe newline character is used to delimit lines of text. 
In particular,\nthe EOS token is appended to a line of text if a newline character is\nencountered. The only exception is if the number of tokens exceeds the\nMAX\\_LINE\\_SIZE constant as defined in the `Dictionary\nheader `__.\nThis means if you have text that is not separated by newlines, such as\nthe `fil9 dataset `__, it will be\nbroken into chunks of MAX\\_LINE\\_SIZE tokens and the EOS token is\nnot appended.\n\nThe length of a token is the number of UTF-8 characters, counted by considering\nthe `leading two bits of a\nbyte `__ to identify\n`subsequent bytes of a multi-byte\nsequence `__.\nKnowing this is especially important when choosing the minimum and\nmaximum length of subwords. Further, the EOS token (as specified in the\n`Dictionary\nheader `__)\nis considered a character and will not be broken into subwords.", "description_content_type": "", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/elitcloud/yafasttext", "keywords": "", "license": "BSD", "maintainer": "", "maintainer_email": "", "name": "yafasttext", "package_url": "https://pypi.org/project/yafasttext/", "platform": "", "project_url": "https://pypi.org/project/yafasttext/", "project_urls": { "Homepage": "https://github.com/elitcloud/yafasttext" }, "release_url": "https://pypi.org/project/yafasttext/0.8.22.2018120701/", "requires_dist": null, "requires_python": "", "summary": "yet another fastText Python bindings", "version": "0.8.22.2018120701" }, "last_serial": 4573218, "releases": { "0.8.22": [ { "comment_text": "", "digests": { "md5": "e0f04c2c6f9c41dc6432254623a6c478", "sha256": "0178bd4e6501cd96336291e267a4eeca37271fc3a7b86b95625e3836c4383ab5" }, "downloads": -1, "filename": "yafasttext-0.8.22.tar.gz", "has_sig": false, "md5_digest": "e0f04c2c6f9c41dc6432254623a6c478", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 41884, "upload_time": "2018-05-18T16:44:48", "url": 
"https://files.pythonhosted.org/packages/7f/e9/9c646a06c5b3c81c2eb18bd3edbea404d261008b0434d72ac5e9e1cfe9b6/yafasttext-0.8.22.tar.gz" } ], "0.8.22.1": [ { "comment_text": "", "digests": { "md5": "a4a11bb61f0c46604e00db5db223d3ed", "sha256": "ab5db379562d9a6b25e118d427b5067a0b3d1bd39013ba482f7761568807552d" }, "downloads": -1, "filename": "yafasttext-0.8.22.1.tar.gz", "has_sig": false, "md5_digest": "a4a11bb61f0c46604e00db5db223d3ed", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 41967, "upload_time": "2018-05-21T19:00:51", "url": "https://files.pythonhosted.org/packages/ff/fe/cbab278449780b446839763c9a1ce1b9c44968ba5c2b3c8659ad198fa673/yafasttext-0.8.22.1.tar.gz" } ], "0.8.22.2": [ { "comment_text": "", "digests": { "md5": "1d7b04c4e50bf62854173aee46fd56a7", "sha256": "d69b1e15ae07a18e2d8eff748bb50d91ad9145aae17541c0e018900061460b8b" }, "downloads": -1, "filename": "yafasttext-0.8.22.2.tar.gz", "has_sig": false, "md5_digest": "1d7b04c4e50bf62854173aee46fd56a7", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 42157, "upload_time": "2018-09-12T18:43:54", "url": "https://files.pythonhosted.org/packages/3e/eb/15c16077fda3bd731da3a627181ae4d11c689a93d0093f6e30b6c14cab3c/yafasttext-0.8.22.2.tar.gz" } ], "0.8.22.2018120701": [ { "comment_text": "", "digests": { "md5": "2c5c6911d66e455472548f07c0456752", "sha256": "132c3105e206e8d76212e5df4015804f17cbe39ab4483b786ac57786bc181698" }, "downloads": -1, "filename": "yafasttext-0.8.22.2018120701.tar.gz", "has_sig": false, "md5_digest": "2c5c6911d66e455472548f07c0456752", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 44080, "upload_time": "2018-12-07T20:24:18", "url": "https://files.pythonhosted.org/packages/00/4a/ef4fa6d403ed6c50e3c3469a0b1521589129c2fe9f63de5d6d2748836814/yafasttext-0.8.22.2018120701.tar.gz" } ], "0.8.22.3": [ { "comment_text": "", "digests": { "md5": "ac044562f6cdccbd74750bb07b14fcd7", "sha256": 
"6591fc5c079a2697582ebf05f0306ca2c7a33c1487fbc0a8480acb5d037de5dc" }, "downloads": -1, "filename": "yafasttext-0.8.22.3.tar.gz", "has_sig": false, "md5_digest": "ac044562f6cdccbd74750bb07b14fcd7", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 41707, "upload_time": "2018-11-08T19:14:41", "url": "https://files.pythonhosted.org/packages/2e/1d/c26f8d4b71bbb791a800df46bf8159184c5d9fa6b38a1d6697f10c92d213/yafasttext-0.8.22.3.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "2c5c6911d66e455472548f07c0456752", "sha256": "132c3105e206e8d76212e5df4015804f17cbe39ab4483b786ac57786bc181698" }, "downloads": -1, "filename": "yafasttext-0.8.22.2018120701.tar.gz", "has_sig": false, "md5_digest": "2c5c6911d66e455472548f07c0456752", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 44080, "upload_time": "2018-12-07T20:24:18", "url": "https://files.pythonhosted.org/packages/00/4a/ef4fa6d403ed6c50e3c3469a0b1521589129c2fe9f63de5d6d2748836814/yafasttext-0.8.22.2018120701.tar.gz" } ] }