{ "info": { "author": "Victor Zhong", "author_email": "victor@victorzhong.com", "bugtrack_url": null, "classifiers": [ "Development Status :: 3 - Alpha", "Intended Audience :: Developers", "License :: OSI Approved :: MIT License", "Programming Language :: Python :: 3.5" ], "description": "Embeddings\n==========\n\n.. image:: https://readthedocs.org/projects/embeddings/badge/?version=latest\n :target: http://embeddings.readthedocs.io/en/latest/?badge=latest\n :alt: Documentation Status\n.. image:: https://travis-ci.org/vzhong/embeddings.svg?branch=master\n :target: https://travis-ci.org/vzhong/embeddings\n\nEmbeddings is a python package that provides pretrained word embeddings for natural language processing and machine learning.\n\nInstead of loading a large file to query for embeddings, ``embeddings`` is backed by a database and fast to load and query:\n\n.. code-block:: python\n\n >>> %timeit GloveEmbedding('common_crawl_840', d_emb=300)\n 100 loops, best of 3: 12.7 ms per loop\n\n >>> %timeit GloveEmbedding('common_crawl_840', d_emb=300).emb('canada')\n 100 loops, best of 3: 12.9 ms per loop\n\n >>> g = GloveEmbedding('common_crawl_840', d_emb=300)\n\n >>> %timeit -n1 g.emb('canada')\n 1 loop, best of 3: 38.2 \u00b5s per loop\n\n\nInstallation\n------------\n\n.. code-block:: sh\n\n pip install embeddings # from pypi\n pip install git+https://github.com/vzhong/embeddings.git # from github\n\n\nUsage\n-----\n\nUpon first use, the embeddings are first downloaded to disk in the form of a SQLite database.\nThis may take a long time for large embeddings such as GloVe.\nFurther usage of the embeddings are directly queried against the database.\nEmbedding databases are stored in the ``$EMBEDDINGS_ROOT`` directory (defaults to ``~/.embeddings``). Note that this location is probably **undesirable** if your home directory is on NFS, as it would slow down database queries significantly.\n\n\n.. code-block:: python\n\n from embeddings import GloveEmbedding, FastTextEmbedding, KazumaCharEmbedding, ConcatEmbedding\n\n g = GloveEmbedding('common_crawl_840', d_emb=300, show_progress=True)\n f = FastTextEmbedding()\n k = KazumaCharEmbedding()\n c = ConcatEmbedding([g, f, k])\n for w in ['canada', 'vancouver', 'toronto']:\n print('embedding {}'.format(w))\n print(g.emb(w))\n print(f.emb(w))\n print(k.emb(w))\n print(c.emb(w))\n\n\nDocker\n------\n\nIf you use Docker, an image prepopulated with the Common Crawl 840 GloVe embeddings and Kazuma Hashimoto's character ngram embeddings is available at `vzhong/embeddings `_.\nTo mount volumes from this container, set ``$EMBEDDINGS_ROOT`` in your container to ``/opt/embeddings``.\n\nFor example:\n\n.. code-block:: bash\n\n docker run --volumes-from vzhong/embeddings -e EMBEDDINGS_ROOT='/opt/embeddings' myimage python train.py\n\n\nContribution\n------------\n\nPull requests welcome!\n\n\n", "description_content_type": "", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/vzhong/embeddings", "keywords": "text nlp machine-learning", "license": "MIT", "maintainer": "", "maintainer_email": "", "name": "embeddings", "package_url": "https://pypi.org/project/embeddings/", "platform": "", "project_url": "https://pypi.org/project/embeddings/", "project_urls": { "Homepage": "https://github.com/vzhong/embeddings" }, "release_url": "https://pypi.org/project/embeddings/0.0.7/", "requires_dist": [ "tqdm", "requests", "numpy", "check-manifest ; extra == 'dev'", "sphinx ; extra == 'dev'", "sphinx-rtd-theme ; extra == 'dev'", "coverage ; extra == 'test'", "nose ; extra == 'test'" ], "requires_python": "", "summary": "Pretrained word embeddings in Python.", "version": "0.0.7" }, "last_serial": 5500007, "releases": { "0.0.2": [ { "comment_text": "", "digests": { "md5": "77993029367f7e5c0e0e7425cf3c659a", "sha256": "3f206103e7cab4791f68fe4a519d7763eaff8671da2cd6b4a17fb6b08089cdc1" }, "downloads": -1, "filename": "embeddings-0.0.2.tar.gz", "has_sig": false, "md5_digest": "77993029367f7e5c0e0e7425cf3c659a", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 6824, "upload_time": "2017-05-21T22:13:58", "url": "https://files.pythonhosted.org/packages/ef/67/751c22d7cfc7010a58a2abb7a64309585c39d0e8467be76c90711f24d3ba/embeddings-0.0.2.tar.gz" } ], "0.0.3": [ { "comment_text": "", "digests": { "md5": "0f7cab1050cd02fcf7348edefd04c554", "sha256": "8562b87b3918d041711fc3cc25a2b0224d75f23f5851c79eda8949f20b2ebe46" }, "downloads": -1, "filename": "embeddings-0.0.3-py3.5.egg", "has_sig": false, "md5_digest": "0f7cab1050cd02fcf7348edefd04c554", "packagetype": "bdist_egg", "python_version": "3.5", "requires_python": null, "size": 20510, "upload_time": "2018-08-25T22:49:35", "url": "https://files.pythonhosted.org/packages/cd/0c/55cf315cadfb0c5aaee98cd6f8ada4d9522f8b2219fb1727a7afe5fbfd11/embeddings-0.0.3-py3.5.egg" }, { "comment_text": "", "digests": { "md5": "601916be116f09ca9cad9e3b3cc8c9a3", "sha256": "f213e01886afe8a1442e0012f03db7efb9dfcb8bb40d319beb1e95e80e14be42" }, "downloads": -1, "filename": "embeddings-0.0.3.tar.gz", "has_sig": false, "md5_digest": "601916be116f09ca9cad9e3b3cc8c9a3", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5997, "upload_time": "2017-05-25T19:55:10", "url": "https://files.pythonhosted.org/packages/0f/49/0a8315eef4fb95b25f3622d8e82d441c8e8fb6261fa5a825068b787eea8d/embeddings-0.0.3.tar.gz" } ], "0.0.4": [ { "comment_text": "", "digests": { "md5": "c86156b776e3eeb27db776e922f42885", "sha256": "19a1b82e99ae3d5a99218f3d0fb456d6ac8d028bcaafe169fd698b2de4d7f558" }, "downloads": -1, "filename": "embeddings-0.0.4.tar.gz", "has_sig": false, "md5_digest": "c86156b776e3eeb27db776e922f42885", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 7252, "upload_time": "2017-06-17T15:47:58", "url": "https://files.pythonhosted.org/packages/fa/44/5b29d2fd2bd00c37650e6929ed7429420beb4afc737cec3e30f7fee658e3/embeddings-0.0.4.tar.gz" } ], "0.0.5": [ { "comment_text": "", "digests": { "md5": "0eb9355f691c8ef9fc93a2ca2c6a0379", "sha256": "7308de78c184f8c84342628c28c73cabf69342c3c3344d861886d779348ff3d6" }, "downloads": -1, "filename": "embeddings-0.0.5.tar.gz", "has_sig": false, "md5_digest": "0eb9355f691c8ef9fc93a2ca2c6a0379", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 7849, "upload_time": "2018-06-06T18:39:58", "url": "https://files.pythonhosted.org/packages/aa/b6/65df7fa36bd88ab4a5b75dfd83d6d1e533c44c7a44a0a0fc65f243521a79/embeddings-0.0.5.tar.gz" } ], "0.0.6": [ { "comment_text": "", "digests": { "md5": "c2c37217abdd0593d125e8fe786fbe6f", "sha256": "c5ff0a5d9619cee0fd4d56bbc4c600e0e73329527c74e163edb67823bbc78524" }, "downloads": -1, "filename": "embeddings-0.0.6.tar.gz", "has_sig": false, "md5_digest": "c2c37217abdd0593d125e8fe786fbe6f", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 9377, "upload_time": "2018-08-25T22:49:38", "url": "https://files.pythonhosted.org/packages/ea/89/850eede11ae8e0b7b644473910e7c780804092e01cb343c00eb1d60f9d4e/embeddings-0.0.6.tar.gz" } ], "0.0.7": [ { "comment_text": "", "digests": { "md5": "6e3ef404fd1671917860955a1eb5f3c5", "sha256": "9b7012d37937b3446f5dbcc6861e2c56c80a7646133b4e70993e207fc0f9461b" }, "downloads": -1, "filename": "embeddings-0.0.7-py3-none-any.whl", "has_sig": false, "md5_digest": "6e3ef404fd1671917860955a1eb5f3c5", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 11754, "upload_time": "2019-07-08T10:16:04", "url": "https://files.pythonhosted.org/packages/69/cf/b1fd9ea8062e444e6588eeba623d83d6d6a1920ce66c9bf669f218f1b733/embeddings-0.0.7-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "c79b6cd28cdcc8523aef3342b0f0e61d", "sha256": "51aadc05e47fc862b157b7ee0a6cb9e8c42c5e194986d035227b6dd3bddd7262" }, "downloads": -1, "filename": "embeddings-0.0.7.tar.gz", "has_sig": false, "md5_digest": "c79b6cd28cdcc8523aef3342b0f0e61d", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 8259, "upload_time": "2019-07-08T10:16:08", "url": "https://files.pythonhosted.org/packages/f6/cd/0d58701d13188c7c06031fa164dc65016e0e8489fa7e5d8418c3ffd3b78e/embeddings-0.0.7.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "6e3ef404fd1671917860955a1eb5f3c5", "sha256": "9b7012d37937b3446f5dbcc6861e2c56c80a7646133b4e70993e207fc0f9461b" }, "downloads": -1, "filename": "embeddings-0.0.7-py3-none-any.whl", "has_sig": false, "md5_digest": "6e3ef404fd1671917860955a1eb5f3c5", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 11754, "upload_time": "2019-07-08T10:16:04", "url": "https://files.pythonhosted.org/packages/69/cf/b1fd9ea8062e444e6588eeba623d83d6d6a1920ce66c9bf669f218f1b733/embeddings-0.0.7-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "c79b6cd28cdcc8523aef3342b0f0e61d", "sha256": "51aadc05e47fc862b157b7ee0a6cb9e8c42c5e194986d035227b6dd3bddd7262" }, "downloads": -1, "filename": "embeddings-0.0.7.tar.gz", "has_sig": false, "md5_digest": "c79b6cd28cdcc8523aef3342b0f0e61d", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 8259, "upload_time": "2019-07-08T10:16:08", "url": "https://files.pythonhosted.org/packages/f6/cd/0d58701d13188c7c06031fa164dc65016e0e8489fa7e5d8418c3ffd3b78e/embeddings-0.0.7.tar.gz" } ] }