{ "info": { "author": "Maarten Grootendorst", "author_email": "maartengrootendorst@gmail.com", "bugtrack_url": null, "classifiers": [], "description": "# Vectors of Locally Aggregated Concepts (VLAC)\n\n[![PyPI - Status](https://img.shields.io/badge/status-beta-yellow.svg)]()\n[![PyPI - Python](https://img.shields.io/badge/python-3.4%20%7C%203.5%20%7C%203.6-blue.svg)]()\n[![PyPI - Python](https://img.shields.io/badge/license-MIT-green.svg)]()\n\n\n## Purpose\nAs illustrated in the Figure below, VLAC clusters word embeddings to create *k* concepts. Due to the high dimensionality of word embeddings (i.e., 300) spherical k-means is used to perform the clustering as applying euclidean distance will result in little difference in the distances between samples. The method works as follows. Let *wi* be a word embedding of size *D* assigned to cluster center *ck*. Then, for each word in a document, VLAC computes the element-wise sum of residuals of each word embedding to its assigned cluster center. This results in *k* feature vectors, one for each concept, and all of size *D*. All feature vectors are then concatenated, power normalized, and finally, l2 normalization is applied. For example, if 10 concepts were to be created out of word embeddings of size 300 then the resulting document vector would contain 10 x 300 values. \n\n\n\n## Usage\nTested in python 3.5.4. \n\n```python\n# Train model and transform collection of documents\nvlac_model = VLAC(documents=train_docs, model=model, oov=False)\nvlac_features, kmeans = vlac_model.fit_transform(num_concepts=30)\n\n# Create features new documents\nvlac_model = VLAC(documents=train_docs, model=model, oov=False)\ntest_features = vlac_model.transform(kmeans=kmeans)\n```\n\n\n\n", "description_content_type": "text/markdown", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/MaartenGr/VLAC", "keywords": "", "license": "MIT", "maintainer": "", "maintainer_email": "", "name": "vlac", "package_url": "https://pypi.org/project/vlac/", "platform": "", "project_url": "https://pypi.org/project/vlac/", "project_urls": { "Homepage": "https://github.com/MaartenGr/VLAC" }, "release_url": "https://pypi.org/project/vlac/0.1.2.5/", "requires_dist": [ "numpy", "sklearn" ], "requires_python": "", "summary": "Tool for creating document features", "version": "0.1.2.5" }, "last_serial": 5331819, "releases": { "0.1": [ { "comment_text": "", "digests": { "md5": "c7a8308589815a041cf7ca0d638e565e", "sha256": "7ebc38b685fce9a915bc21574dfaa596b04dd56c57b9a0798f4a54bcae98a082" }, "downloads": -1, "filename": "vlac-0.1-py3-none-any.whl", "has_sig": false, "md5_digest": "c7a8308589815a041cf7ca0d638e565e", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 4990, "upload_time": "2019-05-29T10:06:02", "url": "https://files.pythonhosted.org/packages/b9/31/d45bff6035a81cbabfb6ea61ea2a58912e38e6e3d26c5ea90e7afcf9aab7/vlac-0.1-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "d2d265f4fb17282042ac5e8f561dd91d", "sha256": "2a6056c03ab35c2682c789a01d3d40674b4c27d58dbf47fc2f809513923b48a8" }, "downloads": -1, "filename": "vlac-0.1.tar.gz", "has_sig": false, "md5_digest": "d2d265f4fb17282042ac5e8f561dd91d", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 3595, "upload_time": "2019-05-29T10:06:04", "url": "https://files.pythonhosted.org/packages/da/a2/725d11f05480b9db320c8e1845a20cb9c6e25fcf23eefefefc9cb67b657d/vlac-0.1.tar.gz" } ], "0.1.1": [ { "comment_text": "", "digests": { "md5": "6c62263251bb5aff52cdda70e327e80b", "sha256": "db4d1e8980c332c8b3ccd7a55b9e8a219481a2e5720537e3549d683edaac45ba" }, "downloads": -1, "filename": "vlac-0.1.1.tar.gz", "has_sig": false, "md5_digest": "6c62263251bb5aff52cdda70e327e80b", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 3613, "upload_time": "2019-05-29T10:36:24", "url": "https://files.pythonhosted.org/packages/02/c6/3741c78ac6b8918abb3ba681e3a27037f3e7f7ef288f0c5b85861972aceb/vlac-0.1.1.tar.gz" } ], "0.1.2": [ { "comment_text": "", "digests": { "md5": "6fe72ee090fa7b735723232e7528705c", "sha256": "d2134c43b31796b7a9a21ff9f3b77988d3d9fd1c085218b3564a669bf0527097" }, "downloads": -1, "filename": "vlac-0.1.2.0-py3-none-any.whl", "has_sig": false, "md5_digest": "6fe72ee090fa7b735723232e7528705c", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 7166, "upload_time": "2019-05-29T10:59:27", "url": "https://files.pythonhosted.org/packages/c8/4c/0f24ad62846e54ed5f3ce4a861f7b15601ab446f38f0496a0ab5be4747cf/vlac-0.1.2.0-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "ea78b887924e95c079975dc4609dc144", "sha256": "a8b4a47eacc45b851967329c4df256a3ab98b0e15a4f16d2728ef0f783c6acfd" }, "downloads": -1, "filename": "vlac-0.1.2-py3-none-any.whl", "has_sig": false, "md5_digest": "ea78b887924e95c079975dc4609dc144", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 5027, "upload_time": "2019-05-29T10:51:04", "url": "https://files.pythonhosted.org/packages/55/02/6a13948affafe7526f0fbea6c2f9dd6f37f05f5ca849d4f05b4a61b08058/vlac-0.1.2-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "dfa67e6c322a95b0190a8e704a8b7a48", "sha256": "17f7198a09b7761f907aa2827f42e1b294b211409e9676152117cf406194c165" }, "downloads": -1, "filename": "vlac-0.1.2.tar.gz", "has_sig": false, "md5_digest": "dfa67e6c322a95b0190a8e704a8b7a48", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 3608, "upload_time": "2019-05-29T10:51:06", "url": "https://files.pythonhosted.org/packages/0e/c7/479949dc7d3b916ce863f4d3a2a9f637d4bd1f19de1c0bda29cfc2a55dc5/vlac-0.1.2.tar.gz" } ], "0.1.2.1": [ { "comment_text": "", "digests": { "md5": "d550c8e5d8e7247e9254e7a6666bd4b0", "sha256": "fab82843d31306ad9c2b39f185fc6dd7defec66b0872263d814f75f3cf5f2e4d" }, "downloads": -1, "filename": "vlac-0.1.2.1-py3-none-any.whl", "has_sig": false, "md5_digest": "d550c8e5d8e7247e9254e7a6666bd4b0", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 7165, "upload_time": "2019-05-29T11:01:33", "url": "https://files.pythonhosted.org/packages/d5/ab/0f852d253de043a77f974fc0238f13eb0e9f4322d7d4af9d45f823a1f8de/vlac-0.1.2.1-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "a067cf5885d5d9e829a163e194ead0ee", "sha256": "4d737d0f067bf3460525129de4fa06519a5b9659e9f3d91e8d367eeb54d14bd9" }, "downloads": -1, "filename": "vlac-0.1.2.1.tar.gz", "has_sig": false, "md5_digest": "a067cf5885d5d9e829a163e194ead0ee", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 3611, "upload_time": "2019-05-29T11:01:35", "url": "https://files.pythonhosted.org/packages/b2/6f/c2309f27e3a5a233b1556bfaf27851a1d5dabe822afbbbd2489af463c411/vlac-0.1.2.1.tar.gz" } ], "0.1.2.2": [ { "comment_text": "", "digests": { "md5": "f1b02ff3c5e4ec7d1d847e5c9bc59979", "sha256": "90d7680770aed64e24e565f61259f2cad3d1ceb0adf87989bc66c8f4a4acd5dc" }, "downloads": -1, "filename": "vlac-0.1.2.2-py3-none-any.whl", "has_sig": false, "md5_digest": "f1b02ff3c5e4ec7d1d847e5c9bc59979", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 2712, "upload_time": "2019-05-29T11:07:08", "url": "https://files.pythonhosted.org/packages/26/8e/6d7ba77f5fdd12698d4780914e73feb86b019969e326d99ce49c34cd52b9/vlac-0.1.2.2-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "35e3ad9cebbe38390a1c2e4faee986ee", "sha256": "32263b5a6671aea9a55341d0a93e4e2c802eeab397f7623f70d3d000eb46fd5e" }, "downloads": -1, "filename": "vlac-0.1.2.2.tar.gz", "has_sig": false, "md5_digest": "35e3ad9cebbe38390a1c2e4faee986ee", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 3631, "upload_time": "2019-05-29T11:07:10", "url": "https://files.pythonhosted.org/packages/e7/35/fd56550b5bb0531c681728a1dc889ee7fc0740a5a61b31d39da12738e6f1/vlac-0.1.2.2.tar.gz" } ], "0.1.2.3": [ { "comment_text": "", "digests": { "md5": "c026009b135acd7603e392ee041941c7", "sha256": "5bda87229af7d21d29819f6b4484ca4025a9c3e1d0d42cd7b6a2822dcc75000b" }, "downloads": -1, "filename": "vlac-0.1.2.3-py3-none-any.whl", "has_sig": false, "md5_digest": "c026009b135acd7603e392ee041941c7", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 7167, "upload_time": "2019-05-29T11:10:07", "url": "https://files.pythonhosted.org/packages/94/3a/f501a26276770c300153d59d182a5ed302c57d3ac9236becf976fc300f65/vlac-0.1.2.3-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "2cb1770e3d5c18b5f00419a8e397d528", "sha256": "3309cb1139f6e567ea22b5dacde199c492d23e53f9dff63e91c50dacee1b75a4" }, "downloads": -1, "filename": "vlac-0.1.2.3.tar.gz", "has_sig": false, "md5_digest": "2cb1770e3d5c18b5f00419a8e397d528", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 3641, "upload_time": "2019-05-29T11:10:09", "url": "https://files.pythonhosted.org/packages/71/3e/0926fcd20c7879cf8445a49c996cd4f97e8ef425745018b271f74355a162/vlac-0.1.2.3.tar.gz" } ], "0.1.2.4": [ { "comment_text": "", "digests": { "md5": "752f90d1ee90beb6b94f62c9e08a0d3b", "sha256": "4bc5d55408844de5d113597b00f1170cc4d0ac1ecefd594d4a281f4792392e2e" }, "downloads": -1, "filename": "vlac-0.1.2.4-py3-none-any.whl", "has_sig": false, "md5_digest": "752f90d1ee90beb6b94f62c9e08a0d3b", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 7154, "upload_time": "2019-05-29T11:12:37", "url": "https://files.pythonhosted.org/packages/91/67/b417276fe37a473f6a0307954f25753359c4a0c16decfaf6352edabfb2df/vlac-0.1.2.4-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "523f631e35d5cd87c9a0ec8f08bb34e6", "sha256": "ad45cfae0dccb35bbcd871842fd85c63b553ab7fff3109acb7b4cfc0876fc5f6" }, "downloads": -1, "filename": "vlac-0.1.2.4.tar.gz", "has_sig": false, "md5_digest": "523f631e35d5cd87c9a0ec8f08bb34e6", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 3630, "upload_time": "2019-05-29T11:12:38", "url": "https://files.pythonhosted.org/packages/8b/f8/b16f2696e8bc7f65b57f3993934c7dc12300daf2d8b2ff4a10478dd47740/vlac-0.1.2.4.tar.gz" } ], "0.1.2.5": [ { "comment_text": "", "digests": { "md5": "39bea7f235755cb81bac6250d0248bac", "sha256": "949ba13018a9836f7bb73ce43c1bc7f75f2b7c55b3cb1222e4bbf3cf872fc995" }, "downloads": -1, "filename": "vlac-0.1.2.5-py3-none-any.whl", "has_sig": false, "md5_digest": "39bea7f235755cb81bac6250d0248bac", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 7160, "upload_time": "2019-05-29T11:15:33", "url": "https://files.pythonhosted.org/packages/68/a5/f0059f6c080eb18471b0bff13f828406d10b3a07c26bb93b578ee26f7561/vlac-0.1.2.5-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "f7b5c203c523f1671918635208259f0c", "sha256": "5390d4f8b554e10eddb45a41ea774711bf5f6c1c21fb58206571a6a2cf933161" }, "downloads": -1, "filename": "vlac-0.1.2.5.tar.gz", "has_sig": false, "md5_digest": "f7b5c203c523f1671918635208259f0c", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 3639, "upload_time": "2019-05-29T11:15:34", "url": "https://files.pythonhosted.org/packages/fe/94/de8061646846aa96929685a0a499c44e4ec400079f9b11088f069e1e3567/vlac-0.1.2.5.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "39bea7f235755cb81bac6250d0248bac", "sha256": "949ba13018a9836f7bb73ce43c1bc7f75f2b7c55b3cb1222e4bbf3cf872fc995" }, "downloads": -1, "filename": "vlac-0.1.2.5-py3-none-any.whl", "has_sig": false, "md5_digest": "39bea7f235755cb81bac6250d0248bac", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 7160, "upload_time": "2019-05-29T11:15:33", "url": "https://files.pythonhosted.org/packages/68/a5/f0059f6c080eb18471b0bff13f828406d10b3a07c26bb93b578ee26f7561/vlac-0.1.2.5-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "f7b5c203c523f1671918635208259f0c", "sha256": "5390d4f8b554e10eddb45a41ea774711bf5f6c1c21fb58206571a6a2cf933161" }, "downloads": -1, "filename": "vlac-0.1.2.5.tar.gz", "has_sig": false, "md5_digest": "f7b5c203c523f1671918635208259f0c", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 3639, "upload_time": "2019-05-29T11:15:34", "url": "https://files.pythonhosted.org/packages/fe/94/de8061646846aa96929685a0a499c44e4ec400079f9b11088f069e1e3567/vlac-0.1.2.5.tar.gz" } ] }