{ "info": { "author": "Daniel Perez", "author_email": "tuvistavie@gmail.com", "bugtrack_url": null, "classifiers": [ "Development Status :: 4 - Beta", "Intended Audience :: Developers", "License :: OSI Approved :: MIT License", "Operating System :: OS Independent", "Programming Language :: Python :: 3" ], "description": "# bigcode-embeddings\n\nNOTE: data must be generated with [`bigcode-ast-tools`][2] before being able to use\nthis tool\n\n`bigcode-embeddings` allows to generate and visualize embeddings for\nAST nodes.\n\n## Install\n\nThis project should be used with Python 3.\n\nTo install the package either run\n\n```\npip install bigcode-embeddings\n```\n\nor clone the repository and run\n\n```\ncd bigcode-embeddings\npip install -r requirements.txt\npython setup.py install\n```\n\nNOTE: tensorflow needs to be installed separately.\n\n## Usage\n\n### Training embeddings\n\nTraining data can be generated using [`bigcode-ast-tools`][2]\n\nGiven a `data.txt.gz` generated from a vocabulary of size 30000,\n100D embeddings can be trained using\n\n```\n./bin/bigcode-embeddings train -o embeddings/ --vocab-size 30000 --emb-size 100 --l2-value 0.05 --learning-rate 0.01 data.txt.gz\n```\n\n[Tensorboard][2] can be used to visualize the progress\n\n```\ntensorboard --logdir embeddings/\n```\n\nAfter the first epoch, embeddings visualization becomes available from\nTensorboard. The vocabulary TSV file generated by `bigcode-ast-tools` can\nbe loaded to have labels on the embeddings.\n\n### Visualizing the embeddings\n\nTrained embeddings can be visualized using the `visualize` subcommand\nIf the generated vocabulary file is `vocab.tsv`, the above embeddings\ncan be visualized with the following command\n\n```\n./bin/data-explorer visualize clusters -m embeddings/embeddings.bin-STEP -l vocab.tsv\n```\n\nwhere `STEP` should be the largest value found in the `embeddings/` directory.\n\nThe `-i` flag can be passed to generate an interactive plot.\n\n[1]: ../bigcode-ast-tools/README.md\n[2]: https://github.com/tensorflow/tensorboard\n\n\n", "description_content_type": null, "docs_url": null, "download_url": "https://github.com/tuvistavie/bigcode-tools/archive/master.zip", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/tuvistavie/bigcode-tools/tree/master/bigcode-embeddings", "keywords": "", "license": "", "maintainer": "", "maintainer_email": "", "name": "bigcode-embeddings", "package_url": "https://pypi.org/project/bigcode-embeddings/", "platform": "", "project_url": "https://pypi.org/project/bigcode-embeddings/", "project_urls": { "Download": "https://github.com/tuvistavie/bigcode-tools/archive/master.zip", "Homepage": "https://github.com/tuvistavie/bigcode-tools/tree/master/bigcode-embeddings" }, "release_url": "https://pypi.org/project/bigcode-embeddings/0.1.2/", "requires_dist": [ "pandas", "scipy", "numpy", "scikit-learn", "matplotlib", "plotly" ], "requires_python": "", "summary": "Tool generate and visualize embeddings from bigcode", "version": "0.1.2" }, "last_serial": 3249768, "releases": { "0.1.0": [ { "comment_text": "", "digests": { "md5": "b4c6c8edba051170d11acffbbf1b9413", "sha256": "314930755b6feac64e532d502baad4165c9ff84b711b1e711c9e36efe6ecc618" }, "downloads": -1, "filename": "bigcode_embeddings-0.1.0-py3-none-any.whl", "has_sig": true, "md5_digest": "b4c6c8edba051170d11acffbbf1b9413", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 10043, "upload_time": "2017-10-11T15:59:26", "url": "https://files.pythonhosted.org/packages/d1/c8/0fa29826e551fa5b737af47ce0187abe9794430a820fea955020fafea1e8/bigcode_embeddings-0.1.0-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "2c7301be498905f21982fc1371661618", "sha256": "c3e6bb0478c73081ac7972f7e780a8d7f2eef91c2b25c39df5e7e8e652be8a93" }, "downloads": -1, "filename": "bigcode-embeddings-0.1.0.tar.gz", "has_sig": true, "md5_digest": "2c7301be498905f21982fc1371661618", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 6870, "upload_time": "2017-10-11T15:59:33", "url": "https://files.pythonhosted.org/packages/11/63/447274aacc7123fd4e774d82f5b75c80867002c3c5f6834929e8fb2a3f53/bigcode-embeddings-0.1.0.tar.gz" } ], "0.1.1": [ { "comment_text": "", "digests": { "md5": "29f0c06c9610b309af72cf3490bf2780", "sha256": "a7c1e37e076b073d124ade339e09ff91a4485df394eb4798ad5e9ddd7b8d03bd" }, "downloads": -1, "filename": "bigcode_embeddings-0.1.1-py3-none-any.whl", "has_sig": true, "md5_digest": "29f0c06c9610b309af72cf3490bf2780", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 10049, "upload_time": "2017-10-12T05:23:41", "url": "https://files.pythonhosted.org/packages/f6/13/423fef2babb9d07dbac28933157516257c1c2064241a5d1ffcda13ba5f8e/bigcode_embeddings-0.1.1-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "2845f4079994f76e59ca92e38d58ad96", "sha256": "4c9f2f57d602ed1690d478f5e201598805870523f967555cdc017d8158d5f557" }, "downloads": -1, "filename": "bigcode-embeddings-0.1.1.tar.gz", "has_sig": true, "md5_digest": "2845f4079994f76e59ca92e38d58ad96", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 6667, "upload_time": "2017-10-12T05:23:42", "url": "https://files.pythonhosted.org/packages/08/78/bb90872a19baacc6d74eb0aece593b82ac01a47f54230dad987877758917/bigcode-embeddings-0.1.1.tar.gz" } ], "0.1.2": [ { "comment_text": "", "digests": { "md5": "cab8721ab461cf820e6c8387ee77b23f", "sha256": "46eb6fa689cb5f98f70bbab06f21440e743735d2d3eef4c02521c4670473fa22" }, "downloads": -1, "filename": "bigcode_embeddings-0.1.2-py3-none-any.whl", "has_sig": true, "md5_digest": "cab8721ab461cf820e6c8387ee77b23f", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 10389, "upload_time": "2017-10-14T08:14:37", "url": "https://files.pythonhosted.org/packages/17/fc/bd4bee5397bfb57eb847b82edf7691ae5af49deba15c2d94ff64203c3f57/bigcode_embeddings-0.1.2-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "17f064180c415d925d0be60d8d27eda4", "sha256": "889e98d29ccbe4459337a8dcb9830918aa1bc1bfbe17d4e2ded5f48e5e0b0530" }, "downloads": -1, "filename": "bigcode-embeddings-0.1.2.tar.gz", "has_sig": true, "md5_digest": "17f064180c415d925d0be60d8d27eda4", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 6995, "upload_time": "2017-10-14T08:14:39", "url": "https://files.pythonhosted.org/packages/f3/06/0e4ac9ab46ed577f3f5499fb4264b3e73bf6d59fe7dd343baba0215aa03f/bigcode-embeddings-0.1.2.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "cab8721ab461cf820e6c8387ee77b23f", "sha256": "46eb6fa689cb5f98f70bbab06f21440e743735d2d3eef4c02521c4670473fa22" }, "downloads": -1, "filename": "bigcode_embeddings-0.1.2-py3-none-any.whl", "has_sig": true, "md5_digest": "cab8721ab461cf820e6c8387ee77b23f", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 10389, "upload_time": "2017-10-14T08:14:37", "url": "https://files.pythonhosted.org/packages/17/fc/bd4bee5397bfb57eb847b82edf7691ae5af49deba15c2d94ff64203c3f57/bigcode_embeddings-0.1.2-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "17f064180c415d925d0be60d8d27eda4", "sha256": "889e98d29ccbe4459337a8dcb9830918aa1bc1bfbe17d4e2ded5f48e5e0b0530" }, "downloads": -1, "filename": "bigcode-embeddings-0.1.2.tar.gz", "has_sig": true, "md5_digest": "17f064180c415d925d0be60d8d27eda4", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 6995, "upload_time": "2017-10-14T08:14:39", "url": "https://files.pythonhosted.org/packages/f3/06/0e4ac9ab46ed577f3f5499fb4264b3e73bf6d59fe7dd343baba0215aa03f/bigcode-embeddings-0.1.2.tar.gz" } ] }