{ "info": { "author": "Bryan Perozzi", "author_email": "bperozzi@cs.stonybrook.edu", "bugtrack_url": null, "classifiers": [ "Development Status :: 2 - Pre-Alpha", "Intended Audience :: Developers", "License :: OSI Approved :: BSD License", "Natural Language :: English", "Programming Language :: Python :: 2", "Programming Language :: Python :: 2.7", "Programming Language :: Python :: 3", "Programming Language :: Python :: 3.4" ], "description": "===============================\nDeepWalk\n===============================\n\nDeepWalk uses short random walks to learn representations for vertices in graphs.\n\nUsage\n-----\n\n**Example Usage**\n ``$deepwalk --input example_graphs/karate.adjlist --output karate.embeddings``\n\n**--input**: *input_filename*\n\n 1. ``--format adjlist`` for an adjacency list, e.g::\n\n 1 2 3 4 5 6 7 8 9 11 12 13 14 18 20 22 32\n 2 1 3 4 8 14 18 20 22 31\n 3 1 2 4 8 9 10 14 28 29 33\n ...\n\n 2. ``--format edgelist`` for an edge list, e.g::\n\n 1 2\n 1 3\n 1 4\n ...\n\n 3. ``--format mat`` for a Matlab .mat file containing an adjacency matrix\n (note, you must also specify the variable name of the adjacency matrix ``--matfile-variable-name``)\n\n**--output**: *output_filename*\n\n The output representations in skipgram format - first line is header, all other lines are node-id and *d* dimensional representation::\n\n 34 64\n 1 0.016579 -0.033659 0.342167 -0.046998 ...\n 2 -0.007003 0.265891 -0.351422 0.043923 ...\n ...\n\n**Full Command List**\n The full list of command line options is available with ``$deepwalk --help``\n\nEvaluation\n----------\nHere, we will show how to evaluate DeepWalk on the *BlogCatalog* dataset used in the DeepWalk paper.\nFirst, we run the following command to produce its DeepWalk embeddings::\n\n deepwalk --format mat --input example_graphs/blogcatalog.mat\n --max-memory-data-size 0 --number-walks 80 --representation-size 128 --walk-length 40 --window-size 10\n --workers 1 --output example_graphs/blogcatalog.embeddings\n\nThe parameters specified here are the same as in the paper.\nIf you are using a multi-core machine, try to set ``--workers`` to a larger number for faster training.\nOn a single machine with 24 Xeon E5-2620 @ 2.00GHz CPUs, this command takes about 20 minutes to finish (``--workers`` is set to 20).\nThen, we evaluate the learned embeddings on a multi-label node classification task with ``example_graphs/scoring.py``::\n\n python example_graphs/scoring.py --emb example_graphs/blogcatalog.embeddings\n --network example_graphs/blogcatalog.mat\n --num-shuffle 10 --all\n\nThis command finishes in 8 minutes on the same machine. For faster evaluation, you can set ``--num-shuffle`` to a smaller number, but expect more fluctuation in performance. The micro F1 and macro F1 scores we get with different ratio of labeled nodes are as follows:\n\n+-----------------+-------+-------+-------+-------+-------+-------+-------+-------+-------+\n| % Labeled Nodes | 10% | 20% | 30% | 40% | 50% | 60% | 70% | 80% | 90% |\n+=================+=======+=======+=======+=======+=======+=======+=======+=======+=======+\n| *Micro-F1 (%)* | 35.86 | 38.51 | 39.96 | 40.76 | 41.51 | 41.85 | 42.27 | 42.35 | 42.40 |\n+-----------------+-------+-------+-------+-------+-------+-------+-------+-------+-------+\n| *Macro-F1 (%)* | 21.08 | 23.98 | 25.71 | 26.73 | 27.68 | 28.28 | 28.88 | 28.70 | 28.21 |\n+-----------------+-------+-------+-------+-------+-------+-------+-------+-------+-------+\n\n**Note that the current version of DeepWalk is based on a newer version of gensim, which may have a different implementation of the word2vec model. To completely reproduce the results in our paper, you will probably have to install an older version of gensim(version 0.10.2).**\n\nRequirements\n------------\n* numpy\n* scipy\n\n(may have to be independently installed) \n\n\n\nInstallation\n------------\n#. cd deepwalk\n#. pip install -r requirements.txt \n#. python setup.py install\n\n\nCiting\n------\nIf you find DeepWalk useful in your research, we ask that you cite the following paper::\n\n @inproceedings{Perozzi:2014:DOL:2623330.2623732,\n author = {Perozzi, Bryan and Al-Rfou, Rami and Skiena, Steven},\n title = {DeepWalk: Online Learning of Social Representations},\n booktitle = {Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining},\n series = {KDD '14},\n year = {2014},\n isbn = {978-1-4503-2956-9},\n location = {New York, New York, USA},\n pages = {701--710},\n numpages = {10},\n url = {http://doi.acm.org/10.1145/2623330.2623732},\n doi = {10.1145/2623330.2623732},\n acmid = {2623732},\n publisher = {ACM},\n address = {New York, NY, USA},\n keywords = {deep learning, latent representations, learning with partial labels, network classification, online learning, social networks},\n } \n\nMisc\n----\n\nDeepWalk - Online learning of social representations.\n\n* Free software: GPLv3 license\n* Documentation: http://deepwalk.readthedocs.org.\n\n\n\n.. image:: https://badge.fury.io/py/deepwalk.png\n :target: http://badge.fury.io/py/deepwalk\n\n.. image:: https://travis-ci.org/phanein/deepwalk.png?branch=master\n :target: https://travis-ci.org/phanein/deepwalk\n\n.. image:: https://pypip.in/d/deepwalk/badge.png\n :target: https://pypi.python.org/pypi/deepwalk\n\n\n\n\nHistory\n-------\n\n1.0.3 (2018-03-23)\n---------------------\n\n* Now compatible with the latest version of gensim and sklearn\n* Better support for Python 3\n\n1.0.2 (2014-09-19)\n---------------------\n\n* Fixed gensim at 0.10.2 for now\n\n1.0.1 (2014-09-19)\n---------------------\n\n* Added utilities to support generated embeddings for larger graphs\n* Support for additional input file formats\n\n1.0.0 (2014-08-24)\n---------------------\n\n* First release on PyPI.\n\n\n", "description_content_type": "", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/phanein/deepwalk", "keywords": "deepwalk", "license": "GPLv3", "maintainer": "", "maintainer_email": "", "name": "deepwalk", "package_url": "https://pypi.org/project/deepwalk/", "platform": "", "project_url": "https://pypi.org/project/deepwalk/", "project_urls": { "Homepage": "https://github.com/phanein/deepwalk" }, "release_url": "https://pypi.org/project/deepwalk/1.0.3/", "requires_dist": [ "wheel (>=0.23.0)", "Cython (>=0.20.2)", "argparse (>=1.2.1)", "futures (>=2.1.6)", "six (>=1.7.3)", "gensim (>=1.0.0)", "scipy (>=0.15.0)", "psutil (>=2.1.1)" ], "requires_python": "", "summary": "DeepWalk online learning of social representations.", "version": "1.0.3" }, "last_serial": 3819267, "releases": { "1.0.0": [ { "comment_text": "", "digests": { "md5": "7e08d0a6e5c8eb4f4428203064ef6537", "sha256": "3bd32f5c7c4c5be4ad95882665ebac99deb1eb57f33147384809802928d55060" }, "downloads": -1, "filename": "deepwalk-1.0.0.tar.gz", "has_sig": false, "md5_digest": "7e08d0a6e5c8eb4f4428203064ef6537", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 26250, "upload_time": "2014-08-23T18:04:14", "url": "https://files.pythonhosted.org/packages/89/9e/83d483581832a0ede07ddf2c4098bfba3ca9dc3ef6a57800d9c5732ed354/deepwalk-1.0.0.tar.gz" } ], "1.0.1": [ { "comment_text": "", "digests": { "md5": "c3d8ec159b397d1348128ca07ba9f8e0", "sha256": "a44e8667715345753983240b92cb5a215ffbf20363beb4e06b2896b7de3c01ab" }, "downloads": -1, "filename": "deepwalk-1.0.1.tar.gz", "has_sig": false, "md5_digest": "c3d8ec159b397d1348128ca07ba9f8e0", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 29077, "upload_time": "2014-09-19T18:19:17", "url": "https://files.pythonhosted.org/packages/79/64/9ae791d80b83874f9aa86c74e0da4a1f7f35fd3dfb4c554736ecbde4c425/deepwalk-1.0.1.tar.gz" } ], "1.0.3": [ { "comment_text": "", "digests": { "md5": "52440ef0ac822475882821ff6aad73fa", "sha256": "fb15c59c94fb35467071f45700ae33e6fecb11750776c53b4f2d38fffb33a57a" }, "downloads": -1, "filename": "deepwalk-1.0.3-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "52440ef0ac822475882821ff6aad73fa", "packagetype": "bdist_wheel", "python_version": "py2.py3", "requires_python": null, "size": 10664, "upload_time": "2018-04-29T21:05:18", "url": "https://files.pythonhosted.org/packages/17/84/97fcfdea22ebe0f61e1a6740daae98d47cc1ba49fffa2b117c550b1a76ee/deepwalk-1.0.3-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "6c167f4dc1a8a6abce988a91adb431ff", "sha256": "507a8fc85363fb14a2838eb2304b8a04a08ac0d8ff57611fbea22db671a44674" }, "downloads": -1, "filename": "deepwalk-1.0.3.tar.gz", "has_sig": false, "md5_digest": "6c167f4dc1a8a6abce988a91adb431ff", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 32027, "upload_time": "2018-04-29T21:05:19", "url": "https://files.pythonhosted.org/packages/aa/30/bbbf62ca65e9c427b91eb685e3ce37490324969f8a4141971f5a12a3e2bf/deepwalk-1.0.3.tar.gz" } ], "v1.0.2": [ { "comment_text": "", "digests": { "md5": "04a43409e1fb5c50c68745915eaa72d1", "sha256": "ac60fb3abf13abd0b8a8e9d3808b8e16b97d53a9ecf610ea9701bc19b94e4df5" }, "downloads": -1, "filename": "deepwalk-v1.0.2.tar.gz", "has_sig": false, "md5_digest": "04a43409e1fb5c50c68745915eaa72d1", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 29104, "upload_time": "2014-11-19T19:29:24", "url": "https://files.pythonhosted.org/packages/0e/b4/ea969f75f20ccd16d0685ec4a30d14ca2ac21370674200463241132693f3/deepwalk-v1.0.2.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "52440ef0ac822475882821ff6aad73fa", "sha256": "fb15c59c94fb35467071f45700ae33e6fecb11750776c53b4f2d38fffb33a57a" }, "downloads": -1, "filename": "deepwalk-1.0.3-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "52440ef0ac822475882821ff6aad73fa", "packagetype": "bdist_wheel", "python_version": "py2.py3", "requires_python": null, "size": 10664, "upload_time": "2018-04-29T21:05:18", "url": "https://files.pythonhosted.org/packages/17/84/97fcfdea22ebe0f61e1a6740daae98d47cc1ba49fffa2b117c550b1a76ee/deepwalk-1.0.3-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "6c167f4dc1a8a6abce988a91adb431ff", "sha256": "507a8fc85363fb14a2838eb2304b8a04a08ac0d8ff57611fbea22db671a44674" }, "downloads": -1, "filename": "deepwalk-1.0.3.tar.gz", "has_sig": false, "md5_digest": "6c167f4dc1a8a6abce988a91adb431ff", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 32027, "upload_time": "2018-04-29T21:05:19", "url": "https://files.pythonhosted.org/packages/aa/30/bbbf62ca65e9c427b91eb685e3ce37490324969f8a4141971f5a12a3e2bf/deepwalk-1.0.3.tar.gz" } ] }