{ "info": { "author": "Junpei Kawamoto", "author_email": "kawamoto.junpei@gmail.com", "bugtrack_url": null, "classifiers": [ "Development Status :: 4 - Beta", "Intended Audience :: Science/Research", "License :: OSI Approved :: GNU General Public License v3 (GPLv3)", "Natural Language :: English", "Programming Language :: Python", "Programming Language :: Python :: 2.7", "Programming Language :: Python :: 3.6", "Topic :: Scientific/Engineering :: Information Analysis", "Topic :: Software Development :: Libraries" ], "description": "Trip Advisor Dataset Loader\n===========================\n\n|GPLv3| |Build Status| |wercker status| |Release| |PyPi| |Japanese|\n\n|Logo|\n\nFor the `Review Graph Mining project `__,\nthis package provides a loader of the `Trip Advisor\ndataset `__ provided by `Dr.\nWang `__.\n\nInstallation\n------------\n\nUse pip to install this package.\n\n.. code:: shell\n\n $ pip install --upgrade rgmining-tripadvisor-dataset\n\nNote that this installation will download a big data file from the\n`original web site `__.\n\nThis package uses ``bz2`` internally. If your python doesn't have that\npackage (try ``import bz2``), rebuild python before installation.\n\nUsage\n-----\n\nThis package provides module ``tripadvisor`` and this module provides\n``load`` function. The ``load`` function takes a graph object which\nimplements the `graph\ninterface `__\ndefined in `Review Graph Mining\nproject `__.\n\nFor example, the following code constructs a graph object provides the\n`FRAUDAR `__\nalgorithm, loads the Trip Advisor dataset, runs the algorithm, and then\noutputs names of anomalous reviewers. Since this dataset consists of\nhuge reviews, loading may take long time.\n\n.. code:: py\n\n import fraudar\n import tripadvisor\n\n # Construct a graph and load the dataset.\n graph = fraudar.ReviewGraph()\n tripadvisor.load(graph)\n\n # Run the analyzing algorithm.\n graph.update()\n\n # Print names of reviewers who are judged as anomalous.\n for r in graph.reviewers:\n if r.anomalous_score == 1:\n print r.name\n\n # The number of reviewers the dataset has: -> 1169456.\n len(graph.reviewers)\n\n # The number of reviewers judged as anomalous: -> 147.\n len([r for r in graph.reviewers if r.anomalous_score == 1])\n\nNote that you may need to install the FRAUDAR algorithm for the Review\nMining Project by ``pip install rgmining-fraudar``.\n\nLicense\n-------\n\nThis software is released under The GNU General Public License Version\n3, see `COPYING `__ for more detail.\n\nThe authors of the Trip Advisor dataset, which this software imports,\nrequires to cite the following papers when you publish research papers\nusing this package:\n\n- `Hongning Wang `__, `Yue\n Lu `__, and `ChengXiang\n Zhai `__, \"`Latent Aspect Rating\n Analysis without Aspect Keyword\n Supervision `__,\"\n In Proc. of the 17th ACM SIGKDD Conference on Knowledge Discovery and\n Data Mining (KDD'2011), pp.618-626, 2011;\n- `Hongning Wang `__, `Yue\n Lu `__, and `Chengxiang\n Zhai `__, \"`Latent Aspect Rating\n Analysis on Review Text Data: A Rating Regression\n Approach `__,\"\n In Proc. of the 16th ACM SIGKDD Conference on Knowledge Discovery and\n Data Mining (KDD'2010), pp.783-792, 2010.\n\n.. |GPLv3| image:: https://img.shields.io/badge/license-GPLv3-blue.svg\n :target: https://www.gnu.org/copyleft/gpl.html\n.. |Build Status| image:: https://travis-ci.org/rgmining/tripadvisor.svg?branch=master\n :target: https://travis-ci.org/rgmining/tripadvisor\n.. |wercker status| image:: https://app.wercker.com/status/815b060cc2fa911215674efdc1078d50/s/master\n :target: https://app.wercker.com/project/byKey/815b060cc2fa911215674efdc1078d50\n.. |Release| image:: https://img.shields.io/badge/release-0.5.6-brightgreen.svg\n :target: https://github.com/rgmining/tripadvisor/releases/tag/v0.5.6\n.. |PyPi| image:: https://img.shields.io/badge/pypi-0.5.6-brightgreen.svg\n :target: https://pypi.python.org/pypi/rgmining-tripadvisor-dataset\n.. |Japanese| image:: https://img.shields.io/badge/qiita-%E6%97%A5%E6%9C%AC%E8%AA%9E-brightgreen.svg\n :target: http://qiita.com/jkawamoto/items/86c687d85efb783bcd7d\n.. |Logo| image:: https://rgmining.github.io/tripadvisor/_static/image.png\n :target: https://rgmining.github.io/tripadvisor/", "description_content_type": null, "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/rgmining/tripadvisor", "keywords": "", "license": "GPLv3", "maintainer": "", "maintainer_email": "", "name": "rgmining-tripadvisor-dataset", "package_url": "https://pypi.org/project/rgmining-tripadvisor-dataset/", "platform": "UNKNOWN", "project_url": "https://pypi.org/project/rgmining-tripadvisor-dataset/", "project_urls": { "Homepage": "https://github.com/rgmining/tripadvisor" }, "release_url": "https://pypi.org/project/rgmining-tripadvisor-dataset/0.5.6/", "requires_dist": null, "requires_python": "", "summary": "Trip Advisor dataset for Review Graph Mining Project", "version": "0.5.6" }, "last_serial": 2975306, "releases": { "0.5.0": [], "0.5.1": [ { "comment_text": "", "digests": { "md5": "674399ab7465b9ecd5aca660a8a9cb13", "sha256": "fb88f6321bcacf91d095c4817209af5473c9fb5185406ddaeb4c49a7ad5f6f89" }, "downloads": -1, "filename": "rgmining-tripadvisor-dataset-0.5.1.tar.gz", "has_sig": false, "md5_digest": "674399ab7465b9ecd5aca660a8a9cb13", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 16033, "upload_time": "2017-01-16T08:50:41", "url": "https://files.pythonhosted.org/packages/6d/0c/56e45bceb23a142023ef45fa6d759c0fea0eb04c09cdab9bf673bcd9f4ee/rgmining-tripadvisor-dataset-0.5.1.tar.gz" } ], "0.5.2": [ { "comment_text": "", "digests": { "md5": "8930a5bc0f6db6c51718dc79e9a22767", "sha256": "4eed9477cd8c1f7696c099a839c355a11c8d4feaa2ef0d48ff22bfd3c4731df8" }, "downloads": -1, "filename": "rgmining-tripadvisor-dataset-0.5.2.tar.gz", "has_sig": false, "md5_digest": "8930a5bc0f6db6c51718dc79e9a22767", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 16366, "upload_time": "2017-01-16T16:04:05", "url": "https://files.pythonhosted.org/packages/3c/e3/7b6f76bb404551523f37bbecca3c63fbfa3e061cc1318f4c00f53b195742/rgmining-tripadvisor-dataset-0.5.2.tar.gz" } ], "0.5.3": [ { "comment_text": "", "digests": { "md5": "5370a1d854f4c1bc80edcf626c37ad93", "sha256": "fced49ae6b0f306bcbca4765a93fb2dafcb7ccb03705124601f4428f8ef32734" }, "downloads": -1, "filename": "rgmining-tripadvisor-dataset-0.5.3.tar.gz", "has_sig": false, "md5_digest": "5370a1d854f4c1bc80edcf626c37ad93", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 16495, "upload_time": "2017-01-16T16:45:35", "url": "https://files.pythonhosted.org/packages/20/f1/d5f75c9e3f2abcc10a477b82c5e847002c6ee0e15d41a1cee700ea4667d9/rgmining-tripadvisor-dataset-0.5.3.tar.gz" } ], "0.5.5": [ { "comment_text": "", "digests": { "md5": "6a2daec993cf746954a8b29020838e6a", "sha256": "697b603d65e2b995564490a5049363d0b1457285a7c09e33affc95e030b44b34" }, "downloads": -1, "filename": "rgmining-tripadvisor-dataset-0.5.5.tar.gz", "has_sig": false, "md5_digest": "6a2daec993cf746954a8b29020838e6a", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 16660, "upload_time": "2017-01-22T07:43:27", "url": "https://files.pythonhosted.org/packages/b5/7c/4c7cc6fe77a5542591dd30a856b4a14a469e24215abbd0f36134f5391546/rgmining-tripadvisor-dataset-0.5.5.tar.gz" } ], "0.5.5.post1": [ { "comment_text": "", "digests": { "md5": "6279fb1dc7d35b7bc1aa27856c38c3f3", "sha256": "3297d83c893e9863ce5830329c6a959df25c9fa949524bd04ff9630e708668e6" }, "downloads": -1, "filename": "rgmining-tripadvisor-dataset-0.5.5.post1.tar.gz", "has_sig": false, "md5_digest": "6279fb1dc7d35b7bc1aa27856c38c3f3", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 53623, "upload_time": "2017-01-26T18:55:31", "url": "https://files.pythonhosted.org/packages/51/ac/ca4c5cdd873f5f74e8a0b4bef75477c4af01129d3d81cc67ad7153f25a1c/rgmining-tripadvisor-dataset-0.5.5.post1.tar.gz" } ], "0.5.6": [ { "comment_text": "", "digests": { "md5": "0ee7dd8f2a17f34e523158570a7ab87c", "sha256": "faa8fb0562f19f806569e73726f788dc89782b749bfc907df2f2e05341473bdb" }, "downloads": -1, "filename": "rgmining-tripadvisor-dataset-0.5.6.tar.gz", "has_sig": false, "md5_digest": "0ee7dd8f2a17f34e523158570a7ab87c", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 56076, "upload_time": "2017-06-24T06:06:16", "url": "https://files.pythonhosted.org/packages/07/3f/2a3ebefd0f92578c106dd8e0b34bdb5cf7e07d45d5a57886edd7e6fdc1f2/rgmining-tripadvisor-dataset-0.5.6.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "0ee7dd8f2a17f34e523158570a7ab87c", "sha256": "faa8fb0562f19f806569e73726f788dc89782b749bfc907df2f2e05341473bdb" }, "downloads": -1, "filename": "rgmining-tripadvisor-dataset-0.5.6.tar.gz", "has_sig": false, "md5_digest": "0ee7dd8f2a17f34e523158570a7ab87c", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 56076, "upload_time": "2017-06-24T06:06:16", "url": "https://files.pythonhosted.org/packages/07/3f/2a3ebefd0f92578c106dd8e0b34bdb5cf7e07d45d5a57886edd7e6fdc1f2/rgmining-tripadvisor-dataset-0.5.6.tar.gz" } ] }