{ "info": { "author": "Armand Boschin", "author_email": "aboschin@enst.fr", "bugtrack_url": null, "classifiers": [ "Development Status :: 2 - Pre-Alpha", "Intended Audience :: Developers", "License :: OSI Approved :: BSD License", "Natural Language :: English", "Programming Language :: Python :: 3.7" ], "description": "============\nWikiDataSets\n============\n\n\n.. image:: https://img.shields.io/pypi/v/wikidatasets.svg\n :target: https://pypi.python.org/pypi/wikidatasets\n\n.. image:: https://img.shields.io/travis/armand33/wikidatasets.svg\n :target: https://travis-ci.org/armand33/wikidatasets\n\n.. image:: https://readthedocs.org/projects/wikidatasets/badge/?version=latest\n :target: https://wikidatasets.readthedocs.io/en/latest/?badge=latest\n :alt: Documentation Status\n\n\n.. image:: https://pyup.io/repos/github/armand33/wikidatasets/shield.svg\n :target: https://pyup.io/repos/github/armand33/wikidatasets/\n :alt: Updates\n\n\n\nBreaking WikiData dumps into smaller knowledge graphs (e.g. graph of human entities).\n\n\n* Free software: BSD license\n* Documentation: https://wikidatasets.readthedocs.io.\n* Paper: https://arxiv.org/abs/1906.04536\n\nData Sets\n---------\nData sets are available on this `page `_.\n\nFeatures\n--------\n\nThis is a non-exhaustive list of useful functions:\n\n* ``wikidatasets.processFunctions.get_subclasses`` : Gets the list of WikiData IDs of entities that are subclasses of the subject.\n* ``wikidatasets.processFunctions.query_wikidata_dump`` : Goes through a WikiData dump. It can either collect entities that are instances of ``test_entities`` or collect the dictionary of labels. 
It can also do both.\n* ``wikidatasets.processFunctions.build_dataset`` : Builds datasets from the pickle files produced by ``query_wikidata_dump``.\n* ``wikidatasets.utils.load_data_labels`` : Loads the edges and attributes files into Pandas dataframes and merges in the labels of entities and relations.\n\nThe ``example/`` folder contains example scripts for creating datasets (e.g. `build_humans.py `_).\nSuch scripts should be placed in the main directory (along with ``utils.py`` and ``processFunctions.py``), and their hard-coded paths should be adjusted to match your installation.\n\nCitations\n---------\n\nIf you find this code useful in your research, please consider citing our `paper `_:\n\n.. code-block::\n\n @misc{arm2019wikidatasets,\n title={WikiDataSets : Standardized sub-graphs from WikiData},\n author={Armand Boschin},\n year={2019},\n eprint={1906.04536},\n archivePrefix={arXiv},\n primaryClass={cs.LG}\n }\n\nCredits\n-------\n\nThis package was created with Cookiecutter_ and the `audreyr/cookiecutter-pypackage`_ project template.\n\n.. _Cookiecutter: https://github.com/audreyr/cookiecutter\n.. 
_`audreyr/cookiecutter-pypackage`: https://github.com/audreyr/cookiecutter-pypackage\n\n\n=======\nHistory\n=======\n\n0.2.0 (2019-07-02)\n------------------\n\n* Added export of a nodes.txt to the build_dataset function.\n\n\n\n0.1.0 (2019-07-01)\n------------------\n\n* First release on PyPI.\n\n\n", "description_content_type": "", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/armand33/wikidatasets", "keywords": "wikidatasets", "license": "BSD license", "maintainer": "", "maintainer_email": "", "name": "wikidatasets", "package_url": "https://pypi.org/project/wikidatasets/", "platform": "", "project_url": "https://pypi.org/project/wikidatasets/", "project_urls": { "Homepage": "https://github.com/armand33/wikidatasets" }, "release_url": "https://pypi.org/project/wikidatasets/0.2.0/", "requires_dist": [ "tqdm (==4.32.2)", "sparqlwrapper (==1.8.2)", "pandas (==0.24.1)" ], "requires_python": "", "summary": "Break WikiData dumps into smaller knowledge graphs", "version": "0.2.0" }, "last_serial": 5476446, "releases": { "0.1.4": [ { "comment_text": "", "digests": { "md5": "224660dda7e2c6c3ad518e989476b8db", "sha256": "a715b1a8fbb06b66edcc24fcc3df7cf37c8c89041f1c8c300dba8c3e7cf82463" }, "downloads": -1, "filename": "wikidatasets-0.1.4-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "224660dda7e2c6c3ad518e989476b8db", "packagetype": "bdist_wheel", "python_version": "py2.py3", "requires_python": null, "size": 8987, "upload_time": "2019-07-02T09:01:35", "url": "https://files.pythonhosted.org/packages/06/79/75b56c9b39c2a4b6bbc4baa2a7ddf61cf6d96d480f2e2ac8676b82a9a007/wikidatasets-0.1.4-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "6f9e18277c82352f02a7e803f98af062", "sha256": "fb88cf6ea2057ac1b5c53dc41439be3e1a337fcc588e003e22b195c56e442fca" }, "downloads": -1, "filename": "wikidatasets-0.1.4.tar.gz", "has_sig": false, "md5_digest": 
"6f9e18277c82352f02a7e803f98af062", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 13546, "upload_time": "2019-07-02T09:01:36", "url": "https://files.pythonhosted.org/packages/ab/11/b95ba9b9f1302e7d33c6ed311cfad2c9629400884c3b4ab2f178aee92f09/wikidatasets-0.1.4.tar.gz" } ], "0.1.5": [ { "comment_text": "", "digests": { "md5": "d384a36b47297330af15917b34a2e934", "sha256": "22aab373feb8f8c048bda5dc5f84d5a155361813366b54b54752c08d35650dfb" }, "downloads": -1, "filename": "wikidatasets-0.1.5-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "d384a36b47297330af15917b34a2e934", "packagetype": "bdist_wheel", "python_version": "py2.py3", "requires_python": null, "size": 10077, "upload_time": "2019-07-02T09:22:19", "url": "https://files.pythonhosted.org/packages/e4/3b/24c546235d536c6d4828a8af028c81339eb7e713971676acf4daa9ab8e7e/wikidatasets-0.1.5-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "7c74ddf6f6857472f23e8d55e983e859", "sha256": "71cecbd21ad8833451716990f61a6411eac0456e682a2207620cf804f99d2867" }, "downloads": -1, "filename": "wikidatasets-0.1.5.tar.gz", "has_sig": false, "md5_digest": "7c74ddf6f6857472f23e8d55e983e859", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 14674, "upload_time": "2019-07-02T09:22:20", "url": "https://files.pythonhosted.org/packages/74/e6/4f77f670a1f007ffa35071771ff8ee55c713a2814c1ba985c16de8faadf5/wikidatasets-0.1.5.tar.gz" } ], "0.2.0": [ { "comment_text": "", "digests": { "md5": "8a56fe5b319019ad6a48a300a4f4fd97", "sha256": "1ded488c0e5abcd4e20a0d230713dc58d288631b3b03755b446727c665898d0d" }, "downloads": -1, "filename": "wikidatasets-0.2.0-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "8a56fe5b319019ad6a48a300a4f4fd97", "packagetype": "bdist_wheel", "python_version": "py2.py3", "requires_python": null, "size": 10136, "upload_time": "2019-07-02T11:25:09", "url": 
"https://files.pythonhosted.org/packages/6e/25/98cae4e58f85384334ad2d0fb8dc3f558e7b0b283eb0f6df33f76df2b921/wikidatasets-0.2.0-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "86a84469dbdfb66ffb150089c4475b2e", "sha256": "377e17fa558c1abebc6d0c4c3fd50f642ce7ebce6fe1a8a6a4813daf786f2b45" }, "downloads": -1, "filename": "wikidatasets-0.2.0.tar.gz", "has_sig": false, "md5_digest": "86a84469dbdfb66ffb150089c4475b2e", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 14804, "upload_time": "2019-07-02T11:25:11", "url": "https://files.pythonhosted.org/packages/c5/41/47d1f1f86f154f892a8ca5d8d9c6c0e4d27ebc907499e719529e06db4806/wikidatasets-0.2.0.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "8a56fe5b319019ad6a48a300a4f4fd97", "sha256": "1ded488c0e5abcd4e20a0d230713dc58d288631b3b03755b446727c665898d0d" }, "downloads": -1, "filename": "wikidatasets-0.2.0-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "8a56fe5b319019ad6a48a300a4f4fd97", "packagetype": "bdist_wheel", "python_version": "py2.py3", "requires_python": null, "size": 10136, "upload_time": "2019-07-02T11:25:09", "url": "https://files.pythonhosted.org/packages/6e/25/98cae4e58f85384334ad2d0fb8dc3f558e7b0b283eb0f6df33f76df2b921/wikidatasets-0.2.0-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "86a84469dbdfb66ffb150089c4475b2e", "sha256": "377e17fa558c1abebc6d0c4c3fd50f642ce7ebce6fe1a8a6a4813daf786f2b45" }, "downloads": -1, "filename": "wikidatasets-0.2.0.tar.gz", "has_sig": false, "md5_digest": "86a84469dbdfb66ffb150089c4475b2e", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 14804, "upload_time": "2019-07-02T11:25:11", "url": "https://files.pythonhosted.org/packages/c5/41/47d1f1f86f154f892a8ca5d8d9c6c0e4d27ebc907499e719529e06db4806/wikidatasets-0.2.0.tar.gz" } ] }