{ "info": { "author": "Tony Narlock", "author_email": "cihai@git-pull.com", "bugtrack_url": null, "classifiers": [ "Development Status :: 3 - Alpha", "Environment :: Web Environment", "Intended Audience :: Developers", "License :: OSI Approved :: MIT License", "Programming Language :: Python", "Programming Language :: Python :: 2.7", "Programming Language :: Python :: 3", "Programming Language :: Python :: 3.3", "Programming Language :: Python :: 3.4", "Programming Language :: Python :: 3.5", "Programming Language :: Python :: 3.6", "Programming Language :: Python :: Implementation :: PyPy", "Topic :: Utilities" ], "description": "*cihaidata-unihan* - tool to build `unihan`_ into `simple data format`\nCSV format. Part of the `cihai`_ project.\n\n|pypi| |docs| |build-status| |coverage| |license|\n\nUnihan's data is disperved across multiple files in the format of::\n\n U+3400\tkCantonese\tjau1\n U+3400\tkDefinition\t(same as U+4E18 \u4e18) hillock or mound\n U+3400\tkMandarin\tqi\u016b\n U+3401\tkCantonese\ttim2\n U+3401\tkDefinition\tto lick; to taste, a mat, bamboo bark\n U+3401\tkHanyuPinyin\t10019.020:ti\u00e0n\n U+3401\tkMandarin\tti\u00e0n\n\n``cihaidata_unihan/process.py`` will download Unihan.zip and build all files into a\nsingle tabular CSV (default output: ``./data/unihan.csv``)::\n\n char,ucn,kCantonese,kDefinition,kHanyuPinyin,kMandarin\n \u4e18,U+3400,jau1,(same as U+4E18 \u4e18) hillock or mound,,qi\u016b\n \u3401,U+3401,tim2,\"to lock; to taste, a mat, bamboo bark\",10019.020:\"ti\u00e0n,ti\u00e0n\"\n\n``process.py`` supports command line arguments. See `cihaidata_unihan/process.py CLI\narguments`_ for information on how you can specify custom columns, files,\ndownload URL's and output destinations.\n\nBeing built against unit tests. See the `Travis Builds`_ and\n`Revision History`_.\n\n.. _cihai: https://cihai.git-pull.com\n.. _cihai-handbook: https://github.com/cihai/cihai-handbook\n.. _cihai team: https://github.com/cihai?tab=members\n.. _cihai-python: https://github.com/cihai/cihai-python\n.. _cihaidata-unihan on github: https://github.com/cihai/cihaidata-unihan\n\nUsage\n-----\n\nTo download and build your own ``unihan.csv``:\n\n.. code-block:: bash\n\n $ ./cihaidata_unihan/process.py\n\nCreates ``data/unihan.csv``.\n\nSee `cihaidata_unihan/process.py CLI arguments`_ for advanced usage examples.\n\n.. _cihaidata_unihan/process.py CLI arguments: http://cihaidata-unihan.readthedocs.org/en/latest/cli.html\n\nStructure\n---------\n\n.. code-block:: bash\n\n # dataset metadata, schema information.\n datapackage.json\n\n # (future) when this package is stable, unihan.csv will be provided\n data/unihan.csv\n\n # stores downloaded Unihan.zip and it's txt file contents (.gitignore'd)\n data/build_files/\n\n # script to download + build a SDF csv of unihan.\n cihaidata_unihan/process.py\n\n # unit tests to verify behavior / consistency of builder\n tests/*\n\n # python 2/3 compatibility modules\n cihaidata_unihan/_compat.py\n cihaidata_unihan/unicodecsv.py\n\n # python module, public-facing python API.\n __init__.py\n cihaidata_unihan/__init__.py\n\n # utility / helper functions\n cihaidata_unihan/util.py\n\n\nCihai is *not* required for:\n\n- ``data/unihan.csv`` - `simple data format`_ compatible csv file.\n- ``cihaidata_unihan/process.py`` - create a ``data/unihan.csv``.\n\nWhen this module is stable, ``data/unihan.csv`` will have prepared\nreleases, without requires using ``cihaidata_unihan/process.py``. ``process.py``\nwill not require external libraries.\n\nExamples\n--------\n\n- https://github.com/datasets/gdp\n- https://github.com/datasets/country-codes\n\nRelated links:\n\n- CSV *Simple Data Format* (SDF): http://data.okfn.org/standards/simple-data-format\n- Tools: http://data.okfn.org/tools\n\n\n.. _Travis Builds: https://travis-ci.org/cihai/cihaidata-unihan/builds\n.. _Revision History: https://github.com/cihai/cihaidata-unihan/commits/master\n.. _cjklib: http://cjklib.org/0.3/\n.. _current datasets: http://cihai.readthedocs.org/en/latest/api.html#datasets\n.. _permissively licensing your dataset: http://cihai.readthedocs.org/en/latest/information_liberation.html\n\n============== ==========================================================\nPython support Python 2.7, >= 3.3, pypy/pypy3\nSource https://github.com/cihai/cihaidata-unihan\nDocs https://cihaidata-unihan.git-pull.com\nChangelog https://cihaidata-unihan.git-pull.com/en/latest/history.html\nAPI https://cihaidata-unihan.git-pull.com/en/latest/api.html\nIssues https://github.com/cihai/cihaidata-unihan/issues\nTravis https://travis-ci.org/cihai/cihaidata-unihan\nTest coverage https://codecov.io/gh/cihai/cihaidata-unihan\npypi https://pypi.python.org/pypi/cihaidata-unihan\nOpenHub https://www.openhub.net/p/cihaidata-unihan\nLicense `MIT`_.\ngit repo .. code-block:: bash\n\n $ git clone https://github.com/cihai/cihaidata-unihan.git\ninstall dev .. code-block:: bash\n\n $ git clone https://github.com/cihai/cihaidata-unihan.git cihai\n $ cd ./cihai\n $ virtualenv .env\n $ source .env/bin/activate\n $ pip install -e .\ntests .. code-block:: bash\n\n $ python setup.py test\n============== ==========================================================\n\n.. _MIT: http://opensource.org/licenses/MIT\n.. _Documentation: http://cihai.readthedocs.org/en/latest/\n.. _API: http://cihai.readthedocs.org/en/latest/api.html\n.. _Unihan: http://www.unicode.org/charts/unihan.html\n.. _datapackages: http://dataprotocols.org/data-packages/\n.. _datapackage.json format: https://github.com/datasets/gdp/blob/master/datapackage.json\n.. _json table schema: http://dataprotocols.org/json-table-schema/\n.. _simple data format: http://data.okfn.org/standards/simple-data-format\n.. _cihai dataset API: http://cihai.readthedocs.org/en/latest/extending.html\n.. _PEP 301\\: python package format: http://www.python.org/dev/peps/pep-0301/\n\n.. |pypi| image:: https://img.shields.io/pypi/v/cihaidata-unihan.svg\n :alt: Python Package\n :target: http://badge.fury.io/py/cihaidata-unihan\n\n.. |build-status| image:: https://img.shields.io/travis/cihai/cihaidata-unihan.svg\n :alt: Build Status\n :target: https://travis-ci.org/cihai/cihaidata-unihan\n\n.. |coverage| image:: https://codecov.io/gh/cihai/cihaidata-unihan/branch/master/graph/badge.svg\n :alt: Code Coverage\n :target: https://codecov.io/gh/cihai/cihaidata-unihan\n\n.. |license| image:: https://img.shields.io/github/license/cihai/cihaidata-unihan.svg\n :alt: License \n\n.. |docs| image:: https://readthedocs.org/projects/cihaidata-unihan/badge/?version=latest\n :alt: Documentation Status\n :scale: 100%\n :target: https://readthedocs.org/projects/cihaidata-unihan/", "description_content_type": null, "docs_url": null, "download_url": "https://pypi.python.org/pypi/cihaidata-unihan", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://cihaidata-unihan.git-pull.com", "keywords": "cihaidata-unihan", "license": "MIT", "maintainer": "", "maintainer_email": "", "name": "cihaidata-unihan", "package_url": "https://pypi.org/project/cihaidata-unihan/", "platform": "", "project_url": "https://pypi.org/project/cihaidata-unihan/", "project_urls": { "Download": "https://pypi.python.org/pypi/cihaidata-unihan", "Homepage": "https://cihaidata-unihan.git-pull.com" }, "release_url": "https://pypi.org/project/cihaidata-unihan/0.4.2/", "requires_dist": null, "requires_python": "", "summary": "Tool to build UNIHAN dataset into datapackage / simple data format.", "version": "0.4.2" }, "last_serial": 2858194, "releases": { "0.4.0": [ { "comment_text": "", "digests": { "md5": "e24934dadf197b6e653c85798c0cb13e", "sha256": "a551f396235bc4fed682d07de74f46865ead1793d53039f24bf15d85a98792c3" }, "downloads": -1, "filename": "cihaidata-unihan-0.4.0.tar.gz", "has_sig": false, "md5_digest": "e24934dadf197b6e653c85798c0cb13e", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 11590, "upload_time": "2017-05-08T01:27:37", "url": "https://files.pythonhosted.org/packages/99/03/3f291c1828a33a1cae2170d0b01811e9de0ba10a131d61a8faa5575f6d63/cihaidata-unihan-0.4.0.tar.gz" } ], "0.4.1": [ { "comment_text": "", "digests": { "md5": "1d7817551cd9771b5e3b4fc73d08441c", "sha256": "733372f53b69f87a1f30b2d5eb6b2d0c6bab3693bad69ef70a631b55b506708e" }, "downloads": -1, "filename": "cihaidata-unihan-0.4.1.tar.gz", "has_sig": false, "md5_digest": "1d7817551cd9771b5e3b4fc73d08441c", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 11707, "upload_time": "2017-05-08T01:53:31", "url": "https://files.pythonhosted.org/packages/00/cd/79e8b9c930f108ec548bd25c96a5c7e448fdef5a475c410dc69850dd20f2/cihaidata-unihan-0.4.1.tar.gz" } ], "0.4.2": [ { "comment_text": "", "digests": { "md5": "7421b08089bb7dc28d4fbef07a8ad3c3", "sha256": "f513cde270acdb165087ffc673aab857f498acb174bb28cfcc9f82b9f3cb5d5e" }, "downloads": -1, "filename": "cihaidata-unihan-0.4.2.tar.gz", "has_sig": false, "md5_digest": "7421b08089bb7dc28d4fbef07a8ad3c3", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 11730, "upload_time": "2017-05-08T03:08:32", "url": "https://files.pythonhosted.org/packages/d4/e6/0534af69077b5ebf3e8ab2f8a383263ce42e72267388592735d7139b4d74/cihaidata-unihan-0.4.2.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "7421b08089bb7dc28d4fbef07a8ad3c3", "sha256": "f513cde270acdb165087ffc673aab857f498acb174bb28cfcc9f82b9f3cb5d5e" }, "downloads": -1, "filename": "cihaidata-unihan-0.4.2.tar.gz", "has_sig": false, "md5_digest": "7421b08089bb7dc28d4fbef07a8ad3c3", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 11730, "upload_time": "2017-05-08T03:08:32", "url": "https://files.pythonhosted.org/packages/d4/e6/0534af69077b5ebf3e8ab2f8a383263ce42e72267388592735d7139b4d74/cihaidata-unihan-0.4.2.tar.gz" } ] }