{ "info": { "author": "Marc Lodewijck", "author_email": "mlodewijck@gmail.com", "bugtrack_url": null, "classifiers": [ "Development Status :: 5 - Production/Stable", "Intended Audience :: Developers", "License :: OSI Approved :: MIT License", "Programming Language :: Python :: 3.3", "Programming Language :: Python :: 3.4", "Programming Language :: Python :: 3.5", "Programming Language :: Python :: 3.6", "Programming Language :: Python :: 3.7", "Programming Language :: Python :: 3 :: Only", "Topic :: Text Processing :: Filters", "Topic :: Utilities" ], "description": "``unicode-charnames``\n=====================\n\nUnicode characters have names that serve as unique identifiers for each character. The character names in the Unicode Standard are identical to those of ISO/IEC 10646.\n\nThe unicode-charnames package performs searches for Unicode character names or code point labels by Unicode character, and searches for Unicode code points by character names. It also performs substring searches in Unicode character names. This package supports version 12.1 of the Unicode Standard (137,929 characters).\n\nThe generic term \"character name\" refers to the Unicode character \"Name\" property value for an encoded Unicode character. For code points that do not have character names (unassigned, reserved code points and other special code point types), the Unicode Standard uses constructed Unicode code point labels, displayed between angle brackets, to stand in for character names.\n\nFeatures\n--------\n\nThe library provides:\n\n* A function to get the character name (the normative character property \"Name\") or the code point label (for characters that do not have character names) of a single Unicode character.\n* A function to get the code point value (in the usual 4- to 6-digit hexadecimal format) corresponding to a Unicode character name; the search is case-sensitive and requires exact string match.\n* A function to search characters by character name; the search is case-insensitive but requires exact substring match.\n\nExample usage::\n\n # -*- coding: utf-8 -*-\n\n from unicode_charnames import (\n charname,\n codepoint,\n search_charnames\n )\n\n # charname()\n print('charname():\\n')\n print(charname('\u9fa0'))\n print(charname('\\U0001F60A'))\n print(charname('\\u00E5'))\n print(charname('\\u0002'))\n\n # codepoint()\n print('\\ncodepoint():\\n')\n print(codepoint('LATIN CAPITAL LETTER E WITH ACUTE'))\n print(codepoint('SUPERCALIFRAGILISTICEXPIALIDOCIOUS'))\n print(codepoint('SQUARE ERA NAME REIWA'))\n\n # search_charnames()\n print('\\nsearch_charnames():\\n')\n for x in search_charnames('era name'):\n print('\\t'.join(x))\n\nWill produce the following output::\n\n charname():\n\n CJK UNIFIED IDEOGRAPH-9FA0\n SMILING FACE WITH SMILING EYES\n LATIN SMALL LETTER A WITH RING ABOVE\n \n\n codepoint():\n\n 00C9\n None\n 32FF\n\n search_charnames():\n\n 32FF\tSQUARE ERA NAME REIWA\n 337B\tSQUARE ERA NAME HEISEI\n 337C\tSQUARE ERA NAME SYOUWA\n 337D\tSQUARE ERA NAME TAISYOU\n 337E\tSQUARE ERA NAME MEIZI\n\nReferences\n----------\n\n* https://www.unicode.org/versions/Unicode12.1.0/ch04.pdf#M9.40526.Heading.48.NameNormative\n* https://www.unicode.org/Public/12.1.0/ucd/UnicodeData.txt\n* https://www.unicode.org/Public/12.1.0/ucd/extracted/DerivedName.txt\n\nLicense\n-------\n\nunicode-charnames is released under an MIT license. The full text of the license is available `here `_.\n\nThe Unicode Standard v12.1.0 *DerivedName.txt* file is licensed under the Unicode License Agreement for Data Files and Software. Please consult the `UNICODE, INC. LICENSE AGREEMENT `_ prior to use.", "description_content_type": "", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/mlodewijck/unicode_charnames", "keywords": "Unicode,character names", "license": "MIT", "maintainer": "", "maintainer_email": "", "name": "unicode-charnames", "package_url": "https://pypi.org/project/unicode-charnames/", "platform": "", "project_url": "https://pypi.org/project/unicode-charnames/", "project_urls": { "Bug Reports": "https://github.com/mlodewijck/unicode_charnames/issues", "Homepage": "https://github.com/mlodewijck/unicode_charnames", "Source": "https://github.com/mlodewijck/unicode_charnames/" }, "release_url": "https://pypi.org/project/unicode-charnames/12.1.0.post1/", "requires_dist": null, "requires_python": ">=3.3", "summary": "Look up Unicode character name or code point label and search in Unicode character names", "version": "12.1.0.post1" }, "last_serial": 5749390, "releases": { "12.1.0.post1": [ { "comment_text": "", "digests": { "md5": "e8c82f7334b16b54158f2818103dee67", "sha256": "2643b5a5bcb8b5f07187a437411e4f0d8ac7ab53069b51b535b266ef53c4599f" }, "downloads": -1, "filename": "unicode_charnames-12.1.0.post1.tar.gz", "has_sig": false, "md5_digest": "e8c82f7334b16b54158f2818103dee67", "packagetype": "sdist", "python_version": "source", "requires_python": ">=3.3", "size": 264494, "upload_time": "2019-08-29T06:46:55", "url": "https://files.pythonhosted.org/packages/9c/c2/760b269ed4a13b07f49da8cdaf3d7a4f9e2dbb7903ef17ca0ecf16b0708e/unicode_charnames-12.1.0.post1.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "e8c82f7334b16b54158f2818103dee67", "sha256": "2643b5a5bcb8b5f07187a437411e4f0d8ac7ab53069b51b535b266ef53c4599f" }, "downloads": -1, "filename": "unicode_charnames-12.1.0.post1.tar.gz", "has_sig": false, "md5_digest": "e8c82f7334b16b54158f2818103dee67", "packagetype": "sdist", "python_version": "source", "requires_python": ">=3.3", "size": 264494, "upload_time": "2019-08-29T06:46:55", "url": "https://files.pythonhosted.org/packages/9c/c2/760b269ed4a13b07f49da8cdaf3d7a4f9e2dbb7903ef17ca0ecf16b0708e/unicode_charnames-12.1.0.post1.tar.gz" } ] }