{ "info": { "author": "DevRoss", "author_email": "devross1997@gmail.com", "bugtrack_url": null, "classifiers": [ "Development Status :: 4 - Beta", "Operating System :: OS Independent", "Programming Language :: Python :: 2", "Programming Language :: Python :: 3", "Programming Language :: Python :: Implementation :: PyPy" ], "description": "# bert_slot_tokenizer\n\nVersion 0.2\n\n![Travis (.org)](https://img.shields.io/travis/DevRoss/bert-slot-tokenizer) ![GitHub](https://img.shields.io/github/license/devross/bert-slot-tokenizer)\n\n**bert_slot_tokenizer** is a tool that converts the slots in a slot filling task into other formats\n\n## Environment:\n\n- Python 3\n- Python 2\n\n## Installation:\n\n```shell\npip install bert-slot-tokenizer\n```\n\n## Supported formats:\n\n- [IOB format](https://en.wikipedia.org/wiki/Inside\u2013outside\u2013beginning_(tagging))\n\n## Usage:\n\n```python\nfrom bert_slot_tokenizer import SlotConverter\nvocab_path = 'tests/test_data/example_vocab.txt' \n# you can find an example here --> https://github.com/DevRoss/bert-slot-tokenizer/blob/master/tests/test_data/example_vocab.txt\nsc = SlotConverter(vocab_path, do_lower_case=True)\ntext = 'Too YOUNG, too simple, sometimes naive! 
\u86e4\u86e4+1s'\nslot = {'name': '\u86e4\u86e4', 'time': '+1s'}\noutput_text, iob_slot = sc.convert2iob(text, slot)\nprint(output_text)\n# ['too', 'young', ',', 'too', 'simple', ',', 'some', '##times', 'na', '##ive', '!', '\u86e4', '\u86e4', '+', '1', '##s']\nprint(iob_slot)\n# ['O', 'O', 'O', 'O', 'O', 'O', 'O', 'O', 'O', 'O', 'O', 'B-name', 'I-name', 'B-time', 'I-time', 'I-time']\n```\n\n## Closing notes:\n\nThanks to BERT for advancing the field of NLP\n\nThanks to open source\n\nPRs and issues are welcome\n\nContact: devross1997@gmail.com\n\n", "description_content_type": "text/markdown", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/DevRoss/bert-slot-tokenizer", "keywords": "bert_slot_tokenizer,bert,slot filling", "license": "License :: OSI Approved :: Apache Software License", "maintainer": "", "maintainer_email": "", "name": "bert-slot-tokenizer", "package_url": "https://pypi.org/project/bert-slot-tokenizer/", "platform": "", "project_url": "https://pypi.org/project/bert-slot-tokenizer/", "project_urls": { "Homepage": "https://github.com/DevRoss/bert-slot-tokenizer" }, "release_url": "https://pypi.org/project/bert-slot-tokenizer/0.2.1/", "requires_dist": null, "requires_python": ">=2.7", "summary": "A tool for converting raw text to slot", "version": "0.2.1" }, "last_serial": 5766265, "releases": { "0.1": [ { "comment_text": "", "digests": { "md5": "d704dc4604805249e7eec3efeb2e0a06", "sha256": "e530147d33e1b5d19a3186a438cfb5644f568083901d95572f700aa5748d64ad" }, "downloads": -1, "filename": "bert_slot_tokenizer-0.1-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "d704dc4604805249e7eec3efeb2e0a06", "packagetype": "bdist_wheel", "python_version": "py2.py3", "requires_python": ">=3", "size": 8719, "upload_time": "2019-07-31T03:52:45", "url": 
"https://files.pythonhosted.org/packages/69/b5/ee5f567cca1081fbaf5d45604fd284db10bbc0629f061015b6b1ffabd5d4/bert_slot_tokenizer-0.1-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "f56bbbcb46d5a32d9a9a2f31dec632cc", "sha256": "3673f7402d96df3352102db5110865532f50990520efaf75fd2701253f02c8b1" }, "downloads": -1, "filename": "bert_slot_tokenizer-0.1.tar.gz", "has_sig": false, "md5_digest": "f56bbbcb46d5a32d9a9a2f31dec632cc", "packagetype": "sdist", "python_version": "source", "requires_python": ">=3", "size": 7544, "upload_time": "2019-07-31T03:52:47", "url": "https://files.pythonhosted.org/packages/85/98/64ce1d1ef5ea53c5669d19a3d1233492576ca4e17b0fd4742a774680a657/bert_slot_tokenizer-0.1.tar.gz" } ], "0.1.1": [ { "comment_text": "", "digests": { "md5": "d2f53ffec04f777f6c1abfa570a4aafe", "sha256": "ec60369ab8580e52ada1608ea53bc7e86d9a957c9ce94b95b4e94fcbcc660a8f" }, "downloads": -1, "filename": "bert_slot_tokenizer-0.1.1-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "d2f53ffec04f777f6c1abfa570a4aafe", "packagetype": "bdist_wheel", "python_version": "py2.py3", "requires_python": ">=3", "size": 9157, "upload_time": "2019-07-31T05:02:31", "url": "https://files.pythonhosted.org/packages/32/6a/975e4a84b8f5dbb40fce13f12c1394a08aa4b1cdbc7a66c497d1b407854b/bert_slot_tokenizer-0.1.1-py2.py3-none-any.whl" } ], "0.2.0": [ { "comment_text": "", "digests": { "md5": "f9258e011811e2a8599648886fc75664", "sha256": "45874e13c80ec1094a83d4d9781a7125abbc350578cea599defa1b0a62902beb" }, "downloads": -1, "filename": "bert_slot_tokenizer-0.2.0-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "f9258e011811e2a8599648886fc75664", "packagetype": "bdist_wheel", "python_version": "py2.py3", "requires_python": ">=2.7", "size": 13102, "upload_time": "2019-09-01T06:22:11", "url": "https://files.pythonhosted.org/packages/d4/53/eb4923e54609b1cc84de77c1361bd76c5970b6b39b7656a94f33a4187dc1/bert_slot_tokenizer-0.2.0-py2.py3-none-any.whl" }, { "comment_text": "", 
"digests": { "md5": "f286e09099758f527143a81f5271c61f", "sha256": "f63aae491be5dd1574c1df3c46cbefbb43d8a6953991df425353923df15eeea9" }, "downloads": -1, "filename": "bert_slot_tokenizer-0.2.0.tar.gz", "has_sig": false, "md5_digest": "f286e09099758f527143a81f5271c61f", "packagetype": "sdist", "python_version": "source", "requires_python": ">=2.7", "size": 11911, "upload_time": "2019-09-01T06:22:13", "url": "https://files.pythonhosted.org/packages/16/db/ec92e4b77ee0098a89f1e5e91e7af8a60ac9a02dc89e0863b4e9b290efba/bert_slot_tokenizer-0.2.0.tar.gz" } ], "0.2.1": [ { "comment_text": "", "digests": { "md5": "a9eeb23828d5b51d92f6bb4691a35033", "sha256": "c1bba9061c6fcbdb0174f871e1f9f59a9602ed9bd1f46f27dc9f7d88e9f1ff7a" }, "downloads": -1, "filename": "bert_slot_tokenizer-0.2.1-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "a9eeb23828d5b51d92f6bb4691a35033", "packagetype": "bdist_wheel", "python_version": "py2.py3", "requires_python": ">=2.7", "size": 13123, "upload_time": "2019-09-01T06:44:00", "url": "https://files.pythonhosted.org/packages/ea/41/86bf89b1561ec0c8c93cb99f69fe0b54fe89d135ac985f004939094b9260/bert_slot_tokenizer-0.2.1-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "134472fcad2be020525f7ab6064b2b93", "sha256": "3563c048499f730aa25a7b15deb8bad1da2d654818e083f308acb685f93cf012" }, "downloads": -1, "filename": "bert_slot_tokenizer-0.2.1.tar.gz", "has_sig": false, "md5_digest": "134472fcad2be020525f7ab6064b2b93", "packagetype": "sdist", "python_version": "source", "requires_python": ">=2.7", "size": 11905, "upload_time": "2019-09-01T06:44:02", "url": "https://files.pythonhosted.org/packages/3e/0c/751e707569a76949a0d8c00fbef914b0d9e673d1b20c653cbb6ce20c2e43/bert_slot_tokenizer-0.2.1.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "a9eeb23828d5b51d92f6bb4691a35033", "sha256": "c1bba9061c6fcbdb0174f871e1f9f59a9602ed9bd1f46f27dc9f7d88e9f1ff7a" }, "downloads": -1, "filename": 
"bert_slot_tokenizer-0.2.1-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "a9eeb23828d5b51d92f6bb4691a35033", "packagetype": "bdist_wheel", "python_version": "py2.py3", "requires_python": ">=2.7", "size": 13123, "upload_time": "2019-09-01T06:44:00", "url": "https://files.pythonhosted.org/packages/ea/41/86bf89b1561ec0c8c93cb99f69fe0b54fe89d135ac985f004939094b9260/bert_slot_tokenizer-0.2.1-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "134472fcad2be020525f7ab6064b2b93", "sha256": "3563c048499f730aa25a7b15deb8bad1da2d654818e083f308acb685f93cf012" }, "downloads": -1, "filename": "bert_slot_tokenizer-0.2.1.tar.gz", "has_sig": false, "md5_digest": "134472fcad2be020525f7ab6064b2b93", "packagetype": "sdist", "python_version": "source", "requires_python": ">=2.7", "size": 11905, "upload_time": "2019-09-01T06:44:02", "url": "https://files.pythonhosted.org/packages/3e/0c/751e707569a76949a0d8c00fbef914b0d9e673d1b20c653cbb6ce20c2e43/bert_slot_tokenizer-0.2.1.tar.gz" } ] }