{ "info": { "author": "Travis Chen", "author_email": "teckiechen@gmail.com", "bugtrack_url": null, "classifiers": [ "Natural Language :: Chinese (Simplified)", "Natural Language :: Chinese (Traditional)", "Programming Language :: Python :: 2.7", "Programming Language :: Python :: 3.6" ], "description": "SegJb\n=====\n\n| Segmentation wrapper of `Jieba `__\n Chinese segmentation.\n| https://github.com/kn45/SegJb\n| ``pip install segjb`` (dependency: jieba) \n\n- | Lazy initialization.\n\n- | Initialization with user defined dict.\n\n- | Build-in stop-words dict, punctuations dict.\n\n- | Output control of stopwords, punctuations, minimum word length,\n output delimiters etc..\n\n- | Support ngram.\n\nAPI\n---\n\n| **init(stopwords\\_file, puncs\\_file, user\\_dict, silent, main\\_dict,\n thread)**\n| -- Initialize the segmentation utility instance.\n\n- | return: void.\n\n- | stopwords\\_file: stopword dictionary. Use \"\" to disable.\n [SegJb.DEFAULT\\_STPW]\n\n- | puncs\\_file: punctuation dictionary. Use \"\" to disable.\n [SegJb.DEFAULT\\_PUNC]\n\n- | user\\_dict: load user customized dictionary. Use \"\" to disable.\n [SegJb.DEFAULT\\_DICT]\n\n- | silent: whether print initializing log. [True]\n\n- thread: number of part to separate the corpus for parallel. [1]\n\n| **set\\_param(delim, min\\_word\\_len, ngram, keep\\_stopwords,\n keep\\_puncs)**\n| -- Set one or more parameters of the segmentation utility instance.\n Refer to parameter description.\n\n- | return: void\n | \n\n| **cut2list(corp)**\n| -- Cut a sentence to list due to configuration.\n\n- | return: list\n\n- | corp: unicode or utf8 sentence.\n | \n\n| **cut2str(corp)**\n| -- Cut a sentence to a delimeter(can be set by set\\_param) joined\n string.\n\n- | return: unicode string.\n\n- | corp: unicode or utf8 sentence.\n | \n\nParameters\n----------\n\n- | delim [' ']\n | the delimeter used to constuct the segmentation result in string.\n\n- | min\\_word\\_len [1]\n | word with length less than min\\_word\\_len will not in segmentation\n result.\n\n- | ngram [1]\n | result can be ngram.\n\n- | keep\\_stopwords [True]\n | whether to keep stopwords in result.\n\n- | keep\\_puncs [True]\n | whether to keep stopwords in result.\n\nExample:\n--------\n\n.. code:: python\n\n from segjb import SegJb\n hdl_seg = SegJb()\n hdl_seg.init()\n hdl_seg.set_param(delim=' ', ngram=2, keep_stopwords=True, keep_puncs=False)\n print hdl_seg.cut2str('\u8fd9\u662f\u4e00\u573a\u7cbe\u5f69\u7684\u6bd4\u8d5b')\n\nReference:\n----------\n\n- `Bigdict `__ from\n iLife(\\ 562193561@qq.com)\n", "description_content_type": "", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/kn45/SegJb", "keywords": "nlp", "license": "MIT", "maintainer": "", "maintainer_email": "", "name": "segjb", "package_url": "https://pypi.org/project/segjb/", "platform": "", "project_url": "https://pypi.org/project/segjb/", "project_urls": { "Homepage": "https://github.com/kn45/SegJb" }, "release_url": "https://pypi.org/project/segjb/1.0/", "requires_dist": null, "requires_python": "", "summary": "A wrapper for jieba segmentation", "version": "1.0" }, "last_serial": 4151508, "releases": { "0.1": [ { "comment_text": "", "digests": { "md5": "6397eadfd4d2ff03521d203fc0beb4a7", "sha256": "f8a36f5ca7093ee3aeaecb1d72aaa57b48ef11f1cb218d9d05598df65d9b1446" }, "downloads": -1, "filename": "segjb-0.1.tar.gz", "has_sig": false, "md5_digest": "6397eadfd4d2ff03521d203fc0beb4a7", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5896804, "upload_time": "2016-12-02T10:49:29", "url": "https://files.pythonhosted.org/packages/b3/3c/d6c0f5e6cce3bb7789a72e8480d76e24aa82463adf3ac14de86dc3bab710/segjb-0.1.tar.gz" } ], "0.1.1": [ { "comment_text": "", "digests": { "md5": "b65a7fe6de14854392e900a575b8c13a", "sha256": "2a46ed6175dea1656a9ca51ca59970a21d5d08a3ff7fd594595c2db3a06c30bd" }, "downloads": -1, "filename": "segjb-0.1.1.tar.gz", "has_sig": false, "md5_digest": "b65a7fe6de14854392e900a575b8c13a", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5896812, "upload_time": "2016-12-02T11:00:21", "url": "https://files.pythonhosted.org/packages/6e/67/b09477ea42c49900c998b62d7cbd04d47e7d94b96d0d5d4001c3860ff038/segjb-0.1.1.tar.gz" } ], "0.1.2": [ { "comment_text": "", "digests": { "md5": "29bba1a67ff6ea386de9ea55b62f1a7b", "sha256": "8df657fe0e2ae92379284d3f644943e5c383290294de7dd058e8386f9ebceabe" }, "downloads": -1, "filename": "segjb-0.1.2.tar.gz", "has_sig": false, "md5_digest": "29bba1a67ff6ea386de9ea55b62f1a7b", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5898574, "upload_time": "2016-12-02T11:35:00", "url": "https://files.pythonhosted.org/packages/e2/f0/60ad32e26c0e84cbb191147a26d950c6865dba1279db164e47968ec6034e/segjb-0.1.2.tar.gz" } ], "0.1.3": [ { "comment_text": "", "digests": { "md5": "798cf5279d11ee4d6cea7d7c6c9cc7e6", "sha256": "5008884bf52f6b51f7eb459afabdf2dd816f5cd15810f9e0f245b4b1577d215f" }, "downloads": -1, "filename": "segjb-0.1.3-py2-none-any.whl", "has_sig": false, "md5_digest": "798cf5279d11ee4d6cea7d7c6c9cc7e6", "packagetype": "bdist_wheel", "python_version": "2.7", "requires_python": null, "size": 6003445, "upload_time": "2016-12-08T05:41:09", "url": "https://files.pythonhosted.org/packages/fc/e2/727e9a9dcbcd4a97ba6f445d57018638501f88e07a30891c528400555171/segjb-0.1.3-py2-none-any.whl" }, { "comment_text": "", "digests": { "md5": "f9ed790ed9355f3d6ef1d454d95bdafb", "sha256": "73b4d1b8dbfa28daaf38ae0cd59c35fdb05da68143b11fbb3b7cf3eb17b55a43" }, "downloads": -1, "filename": "segjb-0.1.3.tar.gz", "has_sig": false, "md5_digest": "f9ed790ed9355f3d6ef1d454d95bdafb", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5898559, "upload_time": "2016-12-08T05:38:29", "url": "https://files.pythonhosted.org/packages/85/86/73ac0d886ea546e8bbb18eab7bcfa3cdb85036662a776fa22b8a2b01887f/segjb-0.1.3.tar.gz" } ], "0.1.3.post1": [ { "comment_text": "", "digests": { "md5": "b840ddce4c5425eeadc21a39331aece3", "sha256": "a77ccfb5d7f414d32d0296cdcb5bb11bdbb0e78790686b1077cdb249f79d111c" }, "downloads": -1, "filename": "segjb-0.1.3.post1-py2-none-any.whl", "has_sig": false, "md5_digest": "b840ddce4c5425eeadc21a39331aece3", "packagetype": "bdist_wheel", "python_version": "2.7", "requires_python": null, "size": 6003475, "upload_time": "2016-12-08T08:36:56", "url": "https://files.pythonhosted.org/packages/6b/a3/d6188d1e0b9f268b9d6abf6719ba806fd9fd70463ef465fddee520f7a30a/segjb-0.1.3.post1-py2-none-any.whl" }, { "comment_text": "", "digests": { "md5": "15e479aab14fe62c4cb7bb106c4f7b5a", "sha256": "77d8ec191156b87d29743dd665c8d99bc5cb226e25ae89b0be68a0d2354ac2ec" }, "downloads": -1, "filename": "segjb-0.1.3.post1.tar.gz", "has_sig": false, "md5_digest": "15e479aab14fe62c4cb7bb106c4f7b5a", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5898559, "upload_time": "2016-12-08T08:35:48", "url": "https://files.pythonhosted.org/packages/94/65/a6bac1ec6d311e591c9f174ea0b3335c9f1eadc07489c54f4a8266844a98/segjb-0.1.3.post1.tar.gz" } ], "0.2.0rc1": [ { "comment_text": "", "digests": { "md5": "b9c056862d8cd322d979102463bc4187", "sha256": "f2330d9b9a7496b7348650ffe35fcf470c532d1eb738ce7627a583a2ae1f99f2" }, "downloads": -1, "filename": "segjb-0.2.0rc1-py2-none-any.whl", "has_sig": false, "md5_digest": "b9c056862d8cd322d979102463bc4187", "packagetype": "bdist_wheel", "python_version": "2.7", "requires_python": null, "size": 6003558, "upload_time": "2016-12-11T15:03:21", "url": "https://files.pythonhosted.org/packages/4e/a4/69dc00f67ee0cd164511313a89e5a5bf63db742c85d58bd75d2904e29116/segjb-0.2.0rc1-py2-none-any.whl" }, { "comment_text": "", "digests": { "md5": "fbfbfab26a1ed3d36deb26d91f06472b", "sha256": "a7f2c53b67f5f198324be1c93857bb1f400cd9dea9c3d9560d83c072ca719266" }, "downloads": -1, "filename": "segjb-0.2.0rc1.tar.gz", "has_sig": false, "md5_digest": "fbfbfab26a1ed3d36deb26d91f06472b", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5898739, "upload_time": "2016-12-11T15:02:42", "url": "https://files.pythonhosted.org/packages/4e/bc/b915a045c73f596678c40cac8aa90543a39ffb3e0db34216ba54dfb6a1b0/segjb-0.2.0rc1.tar.gz" } ], "1.0": [ { "comment_text": "", "digests": { "md5": "b9c798e485e8e3ff723cb52899922a9a", "sha256": "fe3d89891aef3723f7edfdb214475c5dbbb2aeb9738a6bf9109261fc77962244" }, "downloads": -1, "filename": "segjb-1.0-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "b9c798e485e8e3ff723cb52899922a9a", "packagetype": "bdist_wheel", "python_version": "3.6", "requires_python": null, "size": 5999769, "upload_time": "2018-08-09T03:55:11", "url": "https://files.pythonhosted.org/packages/02/4a/99025719efafe7cff864f24be592a529593b279cbdec029a5bab4f1b4aef/segjb-1.0-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "6c6d0c2cfda9709e76da56a65ae95331", "sha256": "9547cf4bca2e9c6023c9ffd365e3a47988a2b1be1d1911231af8d87e6790e4b0" }, "downloads": -1, "filename": "segjb-1.0.tar.gz", "has_sig": false, "md5_digest": "6c6d0c2cfda9709e76da56a65ae95331", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5896016, "upload_time": "2018-08-09T03:45:18", "url": "https://files.pythonhosted.org/packages/1b/34/8b3cfeeda727afdfd99d95343a7fe40bab802f4852398b9ca1be751c9cee/segjb-1.0.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "b9c798e485e8e3ff723cb52899922a9a", "sha256": "fe3d89891aef3723f7edfdb214475c5dbbb2aeb9738a6bf9109261fc77962244" }, "downloads": -1, "filename": "segjb-1.0-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "b9c798e485e8e3ff723cb52899922a9a", "packagetype": "bdist_wheel", "python_version": "3.6", "requires_python": null, "size": 5999769, "upload_time": "2018-08-09T03:55:11", "url": "https://files.pythonhosted.org/packages/02/4a/99025719efafe7cff864f24be592a529593b279cbdec029a5bab4f1b4aef/segjb-1.0-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "6c6d0c2cfda9709e76da56a65ae95331", "sha256": "9547cf4bca2e9c6023c9ffd365e3a47988a2b1be1d1911231af8d87e6790e4b0" }, "downloads": -1, "filename": "segjb-1.0.tar.gz", "has_sig": false, "md5_digest": "6c6d0c2cfda9709e76da56a65ae95331", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5896016, "upload_time": "2018-08-09T03:45:18", "url": "https://files.pythonhosted.org/packages/1b/34/8b3cfeeda727afdfd99d95343a7fe40bab802f4852398b9ca1be751c9cee/segjb-1.0.tar.gz" } ] }