{ "info": { "author": "Chris Fournier", "author_email": "chris.m.fournier@gmail.com", "bugtrack_url": null, "classifiers": [ "Development Status :: 5 - Production/Stable", "Environment :: Console", "Intended Audience :: Developers", "Intended Audience :: Science/Research", "License :: OSI Approved :: BSD License", "Natural Language :: English", "Operating System :: OS Independent", "Programming Language :: Python", "Programming Language :: Python :: 2.6", "Programming Language :: Python :: 2.7", "Programming Language :: Python :: 3.2", "Programming Language :: Python :: 3.3", "Topic :: Scientific/Engineering :: Artificial Intelligence", "Topic :: Scientific/Engineering :: Information Analysis", "Topic :: Text Processing", "Topic :: Utilities" ], "description": "SegEval v2.0\n============\n\n.. image:: https://travis-ci.org/cfournie/segmentation.evaluation.png?branch=master\n\t:target: https://travis-ci.org/cfournie/segmentation.evaluation\n.. image:: https://coveralls.io/repos/cfournie/segmentation.evaluation/badge.png?branch=master\n\t:target: https://coveralls.io/r/cfournie/segmentation.evaluation?branch=master\n.. image:: https://pypip.in/v/segeval/badge.png\n :target: https://crate.io/packages/segeval/\n.. image:: https://pypip.in/d/segeval/badge.png\n\t:target: https://crate.io/packages/segeval/\n\n|\n\nText segmentation is the task of splitting up any amount of text into segments by placing boundaries between some atomic unit (e.g., morphemes, words, lines, sentences, paragraphs, sections, etc.). It's a common pre-processing step in many `Natural Language Processing (NLP) `_ tasks.\n\nThis package is a collection of metrics for comparing text segmentations and evaluating automatic text segmenters. 
Both new (**Boundary Similarity**, **Segmentation Similarity**) and traditional (**WindowDiff**, **Pk**) metrics are included, as well as inter-coder agreement coefficients and confusion matrices based upon a boundary edit distance.\n\nTo see some examples of its usage, `read the docs `_.\n\n\nFeature Support\n---------------\nIncluded is a variety of segmentation comparison metrics, including:\n\n* Boundary Edit Distance (BED)\n* Boundary Similarity (B)\n* BED-based confusion matrices (and precision/recall/F1)\n* Segmentation Similarity (S)\n* WindowDiff\n* Pk\n\nAdditionally, B-based inter-coder agreement coefficients for segmentation that are suitable for 2 or more coders are provided, including:\n\n* Fleiss' Pi (i.e., Siegel and Castellan's K)\n* Fleiss' Kappa\n\n\nInstallation\n------------\n\nTo install SegEval, simply run:\n\n.. code-block:: bash\n\n $ pip install segeval\n\n\nDocumentation\n-------------\n\nDocumentation is available at http://segeval.readthedocs.org/.\n\n\nCiting SegEval\n--------------\nIf you're using this software for research, please cite the `ACL paper `_ [PDF] and, if you need to go into details, the `thesis `_ [PDF] describing this work:\n\n- *Chris Fournier. 2013. Evaluating Text Segmentation using Boundary Edit Distance. Proceedings of 51st Annual Meeting of the Association for Computational Linguistics. (ACL 2013), to appear. Association for Computational Linguistics, Stroudsburg, PA, USA.*\n\n- *Chris Fournier. 2013. Evaluating Text Segmentation. (Master's thesis). University of Ottawa.*\n\nBibTeX:\n\n.. 
code-block:: latex\n\n\t@inproceedings{Fournier2013a,\n\t\tauthor\t\t= {Fournier, Chris},\n\t\tyear\t\t= {2013},\n\t\ttitle\t\t= {{Evaluating Text Segmentation using Boundary Edit Distance}},\n\t\tbooktitle\t= {Proceedings of 51st Annual Meeting of the Association for Computational Linguistics},\n\t\tpublisher\t= {Association for Computational Linguistics},\n\t\tlocation\t= {Sofia, Bulgaria},\n\t\tpages\t\t= {to appear},\n\t\taddress\t\t= {Stroudsburg, PA, USA}\n\t}\n\n\t@mastersthesis{Fournier2013b,\n\t\tauthor\t\t= {Fournier, Chris},\n\t\ttitle\t\t= {Evaluating Text Segmentation},\n\t\tschool\t\t= {University of Ottawa},\n\t\tyear\t\t= {2013}\n\t}", "description_content_type": null, "docs_url": "https://pythonhosted.org/segeval/", "download_url": "UNKNOWN", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "http://segeval.readthedocs.org/", "keywords": "segmentation,similarity,discourse", "license": "Copyright (c) 2011-2013 Chris Fournier\n\nAll rights reserved.\n\nRedistribution and use in source and binary forms, with or without\nmodification, are permitted provided that the following conditions are met:\n\n * Redistributions of source code must retain the above copyright\n notice, this list of conditions and the following disclaimer.\n * Redistributions in binary form must reproduce the above copyright\n notice, this list of conditions and the following disclaimer in the\n documentation and/or other materials provided with the distribution.\n * Neither the name of the author nor the names of its contributors may\n be used to endorse or promote products derived from this software\n without specific prior written permission.\n\nTHIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS \"AS IS\"\nAND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE\nIMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE\nDISCLAIMED. 
IN NO EVENT SHALL THE COPYRIGHT HOLDER OR CONTRIBUTORS BE LIABLE\nFOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL\nDAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR\nSERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER\nCAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY,\nOR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE\nOF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.", "maintainer": null, "maintainer_email": null, "name": "segeval", "package_url": "https://pypi.org/project/segeval/", "platform": "Any", "project_url": "https://pypi.org/project/segeval/", "project_urls": { "Download": "UNKNOWN", "Homepage": "http://segeval.readthedocs.org/" }, "release_url": "https://pypi.org/project/segeval/2.0.11/", "requires_dist": null, "requires_python": null, "summary": "A package providing text segmentation evaluation metrics and utilities", "version": "2.0.11" }, "last_serial": 950641, "releases": { "1.0.1 beta": [ { "comment_text": "", "digests": { "md5": "38b8a0ff55751151fa71d26404e697d7", "sha256": "3878be8897658b203491a742fc2c18e0faf42b3a38180b149ba235cf60804cdc" }, "downloads": -1, "filename": "segeval-1.0.1 beta.tar.gz", "has_sig": false, "md5_digest": "38b8a0ff55751151fa71d26404e697d7", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 43133, "upload_time": "2012-06-04T08:02:07", "url": "https://files.pythonhosted.org/packages/fc/67/a72131d37c0498125b64a9300455700c012bf911d190c8e5b5656821c16d/segeval-1.0.1%20beta.tar.gz" } ], "1.1 beta": [ { "comment_text": "", "digests": { "md5": "044034fb543edd66d443974e71c318b1", "sha256": "e70e5059391de66fe9fd47afc99b354450bbc18112759e9a0ab2eb7102d62fd4" }, "downloads": -1, "filename": "segeval-1.1 beta.tar.gz", "has_sig": false, "md5_digest": "044034fb543edd66d443974e71c318b1", "packagetype": "sdist", "python_version": "source", "requires_python": 
null, "size": 49846, "upload_time": "2012-08-09T22:51:25", "url": "https://files.pythonhosted.org/packages/b4/1b/4bb76f255bda24203dddf163d8f7b6d487ef88b19b620b3aa2aa4a9f66e3/segeval-1.1%20beta.tar.gz" }, { "comment_text": "", "digests": { "md5": "1755094dfb30bde1818bab5fb9770741", "sha256": "8232b97f4e1b2d203d1a5717a34fb810569baf66b926620096cd35d2ad5613e1" }, "downloads": -1, "filename": "segeval-1.1 beta.zip", "has_sig": false, "md5_digest": "1755094dfb30bde1818bab5fb9770741", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 109216, "upload_time": "2012-08-09T22:51:28", "url": "https://files.pythonhosted.org/packages/15/19/d442db7a9075f86ce751fac5b30473d512008878d191506c3578bc5c9396/segeval-1.1%20beta.zip" } ], "2.0": [ { "comment_text": "", "digests": { "md5": "395e3b47327ffb41b351da9c49e93896", "sha256": "eeb58805e28adaf13c587338c1c119a466df8edbcb05c0a3e6c6b6a3f7b58f41" }, "downloads": -1, "filename": "segeval-2.0.tar.gz", "has_sig": false, "md5_digest": "395e3b47327ffb41b351da9c49e93896", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 37416, "upload_time": "2013-06-14T02:03:16", "url": "https://files.pythonhosted.org/packages/a9/90/5a40d2cdf1ffba2276949a632732b643c90a42d70377ab9c5390747f5447/segeval-2.0.tar.gz" } ], "2.0.1": [ { "comment_text": "", "digests": { "md5": "0eecdfd576fc243eafc8997bb8b2be9a", "sha256": "f355f5423c0ba2ebec2338da560a80710937e72ab863cb1aba63c682d28deab0" }, "downloads": -1, "filename": "segeval-2.0.1.tar.gz", "has_sig": false, "md5_digest": "0eecdfd576fc243eafc8997bb8b2be9a", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 37418, "upload_time": "2013-06-16T03:39:16", "url": "https://files.pythonhosted.org/packages/a5/14/fd6d1f7b8bb0aa248136643778c7b6a262de35c7ef3fc84405bdfd624fc2/segeval-2.0.1.tar.gz" } ], "2.0.10": [ { "comment_text": "", "digests": { "md5": "308f7a11da3aeaa41152202a59b8fd52", "sha256": 
"65f90d9cd53e1f023d72df346e94042c77dbca0ca1c010895d698692396a10a5" }, "downloads": -1, "filename": "segeval-2.0.10.tar.gz", "has_sig": false, "md5_digest": "308f7a11da3aeaa41152202a59b8fd52", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 36241, "upload_time": "2013-10-02T01:29:36", "url": "https://files.pythonhosted.org/packages/07/84/424649668fac59c9c5b64228e828b7a50518985cb3ff15b99693500b2654/segeval-2.0.10.tar.gz" } ], "2.0.11": [ { "comment_text": "", "digests": { "md5": "0ad78aa3239eca35069bc5578fb229b3", "sha256": "a13babad83462b954ae6ccfb698bdac050c2231bfb551c2838ae4b70fa1fd5af" }, "downloads": -1, "filename": "segeval-2.0.11.tar.gz", "has_sig": false, "md5_digest": "0ad78aa3239eca35069bc5578fb229b3", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 39822, "upload_time": "2013-12-21T21:08:12", "url": "https://files.pythonhosted.org/packages/3a/81/a3022624a621fe29d9ef2fddaadf637b42d005c233b4046e6b5ab8173312/segeval-2.0.11.tar.gz" } ], "2.0.2": [ { "comment_text": "", "digests": { "md5": "6e15391adc24dd97365c8369d92eac30", "sha256": "6166de64b290d07e73e041473119b0a3e2bc249b979029e045b92fe0a7f10ff6" }, "downloads": -1, "filename": "segeval-2.0.2.tar.gz", "has_sig": false, "md5_digest": "6e15391adc24dd97365c8369d92eac30", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 37334, "upload_time": "2013-07-01T01:23:05", "url": "https://files.pythonhosted.org/packages/ce/4d/a0209af3ee1f06a56ffd35124771c53fc6e909422c676c9aabfc9ad2d01c/segeval-2.0.2.tar.gz" } ], "2.0.3": [ { "comment_text": "", "digests": { "md5": "7c54963201771ff1e1b9f862fbe27896", "sha256": "599b423aa6e5b72f76a3716f584ba5fc781080d2f5e623104862ca2b6d41da94" }, "downloads": -1, "filename": "segeval-2.0.3.tar.gz", "has_sig": false, "md5_digest": "7c54963201771ff1e1b9f862fbe27896", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 12088, "upload_time": 
"2013-07-03T21:24:37", "url": "https://files.pythonhosted.org/packages/c2/73/ae6b82a6ce503274dc1024b8cd6711c8c90d2cc843cbdf22e1088ee861e4/segeval-2.0.3.tar.gz" } ], "2.0.4": [ { "comment_text": "", "digests": { "md5": "b60e266664dad4ba2415e2635b650dd8", "sha256": "3ecc7087d31e8a3ce3c14cd4bde6ad67749651fae11eafa539b6c0bf7577e391" }, "downloads": -1, "filename": "segeval-2.0.4.tar.gz", "has_sig": false, "md5_digest": "b60e266664dad4ba2415e2635b650dd8", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 12042, "upload_time": "2013-07-03T22:53:41", "url": "https://files.pythonhosted.org/packages/21/10/2830e53bec8f0a2b8de5cb0cd48ab493e1b9a27618745859903ed2cb58e8/segeval-2.0.4.tar.gz" } ], "2.0.5": [ { "comment_text": "", "digests": { "md5": "f33787996b2754a268aa500474e249c9", "sha256": "3c611ba457b7cf7e64a0f49dc554c7ad2f47546dd1a2875a04f821dbaf0c867a" }, "downloads": -1, "filename": "segeval-2.0.5.tar.gz", "has_sig": false, "md5_digest": "f33787996b2754a268aa500474e249c9", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 12011, "upload_time": "2013-07-03T23:17:59", "url": "https://files.pythonhosted.org/packages/49/b1/d36b9ed28be1b588efbc14abd993b0fa6aa52375d718f77a42dd8a538afb/segeval-2.0.5.tar.gz" } ], "2.0.6": [ { "comment_text": "", "digests": { "md5": "7e7bb57e65e33a55301450560d1d7b30", "sha256": "5ff512a11e2484ade4f8bb969ad784a0eacd6a082328eee4d0b70186693fecbc" }, "downloads": -1, "filename": "segeval-2.0.6.tar.gz", "has_sig": false, "md5_digest": "7e7bb57e65e33a55301450560d1d7b30", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 37450, "upload_time": "2013-07-04T06:52:26", "url": "https://files.pythonhosted.org/packages/e9/7f/da3c114c47e50361403d9c782e1137267f13401e1480a61ba09a3dab7d13/segeval-2.0.6.tar.gz" } ], "2.0.7": [ { "comment_text": "", "digests": { "md5": "39b589145071ac2e7db2cb7797d02214", "sha256": 
"fcb2498714e028578cbd12adf86968a2f88034a5d44f19d74a3bf339be41b7f8" }, "downloads": -1, "filename": "segeval-2.0.7.tar.gz", "has_sig": false, "md5_digest": "39b589145071ac2e7db2cb7797d02214", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 37537, "upload_time": "2013-07-05T23:33:09", "url": "https://files.pythonhosted.org/packages/fb/d6/725bf89c615855780721d652f29276dde8daf0929199fa8f2f928a180da2/segeval-2.0.7.tar.gz" } ], "2.0.8": [ { "comment_text": "", "digests": { "md5": "220ab42751c437b99cd5b71cce39f529", "sha256": "4f9469c6cad0d5ddbb1dd4fb57fe5fc4e79ad86632fdc97e90d1c884b842e1b7" }, "downloads": -1, "filename": "segeval-2.0.8.tar.gz", "has_sig": false, "md5_digest": "220ab42751c437b99cd5b71cce39f529", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 38321, "upload_time": "2013-07-06T04:22:57", "url": "https://files.pythonhosted.org/packages/df/f8/0c518a3161fd65e37b7cd40ee44a959ae318834c2f1546db568f894e6335/segeval-2.0.8.tar.gz" } ], "2.0.9": [ { "comment_text": "", "digests": { "md5": "345ddb251e1f01e7d1ef0e73a888d697", "sha256": "16237cb0ace752ceae76ad903898632b5142a2e81f3cfe31a0e571d8037973dd" }, "downloads": -1, "filename": "segeval-2.0.9.tar.gz", "has_sig": false, "md5_digest": "345ddb251e1f01e7d1ef0e73a888d697", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 35911, "upload_time": "2013-09-11T18:27:33", "url": "https://files.pythonhosted.org/packages/5a/6b/0eda61dbc0e28cf1e01fc5cf50ad4bf197b5cd4bcd1ec227dc2cd4df49d3/segeval-2.0.9.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "0ad78aa3239eca35069bc5578fb229b3", "sha256": "a13babad83462b954ae6ccfb698bdac050c2231bfb551c2838ae4b70fa1fd5af" }, "downloads": -1, "filename": "segeval-2.0.11.tar.gz", "has_sig": false, "md5_digest": "0ad78aa3239eca35069bc5578fb229b3", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 39822, "upload_time": 
"2013-12-21T21:08:12", "url": "https://files.pythonhosted.org/packages/3a/81/a3022624a621fe29d9ef2fddaadf637b42d005c233b4046e6b5ab8173312/segeval-2.0.11.tar.gz" } ] }