{ "info": { "author": "Robert Grigoroiu", "author_email": "robertgr991@yahoo.ro", "bugtrack_url": null, "classifiers": [ "Intended Audience :: Developers", "Intended Audience :: Education", "Intended Audience :: Science/Research", "Programming Language :: Cython", "Programming Language :: Python :: 2", "Programming Language :: Python :: 2.7", "Programming Language :: Python :: 3", "Programming Language :: Python :: 3.4", "Programming Language :: Python :: 3.5", "Programming Language :: Python :: 3.6", "Programming Language :: Python :: 3.7", "Topic :: Scientific/Engineering :: Bio-Informatics", "Topic :: Scientific/Engineering :: Information Analysis", "Topic :: Text Processing :: Linguistic" ], "description": "# fastDamerauLevenshtein\n\n[![Build Status](https://travis-ci.com/robertgr991/fastDamerauLevenshtein.svg?branch=master)](https://travis-ci.com/robertgr991/fastDamerauLevenshtein)\n[![Wheel Status](https://pypip.in/wheel/fastDamerauLevenshtein/badge.svg)](https://pypi.python.org/pypi/fastDamerauLevenshtein/)\n\nCython implementation of true Damerau-Levenshtein edit distance which allows one item to be edited more than once.\nMore information from [Wikipedia](http://en.wikipedia.org/wiki/Damerau%E2%80%93Levenshtein_distance):\n\n> In information theory and computer science, the Damerau-Levenshtein distance (named after Frederick J. Damerau and Vladimir I. Levenshtein) is a string metric for measuring the edit distance between two sequences. Informally, the Damerau-Levenshtein distance between two words is the minimum number of operations (consisting of insertions, deletions or substitutions of a single character, or transposition of two adjacent characters) required to change one word into the other.
\nThe Damerau-Levenshtein distance differs from the classical Levenshtein distance by including transpositions among its allowable operations in addition to the three classical single-character edit operations (insertions, deletions and substitutions).\n\nThe implementation is based on [James M. Jensen II](https://www.lemoda.net/text-fuzzy/damerau-levenshtein/index.html) explanation and it allows specifying the cost of every operation.\n\n## Requirements\nThis code requires Python 2.7 or 3.4+ and a C compiler such as GCC.\n\n## Install\nfastDamerauLevenshtein is available on PyPI at https://pypi.python.org/pypi/fastDamerauLevenshtein.\n\nInstall using [pip](https://pypi.org/project/pip/):\n\n pip install fastDamerauLevenshtein\n\nInstall from source:\n\n python setup.py install\n\nor\n\n pip install .\n\n## Usage\nThe available method it's called `damerauLevenshtein` and can compute the distance on two objects that are hashable(strings, list of strings etc.). The method provides the following parameters:\n\n* **firstObject**\n\n* **secondObject**\n\n* **similarity**\n * If this parameter value is `False`, it will return the total cost of edit, otherwise it will return a score from 0.0 to 1.0 denoting how similar the two objects are. It is `True` by default.\n\n* **deleteWeight**\n * Cost of delete operation.\n\n* **insertWeight**\n * Cost of insert operation.\n\n* **replaceWeight**\n * Cost of replace operation.\n\n* **swapWeight**\n * Cost of swap operation.\n\nThe provided weights of operations must be `int` values. All these values are `1` by default.\n\nBasic use:\n\n```python\nfrom fastDamerauLevenshtein import damerauLevenshtein\ndamerauLevenshtein('ca', 'abc', similarity=False) # expected result: 2.0\ndamerauLevenshtein('car', 'cars', similarity=True) # expected result: 0.75\ndamerauLevenshtein(['ab', 'bc'], ['ab'], similarity=False) # expected result: 1.0\ndamerauLevenshtein(['ab', 'bc'], ['ab'], similarity=True) # expected result: 0.5\n```\n\n## Benchmark\nOther Python Damerau-Levenshtein and OSA implementations:\n\n* [pyxDamerauLevenshtein](https://github.com/gfairchild/pyxDamerauLevenshtein) (restricted edit distance and no custom weights)\n* [jellyfish](https://github.com/sunlightlabs/jellyfish) (true Damerau-Levenshtein but no custom weights)\n* [editdistance](https://github.com/aflc/editdistance) (restricted edit distance and no custom weights)\n* [textdistance](https://github.com/orsinium/textdistance) (true Damerau-Levenshtein but no custom weights)\n\nPython 3.7 (on Intel i5 6500):\n\n >>> import timeit\n >>> #fastDamerauLevenshtein:\n ... timeit.timeit(setup=\"import fastDamerauLevenshtein; text1='afwafghfdowbihgp'; text2='goagumkphfwifawpte'\", stmt=\"fastDamerauLevenshtein.damerauLevenshtein(text1, text2)\", number=100000)\n 0.43\n >>> #pyxDamerauLevenshtein:\n ... timeit.timeit(setup=\"from pyxdameraulevenshtein import normalized_damerau_levenshtein_distance; text1='afwafghfdowbihgp'; text2='goagumkphfwifawpte'\", stmt=\"normalized_damerau_levenshtein_distance(text1, text2)\", number=100000)\n 2.44\n >>> #jellyfish\n ... timeit.timeit(setup=\"import jellyfish; text1='afwafghfdowbihgp'; text2='goagumkphfwifawpte'\", stmt=\"jellyfish.damerau_levenshtein_distance(text1, text2)\", number=100000)\n 0.20\n >>> #editdistance\n ... timeit.timeit(setup=\"import editdistance; text1='afwafghfdowbihgp'; text2='goagumkphfwifawpte'\", stmt=\"editdistance.eval(text1, text2)\", number=100000)\n 0.22\n >>> #textdistance\n ... timeit.timeit(setup=\"import textdistance; text1='afwafghfdowbihgp'; text2='goagumkphfwifawpte'\", stmt=\"textdistance.damerau_levenshtein.distance(text1, text2)\", number=100000)\n 0.70\n\n-------\nLicense\n-------\n\nIt is released under the MIT license.\n\n Copyright (c) 2019 Robert Grigoroiu\n\n Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the \"Software\"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:\n\n The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.\n\n THE SOFTWARE IS PROVIDED \"AS IS\", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.\n\n\n", "description_content_type": "text/markdown", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/robertgr991/fastDamerauLevenshtein", "keywords": "", "license": "MIT License", "maintainer": "", "maintainer_email": "", "name": "fastDamerauLevenshtein", "package_url": "https://pypi.org/project/fastDamerauLevenshtein/", "platform": "", "project_url": "https://pypi.org/project/fastDamerauLevenshtein/", "project_urls": { "Homepage": "https://github.com/robertgr991/fastDamerauLevenshtein" }, "release_url": "https://pypi.org/project/fastDamerauLevenshtein/1.0.6/", "requires_dist": null, "requires_python": "", "summary": "Cython implementation of true Damerau-Levenshtein algorithm.", "version": "1.0.6" }, "last_serial": 5737733, "releases": { "1.0.0": [ { "comment_text": "", "digests": { "md5": "3fd1acb3dc5741daaa8d96e8d4371c7a", "sha256": "9e24b002ee1b427ff18a79f448d9d75009944568a8e04549633c0f4deebdbcd8" }, "downloads": -1, "filename": "fastDamerauLevenshtein-1.0.0-cp36-cp36m-win_amd64.whl", "has_sig": false, "md5_digest": "3fd1acb3dc5741daaa8d96e8d4371c7a", "packagetype": "bdist_wheel", "python_version": "cp36", "requires_python": null, "size": 23843, "upload_time": "2019-07-03T18:22:15", "url": "https://files.pythonhosted.org/packages/f7/65/f7ef111c1b610c01563a3b3782b8666e060e05394ec2b7bd9638122147de/fastDamerauLevenshtein-1.0.0-cp36-cp36m-win_amd64.whl" }, { "comment_text": "", "digests": { "md5": "3d3aa1b56fafeebb8b22af81e6b3bdd5", "sha256": "920bc38e2d78438135ea8a1ebbd6dc34f48f578b30d7536293cbcc4eaa725d43" }, "downloads": -1, "filename": "fastDamerauLevenshtein-1.0.0.tar.gz", "has_sig": false, "md5_digest": "3d3aa1b56fafeebb8b22af81e6b3bdd5", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 32571, "upload_time": "2019-07-03T17:56:08", "url": "https://files.pythonhosted.org/packages/7a/3c/2a5d0636fa990cd5fe9aeb6fbc80666763d189a38e5df0ed9251ee4cbb82/fastDamerauLevenshtein-1.0.0.tar.gz" } ], "1.0.1": [ { "comment_text": "", "digests": { "md5": "f3a8ab43b5264679890916734c54e547", "sha256": "ca7f131430729c09635c116a5638ec0c5c7c9b341804f344aec51a0a4e4e7c75" }, "downloads": -1, "filename": "fastDamerauLevenshtein-1.0.1-cp36-cp36m-win_amd64.whl", "has_sig": false, "md5_digest": "f3a8ab43b5264679890916734c54e547", "packagetype": "bdist_wheel", "python_version": "cp36", "requires_python": null, "size": 23841, "upload_time": "2019-07-03T18:24:43", "url": "https://files.pythonhosted.org/packages/74/8c/af228066d29bb3788af877201cd5c8aaa5e5a0bac3c8dd954048c642ffc5/fastDamerauLevenshtein-1.0.1-cp36-cp36m-win_amd64.whl" }, { "comment_text": "", "digests": { "md5": "27bb8b0bd3619579d5581796f31633d8", "sha256": "b4580fd6fc8102aeb5a2f8b447e50de714de4e5b781afaac7f90e25daaa0373c" }, "downloads": -1, "filename": "fastDamerauLevenshtein-1.0.1.tar.gz", "has_sig": false, "md5_digest": "27bb8b0bd3619579d5581796f31633d8", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 32537, "upload_time": "2019-07-03T18:24:45", "url": "https://files.pythonhosted.org/packages/1c/74/2e0d54af2dcda95f84fcce0d9d04f2e65007172bb2657f88aede85a57cb1/fastDamerauLevenshtein-1.0.1.tar.gz" } ], "1.0.2": [ { "comment_text": "", "digests": { "md5": "754999a31d4a6796eca0ff5b6cd98a53", "sha256": "2d91651faad3113f52fa55cf446f3c3d34a36a328241660737258faba12ecb61" }, "downloads": -1, "filename": "fastDamerauLevenshtein-1.0.2-cp36-cp36m-win_amd64.whl", "has_sig": false, "md5_digest": "754999a31d4a6796eca0ff5b6cd98a53", "packagetype": "bdist_wheel", "python_version": "cp36", "requires_python": null, "size": 23928, "upload_time": "2019-07-03T18:56:57", "url": "https://files.pythonhosted.org/packages/6c/6e/0c6f9f42b7421c3a5259dc57f491b3dd825b3c55d7b6e972dfdd58c64ba9/fastDamerauLevenshtein-1.0.2-cp36-cp36m-win_amd64.whl" }, { "comment_text": "", "digests": { "md5": "b01a53318b4c399f5ed562bc8f4979d5", "sha256": "2b5fbfe8f382905c8d770354a516c2e119def3718b8a678e035817c45c23c591" }, "downloads": -1, "filename": "fastDamerauLevenshtein-1.0.2.tar.gz", "has_sig": false, "md5_digest": "b01a53318b4c399f5ed562bc8f4979d5", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 32636, "upload_time": "2019-07-03T18:56:58", "url": "https://files.pythonhosted.org/packages/17/06/02666e53b056340b13da21dd7af355a1cbae1c538f7c97d28cf1c820000f/fastDamerauLevenshtein-1.0.2.tar.gz" } ], "1.0.3": [ { "comment_text": "", "digests": { "md5": "70b5a1e5474464805fd68851661c4708", "sha256": "66ef672103cdd748f65da3e8713516f32db74f43a38c10d8f7ddf9021e0a90f6" }, "downloads": -1, "filename": "fastDamerauLevenshtein-1.0.3-cp36-cp36m-win_amd64.whl", "has_sig": false, "md5_digest": "70b5a1e5474464805fd68851661c4708", "packagetype": "bdist_wheel", "python_version": "cp36", "requires_python": null, "size": 24040, "upload_time": "2019-07-04T17:23:23", "url": "https://files.pythonhosted.org/packages/a6/c7/372a066469af25e9ac61d73af4ed261fec6b6f261fbfab60d4a64719c395/fastDamerauLevenshtein-1.0.3-cp36-cp36m-win_amd64.whl" }, { "comment_text": "", "digests": { "md5": "ef57c8f10e334564e29e69ed883fea03", "sha256": "a7082e6d431238762149700b9c7b9228516beabc93a5a18f2a0666c5bf438c4d" }, "downloads": -1, "filename": "fastDamerauLevenshtein-1.0.3.tar.gz", "has_sig": false, "md5_digest": "ef57c8f10e334564e29e69ed883fea03", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 32944, "upload_time": "2019-07-04T17:23:25", "url": "https://files.pythonhosted.org/packages/68/53/ad11d5c0266e75d131df34f2aa51b8aa515157916ec7072b8ee08b37089d/fastDamerauLevenshtein-1.0.3.tar.gz" } ], "1.0.4": [ { "comment_text": "", "digests": { "md5": "c0a4044e278f70ca7075b0076f8efc50", "sha256": "e1f4776b52b6cb9ac715a9ab04a6b498c03b26f7721f6ad417c63b106e480525" }, "downloads": -1, "filename": "fastDamerauLevenshtein-1.0.4-cp36-cp36m-win_amd64.whl", "has_sig": false, "md5_digest": "c0a4044e278f70ca7075b0076f8efc50", "packagetype": "bdist_wheel", "python_version": "cp36", "requires_python": null, "size": 24048, "upload_time": "2019-07-06T17:19:08", "url": "https://files.pythonhosted.org/packages/67/58/5c5aee3bea2930f991e5e726188d8886a1cfbc9e0be11e3c6b277407587a/fastDamerauLevenshtein-1.0.4-cp36-cp36m-win_amd64.whl" }, { "comment_text": "", "digests": { "md5": "0b83ac6f61e335283be82130831fadb0", "sha256": "824967f5559287b8fbe35a05d61be68e7dc4ee74068e853cdbef3037a7893639" }, "downloads": -1, "filename": "fastDamerauLevenshtein-1.0.4.tar.gz", "has_sig": false, "md5_digest": "0b83ac6f61e335283be82130831fadb0", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 32962, "upload_time": "2019-07-06T17:19:09", "url": "https://files.pythonhosted.org/packages/43/31/906b19b8dfcd2e9607646c26359bdab382cfcd26589f41500e24475e37da/fastDamerauLevenshtein-1.0.4.tar.gz" } ], "1.0.5": [ { "comment_text": "", "digests": { "md5": "a91feed748d89e6080dea5502ecf43c7", "sha256": "8c5a6a407674ffd3a2419673fd67af8da53d9dd30e8f7ebc8f9842246772489c" }, "downloads": -1, "filename": "fastDamerauLevenshtein-1.0.5-cp36-cp36m-win_amd64.whl", "has_sig": false, "md5_digest": "a91feed748d89e6080dea5502ecf43c7", "packagetype": "bdist_wheel", "python_version": "cp36", "requires_python": null, "size": 24131, "upload_time": "2019-08-26T12:01:20", "url": "https://files.pythonhosted.org/packages/64/d9/14bd4eb5a472cbbcee9d23a15496a69eb39a8cb94eb6326fa846214f98f2/fastDamerauLevenshtein-1.0.5-cp36-cp36m-win_amd64.whl" }, { "comment_text": "", "digests": { "md5": "a1e6be5f4c2e5381a1a9c240e73474e6", "sha256": "5c9bdab076f9e9e30a1166c65026897ef5f06cc2325fc93380fdd2dc69c18571" }, "downloads": -1, "filename": "fastDamerauLevenshtein-1.0.5.tar.gz", "has_sig": false, "md5_digest": "a1e6be5f4c2e5381a1a9c240e73474e6", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 33246, "upload_time": "2019-08-26T12:01:21", "url": "https://files.pythonhosted.org/packages/f0/7f/8560afb5e7739f415636d7ab6481264370dcf1571381a78a18003ea6acb1/fastDamerauLevenshtein-1.0.5.tar.gz" } ], "1.0.6": [ { "comment_text": "", "digests": { "md5": "b9984b3544bc29d38fd12a5d2b99d0de", "sha256": "b49fc697ed0667800db7b9bdda272d222e38d772900fbd9f888fed6c770a71d1" }, "downloads": -1, "filename": "fastDamerauLevenshtein-1.0.6-cp36-cp36m-win_amd64.whl", "has_sig": false, "md5_digest": "b9984b3544bc29d38fd12a5d2b99d0de", "packagetype": "bdist_wheel", "python_version": "cp36", "requires_python": null, "size": 24148, "upload_time": "2019-08-27T15:50:44", "url": "https://files.pythonhosted.org/packages/88/56/a7f3d68e9a38a7efe8c6c326e30128d6434d76ab7f159a157ee2bd365d30/fastDamerauLevenshtein-1.0.6-cp36-cp36m-win_amd64.whl" }, { "comment_text": "", "digests": { "md5": "0049c4a6fa4ac18777f3d1b0606f0fc8", "sha256": "8f2a219ef6bb72eb20419ec6ffb81750155d92d510b27a00270623f5c5708ff0" }, "downloads": -1, "filename": "fastDamerauLevenshtein-1.0.6.tar.gz", "has_sig": false, "md5_digest": "0049c4a6fa4ac18777f3d1b0606f0fc8", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 33310, "upload_time": "2019-08-27T15:50:46", "url": "https://files.pythonhosted.org/packages/d6/de/8cccbfdae80a6ae31a53127ba6d2649f21e2161b42bcd63199962b48d223/fastDamerauLevenshtein-1.0.6.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "b9984b3544bc29d38fd12a5d2b99d0de", "sha256": "b49fc697ed0667800db7b9bdda272d222e38d772900fbd9f888fed6c770a71d1" }, "downloads": -1, "filename": "fastDamerauLevenshtein-1.0.6-cp36-cp36m-win_amd64.whl", "has_sig": false, "md5_digest": "b9984b3544bc29d38fd12a5d2b99d0de", "packagetype": "bdist_wheel", "python_version": "cp36", "requires_python": null, "size": 24148, "upload_time": "2019-08-27T15:50:44", "url": "https://files.pythonhosted.org/packages/88/56/a7f3d68e9a38a7efe8c6c326e30128d6434d76ab7f159a157ee2bd365d30/fastDamerauLevenshtein-1.0.6-cp36-cp36m-win_amd64.whl" }, { "comment_text": "", "digests": { "md5": "0049c4a6fa4ac18777f3d1b0606f0fc8", "sha256": "8f2a219ef6bb72eb20419ec6ffb81750155d92d510b27a00270623f5c5708ff0" }, "downloads": -1, "filename": "fastDamerauLevenshtein-1.0.6.tar.gz", "has_sig": false, "md5_digest": "0049c4a6fa4ac18777f3d1b0606f0fc8", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 33310, "upload_time": "2019-08-27T15:50:46", "url": "https://files.pythonhosted.org/packages/d6/de/8cccbfdae80a6ae31a53127ba6d2649f21e2161b42bcd63199962b48d223/fastDamerauLevenshtein-1.0.6.tar.gz" } ] }