{ "info": { "author": "Goncalo Pinheira", "author_email": "goncalopp+pypi@quorumverita.com", "bugtrack_url": null, "classifiers": [ "Development Status :: 2 - Pre-Alpha", "Intended Audience :: Developers", "Operating System :: OS Independent", "Programming Language :: Python", "Topic :: Software Development :: Libraries" ], "description": "A simple pythonic OCR engine using opencv and numpy.\n\nOriginally inspired by\nhttp://stackoverflow.com/questions/9413216/simple-digit-recognition-ocr-in-opencv-python\n\nEssential Concepts\n==================\n\n#### Segmentation\n\nIn order for OCR to be performed on a image, several steps must be \nperformed on the source image. Segmentation is the process of \nidentifying the regions of the image that represent characters. \n\nThis project uses rectangles to model segments. \n\n#### Supervised learning with a classification problem\n\nThe [classification problem][] consists in identifying to which class a \nobservation belongs to (i.e.: which particular character is contained \nin a segment).\n\n[Supervised learning][] is a way of \"teaching\" a machine. Basically, an \nalgorithm is *trained* through *examples* (i.e.: this particular \nsegment contains the character `f`). After training, the machine \nshould be able to apply its acquired knowledge to new data.\n\nThe [k-NN algorithm], used in this project, is one of the simplest \nclassification algorithm.\n\n#### Grounding\n\nCreating a example image with already classified characters, for \ntraining purposes.\nSee [ground truth][].\n\n[classification problem]: https://en.wikipedia.org/wiki/Statistical_classification\n[Supervised learning]: https://en.wikipedia.org/wiki/Supervised_learning\n[k-NN algorithm]: https://en.wikipedia.org/wiki/K-nearest_neighbors_classification\n[ground truth]: https://en.wikipedia.org/wiki/Ground_truth\n\nHow to understand this project\n==============================\n\nUnfortunately, documentation is a bit sparse at the moment (I \ngladly accept contributions).\nThe project is well-structured, and most classes and functions have \ndocstrings, so that's probably a good way to start.\n\nIf you need any help, don't hesitate to contact me. You can find my \nemail on my github profile.\n\n\nHow to use\n==========\n\nPlease check `example.py` for basic usage with the existing pre-grounded images.\n\nYou can use your own images, by placing them on the `data` directory. \nGrounding images interactively can be accomplished by using `grounding.UserGrounder`.\nFor more details check `example_grounding.py`", "description_content_type": null, "docs_url": null, "download_url": "UNKNOWN", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/goncalopp/simple-ocr-opencv", "keywords": null, "license": "UNKNOWN", "maintainer": null, "maintainer_email": null, "name": "simple-opencv-ocr", "package_url": "https://pypi.org/project/simple-opencv-ocr/", "platform": "UNKNOWN", "project_url": "https://pypi.org/project/simple-opencv-ocr/", "project_urls": { "Download": "UNKNOWN", "Homepage": "https://github.com/goncalopp/simple-ocr-opencv" }, "release_url": "https://pypi.org/project/simple-opencv-ocr/0.0/", "requires_dist": null, "requires_python": null, "summary": "A simple OCR with OpenCV", "version": "0.0" }, "last_serial": 2716265, "releases": { "0.0": [] }, "urls": [] }