{ "info": { "author": "Sanskrit programmers", "author_email": "sanskrit-programmers@googlegroups.com", "bugtrack_url": null, "classifiers": [ "Development Status :: 3 - Alpha", "Intended Audience :: Developers", "Intended Audience :: Education", "Intended Audience :: Science/Research", "License :: OSI Approved :: MIT License", "Programming Language :: Python :: 2", "Programming Language :: Python :: 2.7", "Programming Language :: Python :: 3", "Programming Language :: Python :: 3.6", "Topic :: Text Processing :: Linguistic" ], "description": "sanskrit\\_parser\n================\n\nParsers for Sanskrit / \u0938\u0902\u0938\u094d\u0915\u0943\u0924\u092e\u094d\n\n|CI Build Status|\n\n**NOTE:** This project is still under development. Both over-generation\n(invalid forms/splits) and under-generation (missing valid forms/splits)\nare quite likely. Please see the Sanskrit Parser Stack section below for\ndetailed status. Report any issues\n`here `__.\n\nPlease feel free to ping us if you would like to collaborate on this\nproject.\n\nTry it out!\n-----------\n\nA web interface is available here -\nhttps://kmadathil.github.io/sanskrit\\_parser/ui/index.html\n\nInstallation\n------------\n\nThis project has been tested and developed using Python 2.7. 
A port to\nPython 3 has been completed, and everything *should* now work in both\nversions of Python.\n\n::\n\n pip install sanskrit_parser\n\nUsage\n-----\n\n- See `generated Sphinx\n docs `__.\n- PS: Command line usage is also documented there.\n\nDeploying REST API server\n~~~~~~~~~~~~~~~~~~~~~~~~~\n\nRun:\n\n::\n\n sudo mkdir /var/www/.sanskrit_parser\n sudo chmod a+rwx /var/www/.sanskrit_parser\n\nContribution\n------------\n\n- Generate docs: ``cd docs; make html``\n\nSanskrit Parser Stack\n---------------------\n\nStack of parsing tools\n\nLevel 0\n~~~~~~~\n\nSandhi splitting subroutine Input: Phoneme sequence and Phoneme number\nto split at Action: Perform a sandhi split at given input phoneme number\nOutput: left and right sequences (multiple options will be output). No\nsemantic validation will be performed (up to higher levels)\n\nCurrent Status\n^^^^^^^^^^^^^^\n\nModule that performs sandhi split/join and convenient rule definition is\nat ``lexical_analyzer/sandhi.py``.\n\nRule definitions (human readable!) are at\n``lexical_analyzer/sandhi_rules/*.txt``\n\nLevel 1\n~~~~~~~\n\n- From dhatu + lakAra + puruSha + vachana to pada and vice versa\n- From prAtipadika + vibhakti + vachana to pada and vice versa\n- Upasarga + dhAtu forms - forwards and backwards\n- nAmadhAtu forms\n- Krt forms - forwards and backwards\n- Taddhita forms - forwards and backwards\n\nCurrent Status\n^^^^^^^^^^^^^^\n\nTo be done.\n\nHowever, we have a usable solution with inriaxmlwrapper + Prof. Gerard\nHuet's forms database to act as queryable form database. 
That gives us\nthe bare minimum we need from Level 1, so Level 2 can work.\n\nLevel 2\n~~~~~~~\n\nInput\n^^^^^\n\nSanskrit Sentence #### Action \\* Traverse the sentence, splitting it (or\nnot) at each location to determine all possible valid splits \\* Traverse\nfrom left to right \\* Using dynamic programming, assemble the results of\nall choices\n\n::\n\n To split or not to split at each phoneme\n\n If split, all possible left/right combination of phonemes that can result\n\n Once split, check if the left section is a valid pada (use level 1 tools to pick pada type and tag morphologically) \n\n If left section is valid, proceed to split the right section\n\n- At the end of this step, we will have all possible syntactically\n valid splits with morphological tags\n\nOutput\n^^^^^^\n\nAll semantically valid sandhi split sequences\n\nCurrent Status\n^^^^^^^^^^^^^^\n\nModule that performs sentence split is at\n``lexical_analyzer/SanksritLexicalAnalyzer.py``\n\nLevel 3\n~~~~~~~\n\nInput\n^^^^^\n\nSemantically valid sequence of tagged padas (output of Level 1) ####\nAction: \\* Assemble graphs of morphological constraints\n\n::\n\n viseShaNa - viseShya\n\n karaka/vibhakti\n\n vachana/puruSha constraints on tiGantas and subantas\n\n- Check validity of graphs #### Output\n\n1. Is the input sequence a morphologically valid sentence?\n2. Enhanced sequence of tagged padas, with karakas tagged, and a\n dependency graph associated\n\nCurrent Status\n^^^^^^^^^^^^^^\n\nEarly experimental version (simple sentences only) is at\n``morphological_analyzer/SanskritMorphologicalAnalyzer.py``\n\nSeq2Seq based Sanskrit Parser\n-----------------------------\n\nSee: Grammar as a Foreign Language : Vinyals & Kaiser et. al. Google\nhttp://arxiv.org/abs/1412.7449\n\n- Method: Seq2Seq Neural Network (n? 
layers)\n- Input Embedding with word2vec (optional)\n\nInput\n~~~~~\n\nSanskrit sentence ### Output Sentence split into padas with tags ###\nTrain/Test data DCS corpus, converted by Vishvas Vasuki\n\nCurrent Status\n^^^^^^^^^^^^^^\n\nNot begun\n\n.. |CI Build Status| image:: https://img.shields.io/travis/kmadathil/sanskrit_parser/master.svg\n :target: https://travis-ci.org/kmadathil/sanskrit_parser\n\n\n", "description_content_type": "", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/kmadathil/sanskrit_parser", "keywords": "sanskrit samskritam", "license": "MIT", "maintainer": "", "maintainer_email": "", "name": "sanskrit-parser", "package_url": "https://pypi.org/project/sanskrit-parser/", "platform": "", "project_url": "https://pypi.org/project/sanskrit-parser/", "project_urls": { "Homepage": "https://github.com/kmadathil/sanskrit_parser" }, "release_url": "https://pypi.org/project/sanskrit-parser/0.0.4/", "requires_dist": [ "flask", "gensim", "indic-transliteration", "jsonpickle", "lxml", "networkx", "python-constraint", "requests", "sanskrit-util", "sentencepiece", "six", "sqlalchemy", "tinydb" ], "requires_python": "", "summary": "Tools for lexical and morphological analysis of Sanskrit", "version": "0.0.4" }, "last_serial": 5234319, "releases": { "0.0.1.dev1": [ { "comment_text": "", "digests": { "md5": "17a0dbc895eea7a2fd66f88bf3f50f41", "sha256": "0b3313d94fdf0abeca665301ee1ab28448db42cb3441d858be3e1bd2a4173a0c" }, "downloads": -1, "filename": "sanskrit_parser-0.0.1.dev1-py2-none-any.whl", "has_sig": false, "md5_digest": "17a0dbc895eea7a2fd66f88bf3f50f41", "packagetype": "bdist_wheel", "python_version": "py2", "requires_python": null, "size": 33547, "upload_time": "2017-08-01T06:17:52", "url": "https://files.pythonhosted.org/packages/7c/cf/2b4d311fc86659b7a7639173c23dfb5a378d28f066766cb971a9a666e94a/sanskrit_parser-0.0.1.dev1-py2-none-any.whl" }, { "comment_text": "", 
"digests": { "md5": "6746354a5771a7a0a3d6d6987021cc38", "sha256": "8bf8fb74efbf1024db0538343e9e538c3754c0248e37f37bd8569ce7a506c7d5" }, "downloads": -1, "filename": "sanskrit_parser-0.0.1.dev1.tar.gz", "has_sig": false, "md5_digest": "6746354a5771a7a0a3d6d6987021cc38", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 25653, "upload_time": "2017-08-01T06:17:54", "url": "https://files.pythonhosted.org/packages/6c/d6/504d15efd85fcbaf09c1ebed54b0890fead3e51209bc8a7386fb71e1a18f/sanskrit_parser-0.0.1.dev1.tar.gz" } ], "0.0.1.dev2": [ { "comment_text": "", "digests": { "md5": "b6825813045a7578cfe5dbcc593bc855", "sha256": "90836dd09a6cb9662c84100ce96e4ca8cc32fe7ec5b07b98e88eacaac830402a" }, "downloads": -1, "filename": "sanskrit_parser-0.0.1.dev2-py2-none-any.whl", "has_sig": false, "md5_digest": "b6825813045a7578cfe5dbcc593bc855", "packagetype": "bdist_wheel", "python_version": "py2", "requires_python": null, "size": 33539, "upload_time": "2017-08-01T07:20:41", "url": "https://files.pythonhosted.org/packages/59/9d/dffd26fa4fbe1ec9643f7b402e05cae885d533edfe076cc5f55815f49bea/sanskrit_parser-0.0.1.dev2-py2-none-any.whl" }, { "comment_text": "", "digests": { "md5": "72cdbd2490b2d85a1fe672be74da9202", "sha256": "7aacdaa99ac6ae6ed9663e171783eb8fc20606f1c10230e955ba94dbe8ba4c22" }, "downloads": -1, "filename": "sanskrit_parser-0.0.1.dev2.tar.gz", "has_sig": false, "md5_digest": "72cdbd2490b2d85a1fe672be74da9202", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 25673, "upload_time": "2017-08-01T07:20:44", "url": "https://files.pythonhosted.org/packages/15/ee/beb0443f17c20e67b0a45a19d49116094d3b385798a788816e1874ed491f/sanskrit_parser-0.0.1.dev2.tar.gz" } ], "0.0.1.dev3": [ { "comment_text": "", "digests": { "md5": "5063b8becebfdedc8f21337b80046400", "sha256": "414d433bebe2c8e3bf97ebdde1bbb35a9a4cfc872a67d2d9695139bbc63b38d7" }, "downloads": -1, "filename": "sanskrit_parser-0.0.1.dev3-py2.7.egg", "has_sig": 
false, "md5_digest": "5063b8becebfdedc8f21337b80046400", "packagetype": "bdist_egg", "python_version": "2.7", "requires_python": null, "size": 52051, "upload_time": "2017-08-09T06:38:43", "url": "https://files.pythonhosted.org/packages/b7/94/f7104bbfd435ea946b7289c0122a6fe79288204fa6354f76eeaeafe068ba/sanskrit_parser-0.0.1.dev3-py2.7.egg" }, { "comment_text": "", "digests": { "md5": "d04478900fd6eaad72f5c30c400a5cf2", "sha256": "ecd1d4d28233a7f6c2255e7d7f6990fcfab8c6617ce4a04d1310e924ec4cb9c1" }, "downloads": -1, "filename": "sanskrit_parser-0.0.1.dev3-py2-none-any.whl", "has_sig": false, "md5_digest": "d04478900fd6eaad72f5c30c400a5cf2", "packagetype": "bdist_wheel", "python_version": "py2", "requires_python": null, "size": 33684, "upload_time": "2017-08-09T06:38:40", "url": "https://files.pythonhosted.org/packages/3f/6a/147213ed02dcb46949e14e0b05f615c554156031ededefbe87b26df39e04/sanskrit_parser-0.0.1.dev3-py2-none-any.whl" }, { "comment_text": "", "digests": { "md5": "94ad846ab2ae3270dcea0eb4f250fb00", "sha256": "347857ed69492a71703eb3e3be33af5ccdb31124337c5468059b2f1705bf84f0" }, "downloads": -1, "filename": "sanskrit_parser-0.0.1.dev3.tar.gz", "has_sig": false, "md5_digest": "94ad846ab2ae3270dcea0eb4f250fb00", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 25861, "upload_time": "2017-08-09T06:38:45", "url": "https://files.pythonhosted.org/packages/f2/9c/1632eb71192315c5276ad83b8c943b51c3b1876d4b90dd374fc45dffec6f/sanskrit_parser-0.0.1.dev3.tar.gz" } ], "0.0.1.dev4": [ { "comment_text": "", "digests": { "md5": "309ba0ad061527ecca199ee33a4e3e40", "sha256": "ddd5be7d842a1bf92b58a0bb73267c300fd664201f2aa3adead2b4f8d90f9253" }, "downloads": -1, "filename": "sanskrit_parser-0.0.1.dev4-py2-none-any.whl", "has_sig": false, "md5_digest": "309ba0ad061527ecca199ee33a4e3e40", "packagetype": "bdist_wheel", "python_version": "py2", "requires_python": null, "size": 37151, "upload_time": "2017-10-01T06:19:47", "url": 
"https://files.pythonhosted.org/packages/8a/af/af0ffb5390b0a333c937867343a2eaca74d5f15e05fdc11cf456c050d3c7/sanskrit_parser-0.0.1.dev4-py2-none-any.whl" }, { "comment_text": "", "digests": { "md5": "af34de69e31a1f36508008fa442a70e4", "sha256": "2ee9cd435dbd794fbf0a8e28cbebe24679b61aee71ed1cdf2769e20347cf09f3" }, "downloads": -1, "filename": "sanskrit_parser-0.0.1.dev4.tar.gz", "has_sig": false, "md5_digest": "af34de69e31a1f36508008fa442a70e4", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 27334, "upload_time": "2017-10-01T06:19:49", "url": "https://files.pythonhosted.org/packages/24/b7/43e13340f1417429f33ee9aab9c32b4b003fa037b1d563246be92ed230ac/sanskrit_parser-0.0.1.dev4.tar.gz" } ], "0.0.1.dev5": [ { "comment_text": "", "digests": { "md5": "6f816916d0e32ed86f1b1e00aabbf7e0", "sha256": "5da77d382bf770f65a65de76258153aeab7e7bd35f31e9cea18d478a616b57d8" }, "downloads": -1, "filename": "sanskrit_parser-0.0.1.dev5-py2-none-any.whl", "has_sig": false, "md5_digest": "6f816916d0e32ed86f1b1e00aabbf7e0", "packagetype": "bdist_wheel", "python_version": "py2", "requires_python": null, "size": 39962, "upload_time": "2017-11-07T08:56:28", "url": "https://files.pythonhosted.org/packages/5b/1d/48bee24684c2bad9ba4f8d33437796a540c56fbdd9f620a39664fdf9808d/sanskrit_parser-0.0.1.dev5-py2-none-any.whl" }, { "comment_text": "", "digests": { "md5": "ef43235729ceedbc4efc02ca0e20d14a", "sha256": "09ae4873efba6adbc02e3ae756e13122e226a83015344cab054bbe5489874e49" }, "downloads": -1, "filename": "sanskrit_parser-0.0.1.dev5.tar.gz", "has_sig": false, "md5_digest": "ef43235729ceedbc4efc02ca0e20d14a", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 29384, "upload_time": "2017-11-07T08:56:30", "url": "https://files.pythonhosted.org/packages/d2/4d/65dda39114a480ec6aeec6c14921d4e16132ce0c549cf4e1a8b5bc57ce6b/sanskrit_parser-0.0.1.dev5.tar.gz" } ], "0.0.1.dev6": [ { "comment_text": "", "digests": { "md5": 
"0f6911d8e7a1043180972b2998cabef4", "sha256": "ff80de0c243f7c9caf952601f417108b5762e42b8ff68a59931d67fce89b2c00" }, "downloads": -1, "filename": "sanskrit_parser-0.0.1.dev6-py2-none-any.whl", "has_sig": false, "md5_digest": "0f6911d8e7a1043180972b2998cabef4", "packagetype": "bdist_wheel", "python_version": "py2", "requires_python": null, "size": 47229, "upload_time": "2017-12-09T15:39:39", "url": "https://files.pythonhosted.org/packages/82/f1/3dc7eb1d18bda544c55a06b8aa7c4af37024516b62dabf5a5a7266464bd6/sanskrit_parser-0.0.1.dev6-py2-none-any.whl" }, { "comment_text": "", "digests": { "md5": "948cd618e358e1820bf11c8d844ccd44", "sha256": "4fbeed791625fa79b40f6c60cff39c96bf37b76d12e68b76cd3adf33117afa05" }, "downloads": -1, "filename": "sanskrit_parser-0.0.1.dev6.tar.gz", "has_sig": false, "md5_digest": "948cd618e358e1820bf11c8d844ccd44", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 31696, "upload_time": "2017-12-09T15:39:41", "url": "https://files.pythonhosted.org/packages/02/cd/e66c4ae83610a2a070a2fa3f89ecbdcd520d97a02d9d58af79f2558a293a/sanskrit_parser-0.0.1.dev6.tar.gz" } ], "0.0.2": [ { "comment_text": "", "digests": { "md5": "9627c6620039439069a2665bfa79c6b8", "sha256": "408ca984e76d6b0507d9728635427961ae766688a5c1f1d2560eb74010fe38e2" }, "downloads": -1, "filename": "sanskrit_parser-0.0.2-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "9627c6620039439069a2665bfa79c6b8", "packagetype": "bdist_wheel", "python_version": "py2.py3", "requires_python": null, "size": 59193, "upload_time": "2019-01-28T17:38:07", "url": "https://files.pythonhosted.org/packages/a4/25/c1a5050954be8c0e931ee209f203f5598273cba3fa43f394f327b1f8cecc/sanskrit_parser-0.0.2-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "a143222dcfe0b4baad08b9ca49221678", "sha256": "d585d1938f63cc3122c3e7003568a1b92cea86ac73f8f1a19048162d61103037" }, "downloads": -1, "filename": "sanskrit_parser-0.0.2.tar.gz", "has_sig": false, "md5_digest": 
"a143222dcfe0b4baad08b9ca49221678", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 36765, "upload_time": "2019-01-28T17:38:08", "url": "https://files.pythonhosted.org/packages/e2/3d/3674dbd009ed98124c6577fcc43636bcb9ad119e6df189abd23e828126d6/sanskrit_parser-0.0.2.tar.gz" } ], "0.0.3": [ { "comment_text": "", "digests": { "md5": "2ed5e1b2a27f71f5391d6ca9efcf2411", "sha256": "fa4d8a192e73bbe420fa2f5f9d1e3071b65aa074efe6a68e89ef0ca2e75f0944" }, "downloads": -1, "filename": "sanskrit_parser-0.0.3-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "2ed5e1b2a27f71f5391d6ca9efcf2411", "packagetype": "bdist_wheel", "python_version": "py2.py3", "requires_python": null, "size": 63125, "upload_time": "2019-04-15T22:05:13", "url": "https://files.pythonhosted.org/packages/15/11/993142f7668392f4242d53d22445d95982c041365a9492304a473810b13a/sanskrit_parser-0.0.3-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "bcf9326fd5c40f880048f94d29edc2ef", "sha256": "2b3f34f977e3db37929e6e4d5c8655ff573c94a0dc49b4acdea47e959a69d7e8" }, "downloads": -1, "filename": "sanskrit_parser-0.0.3.tar.gz", "has_sig": false, "md5_digest": "bcf9326fd5c40f880048f94d29edc2ef", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 38794, "upload_time": "2019-04-15T22:05:15", "url": "https://files.pythonhosted.org/packages/0f/b5/051997f34a899eaeff7cc95967048274c57683d8de30786915c0e41536e9/sanskrit_parser-0.0.3.tar.gz" } ], "0.0.4": [ { "comment_text": "", "digests": { "md5": "ca1afd5ed17b9686d95663dc23dd210d", "sha256": "2a707d5666a97c9bf500e2a09e62eb0dd555e886e8d9759b8125dd683e0f090e" }, "downloads": -1, "filename": "sanskrit_parser-0.0.4-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "ca1afd5ed17b9686d95663dc23dd210d", "packagetype": "bdist_wheel", "python_version": "py2.py3", "requires_python": null, "size": 63106, "upload_time": "2019-05-06T18:47:10", "url": 
"https://files.pythonhosted.org/packages/9e/37/404f5f3f1710f6f2564fb3966b18e5c36d897ea594033abd73c0cf729607/sanskrit_parser-0.0.4-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "1b6b232ac5600157dbfb4e94d30e4f22", "sha256": "ca880aeaa76dd8f6f5291b679a69c916be09b2da7fc35bf77ddc509eaef08d2c" }, "downloads": -1, "filename": "sanskrit_parser-0.0.4.tar.gz", "has_sig": false, "md5_digest": "1b6b232ac5600157dbfb4e94d30e4f22", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 38797, "upload_time": "2019-05-06T18:47:11", "url": "https://files.pythonhosted.org/packages/43/9a/0e2319b42e181c5a497a64af9168e032c5552995722f9ff685a6e6b6dfcb/sanskrit_parser-0.0.4.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "ca1afd5ed17b9686d95663dc23dd210d", "sha256": "2a707d5666a97c9bf500e2a09e62eb0dd555e886e8d9759b8125dd683e0f090e" }, "downloads": -1, "filename": "sanskrit_parser-0.0.4-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "ca1afd5ed17b9686d95663dc23dd210d", "packagetype": "bdist_wheel", "python_version": "py2.py3", "requires_python": null, "size": 63106, "upload_time": "2019-05-06T18:47:10", "url": "https://files.pythonhosted.org/packages/9e/37/404f5f3f1710f6f2564fb3966b18e5c36d897ea594033abd73c0cf729607/sanskrit_parser-0.0.4-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "1b6b232ac5600157dbfb4e94d30e4f22", "sha256": "ca880aeaa76dd8f6f5291b679a69c916be09b2da7fc35bf77ddc509eaef08d2c" }, "downloads": -1, "filename": "sanskrit_parser-0.0.4.tar.gz", "has_sig": false, "md5_digest": "1b6b232ac5600157dbfb4e94d30e4f22", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 38797, "upload_time": "2019-05-06T18:47:11", "url": "https://files.pythonhosted.org/packages/43/9a/0e2319b42e181c5a497a64af9168e032c5552995722f9ff685a6e6b6dfcb/sanskrit_parser-0.0.4.tar.gz" } ] }