{ "info": { "author": "Sheean Spoel, Digital Humanities Lab, Utrecht University", "author_email": "s.j.j.spoel@uu.nl", "bugtrack_url": null, "classifiers": [ "License :: OSI Approved :: MIT License", "Operating System :: OS Independent", "Programming Language :: Python :: 3" ], "description": "[![Build Status](https://travis-ci.org/UUDigitalHumanitieslab/corpus2alpino.svg?branch=master)](https://travis-ci.org/UUDigitalHumanitieslab/corpus2alpino)\n\n# FoLiA and TEI to Alpino XML\n\nConverts [FoLiA](https://proycon.github.io/folia/) and [TEI](http://www.tei-c.org) XML files to [Alpino](www.let.rug.nl/vannoord/alp/Alpino) XML files. Each sentence in the input file is parsed separately.\n\n## Usage\n\n### Command Line\n\n```bash\npip install corpus2alpino\ncorpus2alpino -s localhost:7001 folia.xml -o alpino.xml\n```\n\nOr from project root:\n\n```bash\npython -m corpus2alpino -s localhost:7001 folia.xml -o alpino.xml\n```\n\n### Library\n\n```python\nfrom corpus2alpino.converter import Converter\nfrom corpus2alpino.annotators.alpino import AlpinoAnnotator\nfrom corpus2alpino.collectors.filesystem import FilesystemCollector\nfrom corpus2alpino.targets.memory import MemoryTarget\nfrom corpus2alpino.writers.lassy import LassyWriter\n\nalpino = AlpinoAnnotator(\"localhost\", 7001)\nconverter = Converter(FilesystemCollector([\"folia.xml\"]),\n # Not needed when using the PaQuWriter\n annotators=[alpino],\n # This can also be ConsoleTarget, FilesystemTarget\n target=MemoryTarget(),\n # Set to merge treebanks, also possible to use PaQuWriter\n writer=LassyWriter(True))\n\n# get the Alpino XML output, combined into one treebank XML file\nparses = converter.convert()\nprint(''.join(parses)) # \n```\n\n### Unit Test\n\n```bash\npython -m unittest\n```\n\n### Upload to PyPi\n\nSee: https://packaging.python.org/tutorials/packaging-projects/#generating-distribution-archives\n\nMake sure `setuptools` and `wheel` are installed. Then from the virtualenv:\n\n```bash\npython setup.py sdist bdist_wheel\ntwine upload dist/*\n```\n\n## Requirements\n\n* [Alpino parser](http://www.let.rug.nl/vannoord/alp/Alpino) running as a server: `Alpino batch_command=alpino_server -notk server_port=7001`\n* Python 3.6 or higher (developed using 3.6.3).\n* [libfolia-dev](https://packages.ubuntu.com/bionic/libfolia-dev)\n* [libicu-dev](https://packages.ubuntu.com/bionic/libicu-dev)\n* [libxml2-dev](https://packages.ubuntu.com/bionic/libxml2-dev)\n* [libticcutils2-dev](https://packages.ubuntu.com/bionic/libticcutils2-dev)\n* [libucto-dev](https://packages.ubuntu.com/bionic/libucto-dev)\n* [ucto](https://packages.ubuntu.com/bionic/ucto)\n* [tqdm](https://pypi.org/project/tqdm/)\n\n\n", "description_content_type": "text/markdown", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/UUDigitalHumanitieslab/corpus2alpino", "keywords": "", "license": "MIT", "maintainer": "", "maintainer_email": "", "name": "corpus2alpino", "package_url": "https://pypi.org/project/corpus2alpino/", "platform": "", "project_url": "https://pypi.org/project/corpus2alpino/", "project_urls": { "Homepage": "https://github.com/UUDigitalHumanitieslab/corpus2alpino" }, "release_url": "https://pypi.org/project/corpus2alpino/0.2.3/", "requires_dist": [ "argparse", "Cython", "PyNLPl", "python-ucto", "tei-reader", "tqdm" ], "requires_python": ">=3.6", "summary": "Converts FoLiA and TEI files to Alpino XML files", "version": "0.2.3" }, "last_serial": 4435171, "releases": { "0.1.3": [ { "comment_text": "", "digests": { "md5": "b0b54299d0d40f39d8f3dd9c0a22ea42", "sha256": "3611724100e391934952b0f721ec0f274a492f655525acac5772cec58d34effc" }, "downloads": -1, "filename": "corpus2alpino-0.1.3.tar.gz", "has_sig": false, "md5_digest": "b0b54299d0d40f39d8f3dd9c0a22ea42", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 4212, "upload_time": "2018-02-27T11:36:15", "url": "https://files.pythonhosted.org/packages/8a/84/14fa120e87d39ed018a9c17ada08255cc2ce84d3a81fc2fe83561826b85f/corpus2alpino-0.1.3.tar.gz" } ], "0.1.4": [ { "comment_text": "", "digests": { "md5": "795b54eea20689ba7e28f3cacdfa22fe", "sha256": "58553878ad1c0b83dffd87bcb065c6264b469ed9bbc05bb896320899e4049027" }, "downloads": -1, "filename": "corpus2alpino-0.1.4.tar.gz", "has_sig": false, "md5_digest": "795b54eea20689ba7e28f3cacdfa22fe", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 4215, "upload_time": "2018-02-27T12:43:43", "url": "https://files.pythonhosted.org/packages/f7/33/0b43f764ca9789d4ce0d3380625a07941709144818da4bbb5203dd7db2f6/corpus2alpino-0.1.4.tar.gz" } ], "0.1.5": [ { "comment_text": "", "digests": { "md5": "63c5ac2b35a04bd81647a4d3d296f4c2", "sha256": "ff8a9d18b846c3a074258c15e680df732985d51220c1c2b122887fd9535d18d2" }, "downloads": -1, "filename": "corpus2alpino-0.1.5.tar.gz", "has_sig": false, "md5_digest": "63c5ac2b35a04bd81647a4d3d296f4c2", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5124, "upload_time": "2018-03-06T13:54:37", "url": "https://files.pythonhosted.org/packages/1e/a0/d6b9c73c739bbdfed71055da921f4c2384763fb0b575db8c4cc1a1924c22/corpus2alpino-0.1.5.tar.gz" } ], "0.1.6": [ { "comment_text": "", "digests": { "md5": "fd3c0bb3b97eca574396e962068dfd8b", "sha256": "ff25d27d3d506554475d04b79f2464312f4202ff0bfafb6e4d3f1c16f86dc2f6" }, "downloads": -1, "filename": "corpus2alpino-0.1.6.tar.gz", "has_sig": false, "md5_digest": "fd3c0bb3b97eca574396e962068dfd8b", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5124, "upload_time": "2018-03-06T15:00:52", "url": "https://files.pythonhosted.org/packages/b5/ad/1449cf1a7134f5b851695031a8a5247b149e2474b90e060055af7b22ebd7/corpus2alpino-0.1.6.tar.gz" } ], "0.1.7": [ { "comment_text": "", "digests": { "md5": "c6aa4d99e832fc6252c56181ec5b9430", "sha256": "e13692739852006f65d1f0801578de206035257ac3c78fae60225ca88985920d" }, "downloads": -1, "filename": "corpus2alpino-0.1.7.tar.gz", "has_sig": false, "md5_digest": "c6aa4d99e832fc6252c56181ec5b9430", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5411, "upload_time": "2018-03-06T15:47:52", "url": "https://files.pythonhosted.org/packages/07/c5/ea8483561e0367829f3f2328349c3f2b49070e1a6ff69c362f5c144ad5e8/corpus2alpino-0.1.7.tar.gz" } ], "0.1.8": [ { "comment_text": "", "digests": { "md5": "009615f213bfc7e82fdc1a15e52187fd", "sha256": "703edc342a8441365988ce08c26193730d693467263debf00f9f5ca7324362d8" }, "downloads": -1, "filename": "corpus2alpino-0.1.8.tar.gz", "has_sig": false, "md5_digest": "009615f213bfc7e82fdc1a15e52187fd", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5677, "upload_time": "2018-03-07T14:49:44", "url": "https://files.pythonhosted.org/packages/ed/ad/a81ea8fbbb53a8eda19f75e32dd338440b537ac7ff71410d027e50a4d6ee/corpus2alpino-0.1.8.tar.gz" } ], "0.1.9": [ { "comment_text": "", "digests": { "md5": "3a066a8bb6a766285b25f3ad26a4a4c3", "sha256": "f764424d5f12bf3d8643938a7e2580766fd0db420a5ef38980ba1f4153ae0852" }, "downloads": -1, "filename": "corpus2alpino-0.1.9.tar.gz", "has_sig": false, "md5_digest": "3a066a8bb6a766285b25f3ad26a4a4c3", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5926, "upload_time": "2018-03-20T15:23:07", "url": "https://files.pythonhosted.org/packages/87/0e/104b8af3c185238bdf0309e62861dbb5065c2a8975fb623977c5f2b27e1a/corpus2alpino-0.1.9.tar.gz" } ], "0.2.0": [ { "comment_text": "", "digests": { "md5": "f875f02084386d3a2273ec40f8c37331", "sha256": "b56053916ba8ce6418b48cb3b8ae125653e402c134fb1405ef8c5f8c8b32e52b" }, "downloads": -1, "filename": "corpus2alpino-0.2.0.tar.gz", "has_sig": false, "md5_digest": "f875f02084386d3a2273ec40f8c37331", "packagetype": "sdist", "python_version": "source", "requires_python": ">=3.6", "size": 4561, "upload_time": "2018-08-10T13:24:38", "url": "https://files.pythonhosted.org/packages/f1/08/457072a6f13f945cd4c58f420a71d153247a26a6b1e8d958c587034004fe/corpus2alpino-0.2.0.tar.gz" } ], "0.2.2": [ { "comment_text": "", "digests": { "md5": "7b8bae72756f2a025b36a8c189267f15", "sha256": "6e7e9c2e39b7ab5da12d553229e3f16bfcd3de096edfdd4ab38fe6187b201380" }, "downloads": -1, "filename": "corpus2alpino-0.2.2.tar.gz", "has_sig": false, "md5_digest": "7b8bae72756f2a025b36a8c189267f15", "packagetype": "sdist", "python_version": "source", "requires_python": ">=3.6", "size": 4564, "upload_time": "2018-10-31T09:56:50", "url": "https://files.pythonhosted.org/packages/5b/51/9c756395cb5124a50fbd120bf26ec760bde134be6d80cff616096a0b7676/corpus2alpino-0.2.2.tar.gz" } ], "0.2.3": [ { "comment_text": "", "digests": { "md5": "8055c4e8cb591a23ee7e003067315980", "sha256": "db603a380173ebc6340f6e899ce5c900a005c8e1461426710114e42a3175e6a1" }, "downloads": -1, "filename": "corpus2alpino-0.2.3-py3-none-any.whl", "has_sig": false, "md5_digest": "8055c4e8cb591a23ee7e003067315980", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": ">=3.6", "size": 19459, "upload_time": "2018-10-31T10:10:55", "url": "https://files.pythonhosted.org/packages/b8/9e/be623a7d3af5fd645c75b96d1a13c97137dc29e56d93041c6df8d609b13f/corpus2alpino-0.2.3-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "e3cf86c8836a91e471882d80a9c0ff5b", "sha256": "e54f249573594f032c7d856f39f332bf62d358af9ae92df2989e34e555f8674a" }, "downloads": -1, "filename": "corpus2alpino-0.2.3.tar.gz", "has_sig": false, "md5_digest": "e3cf86c8836a91e471882d80a9c0ff5b", "packagetype": "sdist", "python_version": "source", "requires_python": ">=3.6", "size": 13484, "upload_time": "2018-10-31T10:10:57", "url": "https://files.pythonhosted.org/packages/4e/80/1eb44d480ce8411feebb21514968c8aef20993317b03d3a21acd4b67992c/corpus2alpino-0.2.3.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "8055c4e8cb591a23ee7e003067315980", "sha256": "db603a380173ebc6340f6e899ce5c900a005c8e1461426710114e42a3175e6a1" }, "downloads": -1, "filename": "corpus2alpino-0.2.3-py3-none-any.whl", "has_sig": false, "md5_digest": "8055c4e8cb591a23ee7e003067315980", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": ">=3.6", "size": 19459, "upload_time": "2018-10-31T10:10:55", "url": "https://files.pythonhosted.org/packages/b8/9e/be623a7d3af5fd645c75b96d1a13c97137dc29e56d93041c6df8d609b13f/corpus2alpino-0.2.3-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "e3cf86c8836a91e471882d80a9c0ff5b", "sha256": "e54f249573594f032c7d856f39f332bf62d358af9ae92df2989e34e555f8674a" }, "downloads": -1, "filename": "corpus2alpino-0.2.3.tar.gz", "has_sig": false, "md5_digest": "e3cf86c8836a91e471882d80a9c0ff5b", "packagetype": "sdist", "python_version": "source", "requires_python": ">=3.6", "size": 13484, "upload_time": "2018-10-31T10:10:57", "url": "https://files.pythonhosted.org/packages/4e/80/1eb44d480ce8411feebb21514968c8aef20993317b03d3a21acd4b67992c/corpus2alpino-0.2.3.tar.gz" } ] }