{ "info": { "author": "Mats Byrkjeland", "author_email": "matsbyr@gmail.com", "bugtrack_url": null, "classifiers": [ "Development Status :: 4 - Beta", "Intended Audience :: Developers", "License :: OSI Approved :: MIT License", "Programming Language :: Python :: 3", "Programming Language :: Python :: 3.3", "Programming Language :: Python :: 3.4", "Programming Language :: Python :: 3.5", "Topic :: Scientific/Engineering :: Artificial Intelligence" ], "description": "The Oslo-Bergen Tagger for Python\n=================================\n\nThis is a Python library for The Oslo-Bergen\nTagger. It parses the output\nof the tagger into a friendly format. Only Python 3 is supported at this\ntime.\n\nThe library is in beta. See `Roadmap <#roadmap>`__ for what needs\nto be implemented before v1.0.0 can be released.\n\nInstallation\n------------\n\nYou need to have The Oslo-Bergen Tagger installed, and the environment\nvariable ``OBT_PATH`` set to the path of its installation directory. You\ncan use the provided code snippet below, or install it using the\ninstructions in `The-Oslo-Bergen-Tagger GitHub\nrepository <https://github.com/noklesta/The-Oslo-Bergen-Tagger>`__. The\nfollowing code snippet installs it in your home directory. If you want\nto install it somewhere else, you can change the ``INSTALL_DIR``\nvariable on the first line to your preferred installation directory.\n\n.. code:: bash\n\n INSTALL_DIR=$HOME\n THIS_DIR=$PWD\n cd $INSTALL_DIR\n git clone git@github.com:noklesta/The-Oslo-Bergen-Tagger.git\n cd The-Oslo-Bergen-Tagger\n ./bootstrap.sh\n export OBT_PATH=$INSTALL_DIR/The-Oslo-Bergen-Tagger\n echo \"export OBT_PATH=$OBT_PATH\" >> $HOME/.bashrc\n cd $THIS_DIR\n\nYou can then install this Python library with pip. To install for all\nusers, do:\n\n.. code:: bash\n\n sudo pip3 install obt\n\nTo just install for your user, do:\n\n.. code:: bash\n\n pip3 install --user obt\n\nAnd you are good to go!\n\nUsage\n-----\n\nFirst, import the library:\n\n.. 
code:: python\n\n import obt\n\nThen, you can tag a string by passing it to the ``tag_bm`` function:\n\n.. code:: python\n\n my_string = \"Jeg er streng.\"\n tags = obt.tag_bm(my_string)\n\nOr you can pass a file name using the ``file`` keyword argument:\n\n.. code:: python\n\n tags = obt.tag_bm(file=\"my_document.txt\")\n\nThe resulting ``tags`` will be an array of tag objects, like so:\n\n.. code:: python\n\n [\n {\n \"tall\": \"ent\",\n \"type\": \"pers hum\",\n \"base\": \"jeg\",\n \"person\": \"1\",\n \"word_tag\": \"\",\n \"kasus\": \"nom\",\n \"raw_tags\": \"pron ent pers hum nom 1\",\n \"word\": \"Jeg\",\n \"ordklasse\": \"pron\"\n },\n {\n \"word_tag\": \"\",\n \"base\": \"v\u00e6re\",\n \"tilleggstagger\": [\n \"a5\",\n \"pr1\",\n \"pr2\",\n \"\"\n ],\n \"tid\": \"pres\",\n \"raw_tags\": \"verb pres a5 pr1 pr2 \",\n \"word\": \"er\",\n \"ordklasse\": \"verb\"\n },\n {\n \"type\": \"appell\",\n \"best\": \"ub\",\n \"base\": \"streng\",\n \"word_tag\": \"\",\n \"tall\": \"ent\",\n \"ordklasse\": \"subst\",\n \"raw_tags\": \"subst appell mask ub ent\",\n \"word\": \"streng\",\n \"kj\u00f8nn\": \"mask\"\n },\n {\n \"word_tag\": \"<.>\",\n \"base\": \"$.\",\n \"tilleggstagger\": [\n \"<<<\",\n \"\",\n \"<<<\"\n ],\n \"raw_tags\": \"clb <<< <<<\",\n \"word\": \".\",\n \"ordklasse\": \"clb\"\n }\n ]\n\nYou can easily save this to a JSON file with the ``obt.save_json``\nfunction:\n\n.. code:: python\n\n obt.save_json(tags, 'my_tags.json')\n\nFormat\n------\n\nDocumentation of the tag format will be added here.\n\nRoadmap\n-------\n\nBefore a v1.0.0 release, the following boxes should be checked:\n\n- [ ] Put \"tilleggstagger\" in proper items in tags object.\n
- [ ] Implement function for ``./tag-nostat-bm.sh`` from\n  https://github.com/noklesta/The-Oslo-Bergen-Tagger\n- [ ] Implement function for ``./tag-nostat-nn.sh`` from\n  https://github.com/noklesta/The-Oslo-Bergen-Tagger\n- [ ] Python 2 support", "description_content_type": null, "docs_url": null, "download_url": "UNKNOWN", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/draperunner/obt", "keywords": "pos-tagging nlp pos", "license": "MIT", "maintainer": null, "maintainer_email": null, "name": "obt", "package_url": "https://pypi.org/project/obt/", "platform": "UNKNOWN", "project_url": "https://pypi.org/project/obt/", "project_urls": { "Download": "UNKNOWN", "Homepage": "https://github.com/draperunner/obt" }, "release_url": "https://pypi.org/project/obt/0.1.0/", "requires_dist": null, "requires_python": null, "summary": "A Python library for The Oslo-Bergen Tagger", "version": "0.1.0" }, "last_serial": 2918442, "releases": { "0.1.0": [ { "comment_text": "", "digests": { "md5": "f7d3d8f69471e62f5acd173a59c7babb", "sha256": "472a6ff553dbccf69250b3242e9a04599cd7d07ff3c6616ea4b4fe454cc2ca48" }, "downloads": -1, "filename": "obt-0.1.0.tar.gz", "has_sig": false, "md5_digest": "f7d3d8f69471e62f5acd173a59c7babb", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 4309, "upload_time": "2017-06-01T22:37:39", "url": "https://files.pythonhosted.org/packages/8a/8e/5b47b6a88f031683323c59f449b752b15ee4fb302c33c05fbdbb61d10791/obt-0.1.0.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "f7d3d8f69471e62f5acd173a59c7babb", "sha256": "472a6ff553dbccf69250b3242e9a04599cd7d07ff3c6616ea4b4fe454cc2ca48" }, "downloads": -1, "filename": "obt-0.1.0.tar.gz", "has_sig": false, "md5_digest": "f7d3d8f69471e62f5acd173a59c7babb", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 4309, "upload_time": "2017-06-01T22:37:39", "url": 
"https://files.pythonhosted.org/packages/8a/8e/5b47b6a88f031683323c59f449b752b15ee4fb302c33c05fbdbb61d10791/obt-0.1.0.tar.gz" } ] }