{
"info": {
"author": "Mats Byrkjeland",
"author_email": "matsbyr@gmail.com",
"bugtrack_url": null,
"classifiers": [
"Development Status :: 4 - Beta",
"Intended Audience :: Developers",
"License :: OSI Approved :: MIT License",
"Programming Language :: Python :: 3",
"Programming Language :: Python :: 3.3",
"Programming Language :: Python :: 3.4",
"Programming Language :: Python :: 3.5",
"Topic :: Scientific/Engineering :: Artificial Intelligence"
],
"description": "The Oslo-Bergen Tagger for Python\n=================================\n\nThis is a Python library for `The Oslo-Bergen\nTagger `__, which parses the output\nof the tagger to a friendly format. Only Python 3 is supported at this\ntime.\n\nThe library is in beta. See `Roadmap <#roadmap>`__ for things that need\nto get implemented before a v1.0.0 can be released.\n\nInstallation\n------------\n\nYou need to have The Oslo-Bergen Tagger installed, and the environment\nvariable ``OBT_PATH`` set to the path of its installation directory. You\ncan use the provided code snippet below, or install it using the\ninstructions in `The-Oslo-Bergen-Tagger GitHub\nrepository `__. The\nfollowing code snippet installs it in your home directory. If you want\nto install it somewhere else, you can change the ``INSTALL_DIR``\nvariable on the first line to your preferred installation directory.\n\n.. code:: bash\n\n INSTALL_DIR=$HOME\n THIS_DIR=$PWD\n cd $INSTALL_DIR\n git clone git@github.com:noklesta/The-Oslo-Bergen-Tagger.git\n cd The-Oslo-Bergen-Tagger\n ./bootstrap.sh\n export OBT_PATH=$INSTALL_DIR/The-Oslo-Bergen-Tagger\n echo 'export OBT_PATH=$OBT_PATH' >> $HOME/.bashrc\n cd $THIS_DIR\n\nYou can then install this Python library with pip. To install for all\nusers, do:\n\n.. code:: bash\n\n sudo pip3 install obt\n\nTo just install for your user, do:\n\n.. code:: bash\n\n pip3 install --user obt\n\nAnd you are good to go!\n\nUsage\n-----\n\nFirst, import the library\n\n.. code:: python\n\n import obt\n\nThen, you can tag a string by passing it to the ``tag_bm`` function:\n\n.. code:: python\n\n my_string = \"Jeg er streng.\"\n tags = obt.tag_bm(my_string)\n\nOr you can pass a file name using the ``file`` keyword argument:\n\n.. code:: python\n\n tags = obt.tag_bm(file=\"my_document.txt\")\n\nThe resulting ``tags`` will be an array of tag objects, like so:\n\n.. code:: python\n\n [\n {\n \"tall\": \"ent\",\n \"type\": \"pers hum\",\n \"base\": \"jeg\",\n \"person\": \"1\",\n \"word_tag\": \"\",\n \"kasus\": \"nom\",\n \"raw_tags\": \"pron ent pers hum nom 1\",\n \"word\": \"Jeg\",\n \"ordklasse\": \"pron\"\n },\n {\n \"word_tag\": \"\",\n \"base\": \"v\\u00e6re\",\n \"tilleggstagger\": [\n \"a5\",\n \"pr1\",\n \"pr2\",\n \"\"\n ],\n \"tid\": \"pres\",\n \"raw_tags\": \"verb pres a5 pr1 pr2 \",\n \"word\": \"er\",\n \"ordklasse\": \"verb\"\n },\n {\n \"type\": \"appell\",\n \"best\": \"ub\",\n \"base\": \"streng\",\n \"word_tag\": \"\",\n \"tall\": \"ent\",\n \"ordklasse\": \"subst\",\n \"raw_tags\": \"subst appell mask ub ent\",\n \"word\": \"streng\",\n \"kj\\u00f8nn\": \"mask\"\n },\n {\n \"word_tag\": \"<.>\",\n \"base\": \"$.\",\n \"tilleggstagger\": [\n \"<<<\",\n \"\",\n \"<<<\"\n ],\n \"raw_tags\": \"clb <<< <<<\",\n \"word\": \".\",\n \"ordklasse\": \"clb\"\n }\n ]\n\nYou can easily save this to a JSON file with the ``obt.save_json``\nfunction:\n\n.. code:: python\n\n obt.save_json(tags, 'my_tags.json')\n\nFormat\n------\n\nA documentation of the tag format will come here.\n\nRoadmap\n-------\n\nBefore a v1.0.0 release, the following boxes should be checked: - [ ]\nPut \"tilleggstagger\" in proper items in tags object. - [ ] Implement\nfunction for ``./tag-nostat-bm.sh`` from\nhttps://github.com/noklesta/The-Oslo-Bergen-Tagger - [ ] Implement\nfunction for ``./tag-nostat-nn.sh`` from\nhttps://github.com/noklesta/The-Oslo-Bergen-Tagger - [ ] Python 2\nsupport",
"description_content_type": null,
"docs_url": null,
"download_url": "UNKNOWN",
"downloads": {
"last_day": -1,
"last_month": -1,
"last_week": -1
},
"home_page": "https://github.com/draperunner/obt",
"keywords": "pos-tagging nlp pos",
"license": "MIT",
"maintainer": null,
"maintainer_email": null,
"name": "obt",
"package_url": "https://pypi.org/project/obt/",
"platform": "UNKNOWN",
"project_url": "https://pypi.org/project/obt/",
"project_urls": {
"Download": "UNKNOWN",
"Homepage": "https://github.com/draperunner/obt"
},
"release_url": "https://pypi.org/project/obt/0.1.0/",
"requires_dist": null,
"requires_python": null,
"summary": "A Python library for The Oslo-Bergen Tagger",
"version": "0.1.0"
},
"last_serial": 2918442,
"releases": {
"0.1.0": [
{
"comment_text": "",
"digests": {
"md5": "f7d3d8f69471e62f5acd173a59c7babb",
"sha256": "472a6ff553dbccf69250b3242e9a04599cd7d07ff3c6616ea4b4fe454cc2ca48"
},
"downloads": -1,
"filename": "obt-0.1.0.tar.gz",
"has_sig": false,
"md5_digest": "f7d3d8f69471e62f5acd173a59c7babb",
"packagetype": "sdist",
"python_version": "source",
"requires_python": null,
"size": 4309,
"upload_time": "2017-06-01T22:37:39",
"url": "https://files.pythonhosted.org/packages/8a/8e/5b47b6a88f031683323c59f449b752b15ee4fb302c33c05fbdbb61d10791/obt-0.1.0.tar.gz"
}
]
},
"urls": [
{
"comment_text": "",
"digests": {
"md5": "f7d3d8f69471e62f5acd173a59c7babb",
"sha256": "472a6ff553dbccf69250b3242e9a04599cd7d07ff3c6616ea4b4fe454cc2ca48"
},
"downloads": -1,
"filename": "obt-0.1.0.tar.gz",
"has_sig": false,
"md5_digest": "f7d3d8f69471e62f5acd173a59c7babb",
"packagetype": "sdist",
"python_version": "source",
"requires_python": null,
"size": 4309,
"upload_time": "2017-06-01T22:37:39",
"url": "https://files.pythonhosted.org/packages/8a/8e/5b47b6a88f031683323c59f449b752b15ee4fb302c33c05fbdbb61d10791/obt-0.1.0.tar.gz"
}
]
}