{ "info": { "author": "LOGILAB S.A. (Paris, FRANCE)", "author_email": "contact@logilab.fr", "bugtrack_url": null, "classifiers": [ "Environment :: Web Environment", "Framework :: CubicWeb", "Programming Language :: JavaScript", "Programming Language :: Python" ], "description": "Summary\n-------\nCube for named entities source and recognition (NER).\n\n\nThis cube provides:\n\n - the notion of NerSource (i.e. Named Entities Source), e.g. dbpedia or dbpedia-en (for\n Dbpedia in english).\n\n - the notion of NerEntry, which is a token/word/entry that could be recognized.\n Basically it requires a \"label\" and a \"cwuri\", but an \"unormalize_label\"\n could be given for quicker match, a \"weight\" for disambiguation or\n a \"lang\" for sorting. It should be related to a NerSource.\n\n - the notion of NerProcess, which is an entity type that stores the parameters\n for a Named Entities Recognition: a \"name\", an \"host\" (appid or url of a sparql endpoint),\n a request (RQL or SPARQL, with the \"token\" key for substitution), a type ('rql' or 'sparql')\n for now, and a lang (for sorting).\n\n\nBasically a lexic could be defined (NerSource), that contains entries (NerEntry).\nThus processes (NerProcess) could be defined in other applications to retrieve these entries\nin some content.\n\n\n\nInstallation\n------------\n\nCreation of the instance:\n\n * Create an instance using: cubicweb-ctl create ner \n\n * Create the instance's database using: cubicweb-ctl db-create \n\n\nCreating entities\n-----------------\n\nFor creating a NerSource (in a cw shell):\n\n session.create_entity('NerSource', name=)\n\nE.g.:\n\n session.create_entity('NerSource', name=u\"dbpedia-38-en\")\n\n\nFor creating a simple NerEntry (in a cw shell):\n\n session.create_entity('NerEntry', label=