{ "info": { "author": "LOGILAB S.A. (Paris, FRANCE)", "author_email": "contact@logilab.fr", "bugtrack_url": null, "classifiers": [ "Environment :: Web Environment", "Framework :: CubicWeb", "Programming Language :: JavaScript", "Programming Language :: Python" ], "description": "Summary\n-------\nCube for news storage and analysis\n\n\n\nThis cube provides an implementation of Semnews:\n\n - store news articles and tweets.\n\n - extract and synthetize information.\n\n - provide semantic useful and original visualisation.\n\n - analytics tools and datamining/machine learning processings.\n\n\n\nInstallation\n------------\n\nCreation of the instance:\n\n * Create an instance using: cubicweb-ctl create semnews \n\n * Create the instance's database using: cubicweb-ctl db-create \n\n\nAdd articles sources\n--------------------\n\nSource of articles could be created using:\n\n * Blogs/RSS feeds::\n\n session.create_entity('CWSource', name=, type=u'datafeed',\n parser=u'rss-parser', lang=,\n url=,\n config=u'synchronization-interval=120min')\n\n * Tweet::\n\n session.create_entity('CWSource', name=, type=u'datafeed',\n parser=u'tweet-parser', lang=,\n url=,\n config=u'synchronization-interval=120min')\n\n\nThe synchronization interval could be setted to a more specific value, or setted to \"no\" for manual synchronization\nonly.\n\n\nSemnews comes with some predifined blogs/tweets/rss feeds:\n\n\n * Some french political blogs. You can add them using::\n\n cubicweb-ctl shell /migration/examples_blogs_fr.py\n\n\n * Some international english newspapers. You can add them using::\n\n cubicweb-ctl shell /migration/examples_newspapers.py\n\n\n * Some french newspapers. You can add them using::\n\n cubicweb-ctl shell /migration/examples_newspapers_fr.py\n\n\n * Some french politician tweets. You can add them using::\n\n cubicweb-ctl shell /migration/examples_twitters_fr.py\n\n\n\nAdd Named Entities sources\n--------------------------\n\nSemnews is based on a named entities process, that you have to define::\n\n\n session.create_entity('NerProcess', name=, host=,\n type=, lang=,\n request=)\n\nSee the document of the NER cube for more details.\nExample of source::\n\n session.create_entity('NerProcess', name=u'dbpedia38-en', host=u'ner',\n type=u'rql', lang=u'en',\n request=u'Any U WHERE X label %(token)s, X cwuri U, '\n 'X ner_source NS, NS name \"dbpedia38-en\"')\n\n\n\n\nCommands\n--------\n\nSemnews provide to commands:\n\n * A command to extract named entities from articles::\n\n cubicweb-ctl process-ner \n\n\n * A command to cleanup recognized entities according to some Dbpedia categories\n (see entities/external_resources.py)::\n\n cubicweb-ctl cleanup-ner ", "description_content_type": null, "docs_url": null, "download_url": "UNKNOWN", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "http://www.cubicweb.org/project/cubicweb-semnews", "keywords": null, "license": "LGPL", "maintainer": null, "maintainer_email": null, "name": "cubicweb-semnews", "package_url": "https://pypi.org/project/cubicweb-semnews/", "platform": "UNKNOWN", "project_url": "https://pypi.org/project/cubicweb-semnews/", "project_urls": { "Download": "UNKNOWN", "Homepage": "http://www.cubicweb.org/project/cubicweb-semnews" }, "release_url": "https://pypi.org/project/cubicweb-semnews/0.2.0/", "requires_dist": null, "requires_python": null, "summary": "Cube for news storage and analysis", "version": "0.2.0" }, "last_serial": 1044234, "releases": { "0.1.0": [ { "comment_text": "", "digests": { "md5": "ba68caf6b3a8dcff1d8f829efe56b9a0", "sha256": "cf0f8966e8d84ea45f587951324b2e1d01157d1a33572e3def5e270dafa68448" }, "downloads": -1, "filename": "cubicweb-semnews-0.1.0.tar.gz", "has_sig": false, "md5_digest": "ba68caf6b3a8dcff1d8f829efe56b9a0", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 148757, "upload_time": "2013-06-11T13:03:31", "url": "https://files.pythonhosted.org/packages/16/c4/23eaa08f77d8fe7da370d853843c4ff0d7059501527dc80cc487afdf49c6/cubicweb-semnews-0.1.0.tar.gz" } ], "0.2.0": [ { "comment_text": "", "digests": { "md5": "da4030b45f98087cd411f2de564862fe", "sha256": "6e2fc0d98cc4be03e530c1b0cb1e42b23bb1a3238610e8f073f7756f640f207c" }, "downloads": -1, "filename": "cubicweb-semnews-0.2.0.tar.gz", "has_sig": false, "md5_digest": "da4030b45f98087cd411f2de564862fe", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 151225, "upload_time": "2014-03-28T10:46:32", "url": "https://files.pythonhosted.org/packages/a6/f2/3835295e596c255acf22754beb5d13ee32bfc8a46ac59dc1c2c7b8f36ac7/cubicweb-semnews-0.2.0.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "da4030b45f98087cd411f2de564862fe", "sha256": "6e2fc0d98cc4be03e530c1b0cb1e42b23bb1a3238610e8f073f7756f640f207c" }, "downloads": -1, "filename": "cubicweb-semnews-0.2.0.tar.gz", "has_sig": false, "md5_digest": "da4030b45f98087cd411f2de564862fe", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 151225, "upload_time": "2014-03-28T10:46:32", "url": "https://files.pythonhosted.org/packages/a6/f2/3835295e596c255acf22754beb5d13ee32bfc8a46ac59dc1c2c7b8f36ac7/cubicweb-semnews-0.2.0.tar.gz" } ] }