{ "info": { "author": "Crossref", "author_email": "labs@crossref.org", "bugtrack_url": null, "classifiers": [ "License :: OSI Approved :: MIT License", "Operating System :: OS Independent", "Programming Language :: Python :: 3" ], "description": "# Citation style classifier\n\nCitation style classifier can automatically infer citation style from a reference string. The classifier is a Logistic Regression model trained on 90,000 reference strings. The following citation styles are supported by default:\n\n * acm-sig-proceedings\n * american-chemical-society\n * american-chemical-society-with-titles\n * american-institute-of-physics\n * american-sociological-association\n * apa\n * bmc-bioinformatics\n * chicago-author-date\n * elsevier-without-titles\n * elsevier-with-titles\n * harvard3\n * ieee\n * iso690-author-date-en\n * modern-language-association\n * springer-basic-author-date\n * springer-lecture-notes-in-computer-science\n * vancouver\n * unknown\n\nThe package contains the training data, the classification model, and the code for feature extraction, selection, training and prediction.\n\n## Installation\n\n pip3 install styleclass\n\n## Classification\n\nFrom command line:\n\n styleclass_classify -r \"reference string\"\n styleclass_classify -i /file/with/reference/strings/one/per/line -o /output/file\n\nIn Python code:\n\n from styleclass.classify import classify\n from styleclass.train import get_default_model\n\n model = get_default_model()\n prediction = classify(\"reference string\", *model)\n prediction = classify([\"reference string #1\", \"reference string #2\", \"reference string #3\"], *model)\n\n## Data\n\nStyleclass package contains [two datasets](https://gitlab.com/crossref/citation_style_classifier/tree/master/styleclass/datasets): training set and test set. Each of them contains a sample of 5,000 DOIs formatted in 17 citation styles (listed above), which gives 85,000 reference strings. Both datasets were generated automatically using Crossref REST API.\n\nA new dataset can be generated using the script `styleclass_generate_dataset`.\n\n## Models\n\nThe [default model](https://gitlab.com/crossref/citation_style_classifier/tree/master/styleclass/models) was trained on the training dataset. Before the training, the dataset was cleaned and enriched with random noise. 5,000 strings with \"unknown\" style were also generated and added to the dataset.\n\nScript `styleclass_train_model` can be used to train a new model. This is useful especially when you need to operate of a different set of citation styles than our default. The script prepares the data for training in the same was as was done for training of the default model.\n\n## Evaluation\n\n`styleclass_evaluate` script can be used to evaluate exisitng model on a test set, in terms of accuracy.\n\nThe accuracy of the default model estimated on our test set is 95%.\n\n\n", "description_content_type": "text/markdown", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://gitlab.com/crossref/citation_style_classifier", "keywords": "", "license": "", "maintainer": "", "maintainer_email": "", "name": "styleclass", "package_url": "https://pypi.org/project/styleclass/", "platform": "", "project_url": "https://pypi.org/project/styleclass/", "project_urls": { "Homepage": "https://gitlab.com/crossref/citation_style_classifier" }, "release_url": "https://pypi.org/project/styleclass/0.0.5/", "requires_dist": [ "scipy (==1.3.0)", "numpy (==1.16.4)", "pandas (==0.25.0)", "scikit-learn (==0.21.2)" ], "requires_python": "", "summary": "Citation style classifier", "version": "0.0.5" }, "last_serial": 5874920, "releases": { "0.0.1": [ { "comment_text": "", "digests": { "md5": "1c4bf1097c47d066cbf82a924d143c0c", "sha256": "c6352a33961f798d75517b293192b25654e1c48ef9f56b889f24beac38693bd6" }, "downloads": -1, "filename": "styleclass-0.0.1-py3-none-any.whl", "has_sig": false, "md5_digest": "1c4bf1097c47d066cbf82a924d143c0c", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 5976748, "upload_time": "2019-07-26T20:34:52", "url": "https://files.pythonhosted.org/packages/86/7c/60e549b11435e1c2e8a5d29ff959bf10c19190162c9f690ccb240442d6ba/styleclass-0.0.1-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "524948fd7b21b2cf1b32a935414f2c03", "sha256": "682ae33a64b0509a8ffae1e6e8271fb8933ea72b3e74d9f8f5a1c793e353c017" }, "downloads": -1, "filename": "styleclass-0.0.1.tar.gz", "has_sig": false, "md5_digest": "524948fd7b21b2cf1b32a935414f2c03", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5957526, "upload_time": "2019-07-26T20:34:56", "url": "https://files.pythonhosted.org/packages/0c/1f/44dde54500b64ffd6643554aea7a190d04571259a5fc948e6811cb8bc4c4/styleclass-0.0.1.tar.gz" } ], "0.0.2": [ { "comment_text": "", "digests": { "md5": "2f627483ccb056ceb662f33072777130", "sha256": "eb20acc7542a3cd5196ee16fe832e348bc0c5e74869f11661991063150f3ada9" }, "downloads": -1, "filename": "styleclass-0.0.2-py3-none-any.whl", "has_sig": false, "md5_digest": "2f627483ccb056ceb662f33072777130", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 5943466, "upload_time": "2019-08-02T14:14:25", "url": "https://files.pythonhosted.org/packages/8e/5c/2258be19f3ac7d1239bb9ffe41a08b90ac21b3f383aa81749eff63ba8abc/styleclass-0.0.2-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "69bce52ae4faf3d989aa652f071f8f20", "sha256": "f8a8063ef20782a88a35bcb9f076594bcccb887283c7baa628be6eda34ac7693" }, "downloads": -1, "filename": "styleclass-0.0.2.tar.gz", "has_sig": false, "md5_digest": "69bce52ae4faf3d989aa652f071f8f20", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5923623, "upload_time": "2019-08-02T14:14:28", "url": "https://files.pythonhosted.org/packages/87/73/522fae8cec4cff5b7f6b6ec7f0cadd226418f598ed8466b52077d9ce22c9/styleclass-0.0.2.tar.gz" } ], "0.0.3": [ { "comment_text": "", "digests": { "md5": "e5f01da1130cd757e38d980644c14dd4", "sha256": "c02c0950e26fe11f722eb0cc79859bfede6fe6f1d381e0651fc05ff661016f1e" }, "downloads": -1, "filename": "styleclass-0.0.3-py3-none-any.whl", "has_sig": false, "md5_digest": "e5f01da1130cd757e38d980644c14dd4", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 5943466, "upload_time": "2019-08-05T20:17:17", "url": "https://files.pythonhosted.org/packages/3d/ae/d249f4b3b858248eb0ecf7428d261598d7426db8167052feac8f128823ec/styleclass-0.0.3-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "f32bbceb69d48018f81f533033f710f2", "sha256": "2673413eb4c54857aff2285df1ad646eb5abd65b6508a2a40b3c59821d1c0075" }, "downloads": -1, "filename": "styleclass-0.0.3.tar.gz", "has_sig": false, "md5_digest": "f32bbceb69d48018f81f533033f710f2", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5923628, "upload_time": "2019-08-05T20:17:19", "url": "https://files.pythonhosted.org/packages/95/6d/0406a5b96396190f8bc0fd3a495c07c486f6306bce45e63a3b961935f197/styleclass-0.0.3.tar.gz" } ], "0.0.4": [ { "comment_text": "", "digests": { "md5": "835f5ff1248a94aa2dbcb04388b71516", "sha256": "68900e3982abb276204165304e0c0094e260f084cdd0f5897998dac377fbda19" }, "downloads": -1, "filename": "styleclass-0.0.4-py3-none-any.whl", "has_sig": false, "md5_digest": "835f5ff1248a94aa2dbcb04388b71516", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 5907935, "upload_time": "2019-08-16T18:38:07", "url": "https://files.pythonhosted.org/packages/01/fb/44c871918fa79df71ff81682ee5eabe75712736340900b9954e30032d351/styleclass-0.0.4-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "05e1f9b92740a513f50ca352154e00ae", "sha256": "60c8101e8d06af6b5b8ecdbac8d0c59117940dba776af0889ad6a300ceb54510" }, "downloads": -1, "filename": "styleclass-0.0.4.tar.gz", "has_sig": false, "md5_digest": "05e1f9b92740a513f50ca352154e00ae", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5888421, "upload_time": "2019-08-16T18:38:10", "url": "https://files.pythonhosted.org/packages/18/c8/2a073ada4049b6dbb52e33b3b266f3ae3888f258ed0060fcf6365d1895ac/styleclass-0.0.4.tar.gz" } ], "0.0.5": [ { "comment_text": "", "digests": { "md5": "76f5f7ed6153e2db4e260c0e74656424", "sha256": "34cae3f586548e021a2120e8574abdd4f5ce50c365ceb8d5d2d19c964bf18907" }, "downloads": -1, "filename": "styleclass-0.0.5-py3-none-any.whl", "has_sig": false, "md5_digest": "76f5f7ed6153e2db4e260c0e74656424", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 5907934, "upload_time": "2019-09-23T17:20:16", "url": "https://files.pythonhosted.org/packages/cb/bc/240ca0bbded515eeae2c9596a9a900120d4a2f02352462028f8af563b59c/styleclass-0.0.5-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "dd39f4a47bd6997bf7eca5a6ba96bc3b", "sha256": "f4674a719549a435beb98ef6b8c23c23d4ac47473aaa7cdfd68dfb18a1167d2a" }, "downloads": -1, "filename": "styleclass-0.0.5.tar.gz", "has_sig": false, "md5_digest": "dd39f4a47bd6997bf7eca5a6ba96bc3b", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5888423, "upload_time": "2019-09-23T17:20:19", "url": "https://files.pythonhosted.org/packages/ac/01/009f912b04e27bfeede7f7fb3112bdfeb8f5bf18938f454d872559be31e5/styleclass-0.0.5.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "76f5f7ed6153e2db4e260c0e74656424", "sha256": "34cae3f586548e021a2120e8574abdd4f5ce50c365ceb8d5d2d19c964bf18907" }, "downloads": -1, "filename": "styleclass-0.0.5-py3-none-any.whl", "has_sig": false, "md5_digest": "76f5f7ed6153e2db4e260c0e74656424", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 5907934, "upload_time": "2019-09-23T17:20:16", "url": "https://files.pythonhosted.org/packages/cb/bc/240ca0bbded515eeae2c9596a9a900120d4a2f02352462028f8af563b59c/styleclass-0.0.5-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "dd39f4a47bd6997bf7eca5a6ba96bc3b", "sha256": "f4674a719549a435beb98ef6b8c23c23d4ac47473aaa7cdfd68dfb18a1167d2a" }, "downloads": -1, "filename": "styleclass-0.0.5.tar.gz", "has_sig": false, "md5_digest": "dd39f4a47bd6997bf7eca5a6ba96bc3b", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5888423, "upload_time": "2019-09-23T17:20:19", "url": "https://files.pythonhosted.org/packages/ac/01/009f912b04e27bfeede7f7fb3112bdfeb8f5bf18938f454d872559be31e5/styleclass-0.0.5.tar.gz" } ] }