{ "info": { "author": "Yuka Langbuana", "author_email": "langbuana.yuka@gmail.com", "bugtrack_url": null, "classifiers": [], "description": "# AMUNRA\n**amunra** is a tool that aims to simplify the process of data gathering on news website. Data gathering itself is time consuming, *I intend to make it quicker*. \n\nIt automatically gathers links from a given search query, and also extract the content of a page from a link. Please note that currently **amunra** can only gather data from [Detik.com](https://www.detik.com) and [Kompasiana.com](https://www.kompasiana.com)\n\n**More websites are coming soon!**\n\nInstallation\n------------\n\nInstall with `pip`\n```\n$ pip install amunra\n```\n\nNow you can load the scrapper!\n```\nimport amunra\n\nra = amunra.DetikScrapper()\n```\n\nUsage\n------\n### Gather data instantly\n```\nra.gather_data(\"detik_saham.csv\", \"saham\", \"detik\", 2)\n```\nOpen `detik_saham.csv` and you will have a nice csv file ready for [pandas](https://pandas.pydata.org/)\n```\nurl,title,body,word_count\n\"https://news.detik.com/berita/d-4441490/kampanye-di-bali-sandiaga-janji-tolak-reklamasi\",\"Kampanye di Bali Sandiaga Janji Tolak Reklamasi\",bali cawapres sandiaga uno mengatakan menolak proyek reklamasi yang merusak lingkungan alam khusus untuk tanjung benoa bali sandiaga janji akan mereview kebijakan itu jika lebih banyak merugikan nelayanhal itu disampaikan sandiaga saat berdialog dengan masyarakat tanjung benoa bendesa adat tanjung benoa made wijaya meminta sandiaga untuk meninjau kembali peraturan untuk proyek yang merusak lingkungan jika terpilih nanti kami berharap jika terpilih pak sandi bisa meninjau kembali proyek yang merusak terumbu karang dan kelangsungan hidup anak cucu juga kesejahteraan para nelayan ucap made wijaya di tanjung benoa bali minggu 2422019baca juga sandiaga ingin kembangkan pariwisata halal di balisandiaga menuturkan isu reklamasi ini juga dia bawa saat kampanye pilgub dki 2017 lalu pemberhentian reklamasi teluk jakarta pun akhirnya dia penuhi saat menjabat sebagai wagub dki jakarta bersama gubernur dki anies baswedan kalau masyarakat bali merasa reklamasi merusak lingkungan dan mengancam penghidupan para nelayan bersama masyarakat bali prabowosandi akan menolak reklamasi tegas sandi baca juga instruksi sandiaga ke emakemak kawal tps selfie dengan form c1sandiaga berjanji dia akan konsisten memenuhi janji kampanyenya apalagi jika proyek reklamasi itu lebih banyak merugikan masyarakat karena janji itu utang jika tidak ditagih di dunia akan kena di akhirat ucap sandisimak juga catat nih sandiaga janji tak lagi makan apel imporgambasvideo 20detik,204\n...\n```\n---\n\n### Getting links from a search term\n```\nra.get_urls(\"urls.txt\", \"saham\", \"detik\", 2)\n```\nOpen `urls.txt` and you should see this:\n```\nhttps://finance.detik.com/bursa-dan-valas/d-4422865/balik-ke-zona-merah-ihsg-parkir-di-6515\nhttps://finance.detik.com/bursa-dan-valas/d-4422769/menguat-140-saham-smartfren-dipelototi-bei\nhttps://finance.detik.com/bursa-dan-valas/d-4422490/ihsg-dibuka-menguat-tipis-di-awal-pekan\nhttps://finance.detik.com/foto-bisnis/d-4414756/mengintip-bangunan-lantai-bursa-di-china\nhttps://finance.detik.com/foto-bisnis/d-4379933/melantai-di-bursa-saham-kibif-naik-4118\nhttps://finance.detik.com/foto-bisnis/d-4375140/bank-mandiri-angkat-direktur-komersial-baru\nhttps://finance.detik.com/foto-bisnis/d-4370450/bri-kembali-tunjuk-wadirut-baru\nhttps://finance.detik.com/foto-bisnis/d-4354330/sah-kini-51-saham-freeport-milik-indonesia\nhttps://finance.detik.com/perencanaan-keuangan/d-4421163/gaji-rp-3-juta-mau-naik-haji-bisa-kok\nhttps://finance.detik.com/bursa-dan-valas/d-4420943/dirut-bank-danamon-dapat-hibah-saham-rp-3-miliar\n...\n```\n---\n### Getting content from a link\n```\nra.parse_from_url(\"output.csv\", \n\"https://finance.detik.com/bursa-dan-valas/d-4422769/menguat-140-saham-smartfren-dipelototi-bei\")\n```\nOpen `output.csv` and you will have a nice csv file ready for [pandas](https://pandas.pydata.org/)\n```\nurl,title,body,word_count\n\"https://finance.detik.com/bursa-dan-valas/d-4422769/menguat-140-saham-smartfren-dipelototi-bei\",\"Menguat 140% Saham Smartfren Dipelototi BEI\",jakarta pt bursa efek indonesia bei meningkatkan pengawasan terhadap saham pt smartfren telecom tbk fren saham fren kini masuk dalam daftar unusual market activity umabei memandang adanya pergerakan harga yang di luar kewajaran di saham fren oleh karena itu pelaku pasar diminta untuk mencermati lebih dalam terhadap saham frensehubungan dengan terjadinya uma atas saham fren tersebut perlu kami sampaikan bahwa bursa saat ini sedang mencermati perkembangan pola transaksi saham ini kata kepala divisi pengawasan transaksi bei lidia m pandjaitan dilansir dari keterbukaan informasi senin 1122019jika dilihat saham fren memang terus menguat setidaknya selama sebulan ini tercatat saham fren yang kini bertengger di level rp 228 sudah meningkat 140 dalam waktu 1 bulanpeningkatan drastis terjadi pada seminggu kebelakang ini pada perdagangan 6 februari 2018 saham fren dalam sehari bisa meroker 333baca juga tips memilih saham zombiehingga siang ini saham fren tercatat sudah menguat 755 ke posisi rp 228 di posisi itu saham fren sudah ditransaksikan sebanyak rp 712 miliar ,158\n```\n\nYou can also do a batch operation by loading up a file from the above `get_urls()` function and load it to **amunra**:\n```\nra.parse_from_file(\"output.csv\", \"urls.txt\")\n```\n\n\n", "description_content_type": "text/markdown", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/syahbandar/amunra", "keywords": "", "license": "", "maintainer": "", "maintainer_email": "", "name": "amunra", "package_url": "https://pypi.org/project/amunra/", "platform": "", "project_url": "https://pypi.org/project/amunra/", "project_urls": { "Homepage": "https://github.com/syahbandar/amunra" }, "release_url": "https://pypi.org/project/amunra/0.2/", "requires_dist": [ "bs4", "requests" ], "requires_python": "", "summary": "A news portal scraping tool", "version": "0.2" }, "last_serial": 4896762, "releases": { "0.1.2": [ { "comment_text": "", "digests": { "md5": "e04a77b1f479c05d16c443b4fbcc2939", "sha256": "88a0e142a9df7795b11fedd80e9caa01110f9d6e001ea8758e142625374e590e" }, "downloads": -1, "filename": "amunra-0.1.2.tar.gz", "has_sig": false, "md5_digest": "e04a77b1f479c05d16c443b4fbcc2939", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 3550, "upload_time": "2019-02-12T11:01:53", "url": "https://files.pythonhosted.org/packages/51/74/ab6f87556f5d32136fd743fafb340f0166307b8df917164e5e29d94ad3ce/amunra-0.1.2.tar.gz" } ], "0.1.3": [ { "comment_text": "", "digests": { "md5": "44bdb1da915136d75177296b412fb51b", "sha256": "b8ae5bbf598acb288c76fcbe73bc732a21cf7931a29b4c866c7f7b37f5e76c99" }, "downloads": -1, "filename": "amunra-0.1.3.tar.gz", "has_sig": false, "md5_digest": "44bdb1da915136d75177296b412fb51b", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 3913, "upload_time": "2019-02-12T11:11:03", "url": "https://files.pythonhosted.org/packages/39/71/3a802c3e14527368ea00d74ee6067a02f3ba90bb2ef4fb40db6769b245a0/amunra-0.1.3.tar.gz" } ], "0.2": [ { "comment_text": "", "digests": { "md5": "2aa6563b17475e344e07cecb7e98ec0c", "sha256": "9d6cbc1f9adcc6d28dd9b7d47f643a50aa64fef1eead7baae6bb6644ee569341" }, "downloads": -1, "filename": "amunra-0.2-py3-none-any.whl", "has_sig": false, "md5_digest": "2aa6563b17475e344e07cecb7e98ec0c", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 4109, "upload_time": "2019-03-05T00:05:50", "url": "https://files.pythonhosted.org/packages/db/b3/866bb9bc34fc7ad81d7d3cc64f409f10a0a538fa42209d7ff4b289d4a6e5/amunra-0.2-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "4213484dd9c276ef17211c4657ec4b52", "sha256": "16f704d53011a7d4a709b4d56955ca51c8e617638c988ceeff0bf06fc05ec4c7" }, "downloads": -1, "filename": "amunra-0.2.tar.gz", "has_sig": false, "md5_digest": "4213484dd9c276ef17211c4657ec4b52", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 3487, "upload_time": "2019-03-05T00:05:51", "url": "https://files.pythonhosted.org/packages/06/d6/74d585b53c8e87466d0bfe943f7f5ec2810272665b85cf460a016f6a3a63/amunra-0.2.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "2aa6563b17475e344e07cecb7e98ec0c", "sha256": "9d6cbc1f9adcc6d28dd9b7d47f643a50aa64fef1eead7baae6bb6644ee569341" }, "downloads": -1, "filename": "amunra-0.2-py3-none-any.whl", "has_sig": false, "md5_digest": "2aa6563b17475e344e07cecb7e98ec0c", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 4109, "upload_time": "2019-03-05T00:05:50", "url": "https://files.pythonhosted.org/packages/db/b3/866bb9bc34fc7ad81d7d3cc64f409f10a0a538fa42209d7ff4b289d4a6e5/amunra-0.2-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "4213484dd9c276ef17211c4657ec4b52", "sha256": "16f704d53011a7d4a709b4d56955ca51c8e617638c988ceeff0bf06fc05ec4c7" }, "downloads": -1, "filename": "amunra-0.2.tar.gz", "has_sig": false, "md5_digest": "4213484dd9c276ef17211c4657ec4b52", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 3487, "upload_time": "2019-03-05T00:05:51", "url": "https://files.pythonhosted.org/packages/06/d6/74d585b53c8e87466d0bfe943f7f5ec2810272665b85cf460a016f6a3a63/amunra-0.2.tar.gz" } ] }