2020-01-24T00:17:42.095401 | dossier | error | this url returns a hard 404: http://www.europarl.europa.eu/oeil/popups/ficheprocedure.do?reference=2018/2633(RSP)&l=en |
2020-01-24T00:17:59.340706 | dossier | error | this url returns a hard 404: https://oeil.secure.europarl.europa.eu/oeil//popups/ficheprocedure.do?reference=2018/0063(COD)&l=en |
2020-01-24T00:18:18.113145 | dossier | error | this url returns a hard 404: http://www.europarl.europa.eu/oeil/popups/ficheprocedure.do?reference=2017/2840(RSP)&l=en |
2020-01-24T00:18:21.597335 | dossier | error | this url returns a hard 404: http://www.europarl.europa.eu/oeil/popups/ficheprocedure.do?reference=2012/2047(INI)&l=en |
2020-01-24T00:19:28.846426 | dossier | error | this url returns a hard 404: http://www.europarl.europa.eu/oeil/popups/ficheprocedure.do?reference=2015/2148(INI)&l=en |
2020-01-24T00:20:39.019636 | dossier | error | this url returns a hard 404: http://www.europarl.europa.eu/oeil/popups/ficheprocedure.do?reference=2017/2848(RSP)&l=en |
2020-01-24T00:20:44.984279 | dossier | error | this url returns a hard 404: http://www.europarl.europa.eu/oeil/popups/ficheprocedure.do?reference=2018/2669(RSP)&l=en |
2020-01-24T00:20:52.660865 | dossier | error | no content found for section "Attention!" of 2015/2542(DEA) |
2020-01-24T00:20:52.714913 | dossier | error | this url returns a hard 404: http://www.europarl.europa.eu/oeil/popups/ficheprocedure.do?reference=2014/2230(INI)&l=en |
2020-01-24T00:20:52.732306 | dossier | error | no content found for section "Delegated act was actually rejected!" of 2015/2542(DEA) |
2020-01-24T00:21:42.047106 | dossier | error | this url returns a hard 404: http://www.europarl.europa.eu/oeil/popups/ficheprocedure.do?reference=2016/0282(COD)&l=en |
2020-01-24T00:21:42.174613 | dossier | error | this url returns a hard 404: http://www.europarl.europa.eu/oeil/popups/ficheprocedure.do?reference=2015/2222(INI)&l=en |
2020-01-24T00:21:43.397207 | dossier | error | this url returns a hard 404: http://www.europarl.europa.eu/oeil/popups/ficheprocedure.do?reference=2017/2024(INL)&l=en |
2020-01-24T00:23:01.240813 | dossier | error | this url returns a hard 404: http://www.europarl.europa.eu/oeil/popups/ficheprocedure.do?reference=2016/0400(COD)&l=en |
2020-01-24T00:26:00.643864 | dossier | error | this url returns a hard 404: https://oeil.secure.europarl.europa.eu/oeil/popups/ficheprocedure.do?reference=2019/2110(INI)&l=en |
2020-01-24T00:27:22.542009 | amendment | error | could not find non-empty line above pagebreak in http://www.europarl.europa.eu/doceo/document/LIBE-AM-646874_EN.pdf |
2020-01-24T00:27:22.542555 | mgr | error | failed to execute amendment job {'onfinished': {'daisy': True}, 'url': 'http://www.europarl.europa.eu/doceo/document/LIBE-AM-646874_EN.pdf', 'meps': 'Bettina VOLLATH'} (ValueError('no EN marker found: http://www.europarl.europa.eu/doceo/document/LIBE-AM-646874_EN.pdf')) |
2020-01-24T00:27:22.606465 | mgr | error | Traceback (most recent call last): |
File "/home/pt/pt2/scraper_service.py", line 63, in consume
ret = scraper.scrape(**job)
File "scrapers/amendment.py", line 653, in scrape
text, PE=getraw(url)
File "scrapers/amendment.py", line 55, in getraw
return unpaginate(text,pdf)
File "scrapers/amendment.py", line 269, in unpaginate
raise ValueError("no EN marker found: %s" % url)
ValueError: no EN marker found: http://www.europarl.europa.eu/doceo/document/LIBE-AM-646874_EN.pdf |
2020-01-24T00:27:23.764418 | amendment | error | could not find non-empty line above pagebreak in http://www.europarl.europa.eu/doceo/document/LIBE-AM-646877_EN.pdf |
2020-01-24T00:27:23.785917 | mgr | error | failed to execute amendment job {'onfinished': {'daisy': True}, 'url': 'http://www.europarl.europa.eu/doceo/document/LIBE-AM-646877_EN.pdf', 'meps': 'Bettina VOLLATH'} (ValueError('no EN marker found: http://www.europarl.europa.eu/doceo/document/LIBE-AM-646877_EN.pdf')) |
2020-01-24T00:27:23.966036 | mgr | error | Traceback (most recent call last): |
File "/home/pt/pt2/scraper_service.py", line 63, in consume
ret = scraper.scrape(**job)
File "scrapers/amendment.py", line 653, in scrape
text, PE=getraw(url)
File "scrapers/amendment.py", line 55, in getraw
return unpaginate(text,pdf)
File "scrapers/amendment.py", line 269, in unpaginate
raise ValueError("no EN marker found: %s" % url)
ValueError: no EN marker found: http://www.europarl.europa.eu/doceo/document/LIBE-AM-646877_EN.pdf |
2020-01-24T00:28:23.879390 | amendment | error | [!] couldn't find ref: Committee on Development |
2020-01-24T00:28:24.982089 | amendment | error | [!] couldn't find ref: Committee on Development |
2020-01-24T00:28:24.991248 | amendment | error | couldn't find dossier reference in source pdf: http://www.europarl.europa.eu/doceo/document/DEVE-AM-641421_EN.pdf |
2020-01-24T00:28:26.960940 | amendment | error | [!] couldn't find ref: Committee on Development |
2020-01-24T00:28:26.981448 | amendment | error | couldn't find dossier reference in source pdf: http://www.europarl.europa.eu/doceo/document/DEVE-AM-641422_EN.pdf |
2020-01-24T00:29:36.210060 | amendment | error | [!] couldn't find ref: Committee on Economic and Monetary Affairs |
2020-01-24T00:29:42.192528 | amendment | error | [!] couldn't find ref: Committee on Economic and Monetary Affairs |
2020-01-24T00:30:21.047167 | amendment | error | [!] couldn't find ref: Committee on the Environment, Public Health and Food Safety |
2020-01-24T00:32:42.808554 | amendment | error | [!] couldn't find ref: Committee on Development |