推荐学习书目
Learn Python the Hard Way
Python Sites
PyPI - Python Package Index
http://diveintopython.org/toc/index.html
Pocoo
值得关注的项目
PyPy
Celery
Jinja2
Read the Docs
gevent
pyenv
virtualenv
Stackless Python
Beautiful Soup
结巴中文分词
Green Unicorn
Sentry
Shovel
Pyflakes
pytest
Python 编程
pep8 Checker
Styles
PEP 8
Google Python Style Guide
Code Style from The Hitchhiker's Guide
Ewig
V2EX  ›  Python

scrapy 爬虫 报错如下

  •  
  •   Ewig · Jan 4, 2019 · 4233 views
    This topic created in 2712 days ago, the information mentioned may be changed or developed.
    2019-01-04 17:55:52 [csrc][scrapy.core.downloader.handlers.http11] WARNING: Got data loss in http://www.csrc.gov.cn/pub/zjhpublic/G00306202/201810/P020181012566963570242.pdf. If you want to process broken responses set the setting DOWNLOAD_FAIL_ON_DATALOSS = False -- This message won't be shown in further requests
    2019-01-04 17:55:52 [csrc][scrapy.downloadermiddlewares.retry] DEBUG: Retrying <GET http://www.csrc.gov.cn/pub/zjhpublic/G00306202/201810/P020181012566963570242.pdf> (failed 1 times): [<twisted.python.failure.Failure twisted.internet.error.ConnectionDone: Connection was closed cleanly.>, <twisted.python.failure.Failure twisted.web.http._DataLoss: >]


    寻原因
    1 replies    2020-10-22 19:00:56 +08:00
    codecore42
        1
    codecore42  
       Oct 22, 2020
    遇到了同样的问题,您这边解决了吗?
    About   ·   Help   ·   Advertise   ·   Blog   ·   API   ·   FAQ   ·   Solana   ·   1559 Online   Highest 6679   ·     Select Language
    创意工作者们的社区
    World is powered by solitude
    VERSION: 3.9.8.5 · 28ms · UTC 16:40 · PVG 00:40 · LAX 09:40 · JFK 12:40
    ♥ Do have faith in what you're doing.