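import json
import requests

def checkkk(url):
    # POST the URL as JSON to the service at `thisurl` (defined elsewhere)
    data = {"url": url}
    response = json.loads(requests.post(thisurl, json.dumps(data)).text)
    # print(response["info"])
    if response["info"] == 2:
        return 1
    else:
        return 0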
Selenium library
Driver
After the browser updates, the driver version no longer matches it. The error:
selenium.common.exceptions.SessionNotCreatedException: Message: session not created: This version of ChromeDriver only supports Chrome version 118 Current browser version is 131.0.6778.140 with binary path xxx\chrome.exe
The chromedriver version (131.0.6775.0) detected in PATH at /usr/local/bin/chromedriver might not be compatible with the detected chrome version (131.0.6778.139); currently, chromedriver 131.0.6778.108 is recommended for chrome 131.*, so it is advised to delete the driver in PATH and retry
(Error after updating the driver) The chromedriver version (131.0.6778.108) detected in PATH at /usr/local/bin/chromedriver might not be compatible with the detected chrome version (131.0.6778.139); currently, chromedriver 131.0.6778.204 is recommended for chrome 131.*, so it is advised to delete the driver in PATH and retry
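The messages above differ in severity: the first is a hard failure because the major versions differ (118 vs 131), while the other two are warnings about builds within the same major version. A minimal sketch of that check, using only version strings copied from the messages above; the helper names `major_version` and `majors_match` are made up for illustration and are not part of Selenium:

```python
import re


def major_version(version: str) -> int:
    """Return the leading (major) number of a dotted version string."""
    match = re.match(r"\s*(\d+)", version)
    if match is None:
        raise ValueError(f"not a version string: {version!r}")
    return int(match.group(1))


def majors_match(browser_version: str, driver_version: str) -> bool:
    """True when browser and driver share the same major version."""
    return major_version(browser_version) == major_version(driver_version)


# Chrome 131 with a driver built for Chrome 118: the hard failure case.
print(majors_match("131.0.6778.140", "118"))
# Chrome 131 with a 131.* driver: same major version, warning only.
print(majors_match("131.0.6778.139", "131.0.6775.0"))
```

The warning text itself suggests deleting the driver found in PATH. With Selenium 4.6 or later, the bundled Selenium Manager can then resolve a matching driver on its own, assuming the machine is allowed to download it.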
  File "D:\Python\Python38\lib\site-packages\selenium\webdriver\chrome\webdriver.py", line 84, in __init__
    super().__init__(
  File "D:\Python\Python38\lib\site-packages\selenium\webdriver\chromium\webdriver.py", line 104, in __init__
    super().__init__(
  File "D:\Python\Python38\lib\site-packages\selenium\webdriver\remote\webdriver.py", line 286, in __init__
    self.start_session(capabilities, browser_profile)
  File "D:\Python\Python38\lib\site-packages\selenium\webdriver\remote\webdriver.py", line 378, in start_session
    response = self.execute(Command.NEW_SESSION, parameters)
  File "D:\Python\Python38\lib\site-packages\selenium\webdriver\remote\webdriver.py", line 440, in execute
    self.error_handler.check_response(response)
  File "D:\Python\Python38\lib\site-packages\selenium\webdriver\remote\errorhandler.py", line 245, in check_response
    raise exception_class(message, screen, stacktrace)
selenium.common.exceptions.SessionNotCreatedException: Message: session not created: DevToolsActivePort file doesn't exist
Stacktrace:
GetHandleVerifier [0x00007FF713938E92+54786]
(No symbol) [0x00007FF7138A55B2]
(No symbol) [0x00007FF71375A64B]
(No symbol) [0x00007FF71378CA50]
(No symbol) [0x00007FF713787C46]
(No symbol) [0x00007FF7137853BE]
(No symbol) [0x00007FF7137C3FBB]
(No symbol) [0x00007FF7137C3A30]
(No symbol) [0x00007FF7137BBC43]
(No symbol) [0x00007FF713790941]
(No symbol) [0x00007FF713791B84]
GetHandleVerifier [0x00007FF713C87EE2+3524178]
GetHandleVerifier [0x00007FF713CDD790+3874560]
GetHandleVerifier [0x00007FF713CD5D0F+3843199]
GetHandleVerifier [0x00007FF7139D5026+694166]
(No symbol) [0x00007FF7138B0A28]
(No symbol) [0x00007FF7138ACA34]
(No symbol) [0x00007FF7138ACB62]
(No symbol) [0x00007FF71389CC23]
BaseThreadInitThunk [0x00007FFA21F374B4+20]
RtlUserThreadStart [0x00007FFA232826A1+33]
import requests
from lxml import etree

# Fetch the page and parse it into an lxml element tree
res = requests.get(image_url, headers=headers).text
html = etree.HTML(res)
Chrome has crashed & ChromeOptions startup arguments
The browser and the driver are both installed and their versions match, but running the script still fails with:
selenium.common.exceptions.SessionNotCreatedException: Message: session not created: Chrome failed to start: exited normally. (session not created: DevToolsActivePort file doesn't exist) (The process started from chrome location /opt/google/chrome/google-chrome is no longer running, so ChromeDriver is assuming that Chrome has crashed.)
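This error is commonly seen when Chrome is started on a Linux server with no display, or as root inside a container. A minimal sketch of startup arguments often used in that situation follows; the helper `build_chrome_options` and the constant `SERVER_CHROME_ARGS` are illustrative names, and which flags are really needed depends on the environment, so treat the list as an assumption to test rather than a fixed recipe.

```python
# Flags commonly suggested for Chrome on a display-less Linux server.
# Assumption: the exact set needed varies by machine and Chrome version.
SERVER_CHROME_ARGS = [
    "--headless=new",           # run without an X display (newer Chrome builds)
    "--no-sandbox",             # often required when running as root or in a container
    "--disable-dev-shm-usage",  # avoid crashes caused by a small /dev/shm
]


def build_chrome_options(extra_args=()):
    """Return a ChromeOptions object carrying the flags above plus extra_args."""
    # Imported lazily so the flag list can be inspected without Selenium installed.
    from selenium import webdriver

    options = webdriver.ChromeOptions()
    for arg in [*SERVER_CHROME_ARGS, *extra_args]:
        options.add_argument(arg)
    return options
```

The result would be passed as `webdriver.Chrome(options=build_chrome_options())`. Adding the flags one at a time makes it easier to see which one the environment actually requires.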
Request blocked.
We can't connect to the server for this app or website at this time. There might be too much traffic or a configuration error. Try again later, or contact the app or website owner.
If you provide content to customers through CloudFront, you can find steps to troubleshoot and help prevent this error by reviewing the CloudFront documentation.
Generated by cloudfront (CloudFront)
Request ID: OCKGdsv8zSPr8Vpi-JubSXpxV9SzCniI6bBxx1QGDq9MRuo3nYQtHQ==
options_ = webdriver.ChromeOptions()
# options_.add_argument(
#     'User-Agent=Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/110.0.0.0 Safari/537.36')
options_.add_argument('user-agent=Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/87.0.4280.88 Safari/537.36')
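429 Too Many Requests
When requests arrive too frequently, the server refuses the connection and returns a page with status code 429. Close the excess connections and retry the request in a loop: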
import time
import requests

response = ''
for i in range(20):  # retry the request up to 20 times
    session = requests.Session()
    session.keep_alive = False  # do not keep the session connection alive
    response = session.get(downurl, headers=headers, timeout=20, proxies=proxies, cookies=cookie_dict)
    print(response.status_code)
    if response.status_code == 200:
        break
    time.sleep(20)  # wait before the next attempt
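The loop above waits a fixed 20 seconds between attempts. As a variation, the wait can follow the server's `Retry-After` header when it is given in seconds. The sketch below is not the original method: `fetch_with_retry` is a hypothetical helper, and it takes the request as a callable so the retry logic can be exercised without a network.

```python
import time


def fetch_with_retry(fetch, max_attempts=20, base_delay=20.0, sleep=time.sleep):
    """Call fetch() until it returns a non-429 response or attempts run out.

    fetch: zero-argument callable returning an object with
           .status_code and .headers (for example a requests.Response).
    """
    response = None
    for attempt in range(max_attempts):
        response = fetch()
        if response.status_code != 429:
            return response
        # Use Retry-After when it is a plain number of seconds,
        # otherwise fall back to the fixed delay.
        retry_after = response.headers.get("Retry-After", "")
        delay = float(retry_after) if retry_after.isdigit() else base_delay
        if attempt < max_attempts - 1:
            sleep(delay)
    return response
```

With requests, this could be called as `fetch_with_retry(lambda: session.get(downurl, headers=headers, timeout=20))` inside a `with requests.Session() as session:` block, which closes the session's connections on exit.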
dvid image error HTTPConnectionPool(host='172.25.76.14', port=8001): Max retries exceeded with url: /bullet/dvidshubpost (Caused by NewConnectionError('<urllib3.connection.HTTPConnection object at 0x7f50d1737100>: Failed to establish a new connection: [Errno 113] No route to host'))
SSLEOFError
requests.exceptions.SSLError: HTTPSConnectionPool(host='xxx', port=443): Max retries exceeded with url: xxx.mp4 (Caused by SSLError(SSLEOFError(8, '[SSL: UNEXPECTED_EOF_WHILE_READING] EOF occurred in violation of protocol (_ssl.c:1007)')))