Python3.5+requests 爬取网站遇到中文乱码怎么办（???è????????è?ˉ?o??′2? ????é?￠）爬虫

import requests from bs4 import BeautifulSoupurl = 'http://quote.eastmoney.com/stocklist.html' user_agent = 'Mozilla/4.0 (compatible; MSIE 5.5; Windows NT)' headers = {'User-Agent': user_agent} req = requests.get(url, headers=headers) req.encoding = 'utf-8' bs = BeautifulSoup(req.content, 'html.parser')# type: BeautifulSoup quotesearch = bs.find('div', attrs={'id': 'quotesearch'}) print(quotesearch)

运行以上代码，显示结果如下：

???31é·Y(300737) °?·éêy?Y(300738) ?÷??μ??·(300739) óù?ò??(300740) ?a±|1é·Y(300741)

1.解决思路一：查看网页的编码方式
【Python3.5+requests 爬取网站遇到中文乱码怎么办（???è????????è?ˉ?o??′2? ????é?￠）】F12打开网站地址，查看最上方head，发现编码方式为‘gb2312’(charset=gb2312)，修改代码第八行req.encoding = 'gb2312',重新运行代码。运行结果未改变，仍有乱码。
2.解决思路二：修改代码第九行bs = BeautifulSoup(req.text, 'html.parser')，将req.content改为req.text，运行代码，结果正常，无乱码。
原理：