Python web scraping: the BeautifulSoup warning on first run

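I wanted to scrape information from the page http://www.pythonscraping.com/pages/page1.html, but ran into the following problem along the way. The code is included here so that readers can check it for themselves.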

Method / Steps

When scraping the page with Python, the code is as follows:
    from urllib.request import urlopen
    from bs4 import BeautifulSoup
    html = urlopen('http://www.pythonscraping.com/pages/page1.html')
    bsObj = BeautifulSoup(html)
    print(bsObj.h1)
    nameList = bsObj.findAll(text='the prince')
    print(len(nameList))

Running the code produces a UserWarning, as follows:
    UserWarning: No parser was explicitly specified, so I'm using the best available HTML parser for this system ('lxml'). This usually isn't a problem, but if you run this code on another system, or in a different virtual environment, it may use a different parser and behave differently.

The rest of the message gives the cause of the warning: no parser type was passed when BeautifulSoup was initialized.
    The code that caused this warning is on line 4 of the file C:/Users/15057/PycharmProjects/Stock/Stockinvest/test.py. To get rid of this warning, change code that looks like this:
    BeautifulSoup(YOUR_MARKUP})
    to this:
    BeautifulSoup(YOUR_MARKUP, 'lxml')
    markup_type=markup_type)
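To see the cause for yourself, the following is a minimal sketch that records any UserWarning raised while the soup is built, once without a parser argument and once with one. The helper name parser_warnings and the inline sample markup are assumptions made for illustration; they are not part of the original script. The sketch uses 'html.parser', the parser from the standard library, so that it runs even where lxml is not installed.

```python
import warnings

from bs4 import BeautifulSoup


def parser_warnings(markup, parser=None):
    """Return the text of every UserWarning raised while building the soup.

    parser_warnings is a hypothetical helper written for this illustration.
    """
    with warnings.catch_warnings(record=True) as caught:
        # Record every warning instead of letting Python print or dedupe it.
        warnings.simplefilter('always')
        if parser is None:
            BeautifulSoup(markup)          # no parser named: bs4 has to guess
        else:
            BeautifulSoup(markup, parser)  # parser named explicitly
    return [str(w.message) for w in caught
            if issubclass(w.category, UserWarning)]


# Hypothetical stand-in for the downloaded page.
sample = '<html><body><h1>An Interesting Title</h1></body></html>'

print(len(parser_warnings(sample)))                 # call without a parser
print(len(parser_warnings(sample, 'html.parser')))  # call with a parser
```

When run as a script file, the first call is the one expected to report the warning; some bs4 versions skip it in an interactive session, so the first count may differ there. The second call names its parser and should report nothing.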

The full details are shown in the screenshot:

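Change bsObj = BeautifulSoup(html) to bsObj = BeautifulSoup(html, 'lxml') (or, if you read the response first, change BeautifulSoup(html.read()) to BeautifulSoup(html.read(), 'lxml')), and the warning no longer appears. Note that this is a warning rather than an error: the original script still runs and prints its results.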
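Putting the fix together, a corrected version of the script might look like the sketch below. The function names make_soup and scrape and the inline sample markup are assumptions added for illustration. The sketch also uses find_all(string=...), the current spelling of findAll(text=...) in recent Beautiful Soup releases. The offline check at the bottom passes 'html.parser' so that it runs without lxml or network access.

```python
from urllib.request import urlopen

from bs4 import BeautifulSoup


def make_soup(markup, parser='lxml'):
    # Naming the parser explicitly is what removes the UserWarning.
    return BeautifulSoup(markup, parser)


def scrape(url, parser='lxml'):
    """Download url; return its <h1> tag and the number of text nodes
    that are exactly 'the prince' (hypothetical helper)."""
    with urlopen(url) as response:
        soup = make_soup(response.read(), parser)
    return soup.h1, len(soup.find_all(string='the prince'))


# Offline check on a hypothetical snippet, so no network access is needed.
sample = ('<html><body><h1>An Interesting Title</h1>'
          '<span>the prince</span></body></html>')
soup = make_soup(sample, 'html.parser')
print(soup.h1)
print(len(soup.find_all(string='the prince')))
```

To run it against the real page, call scrape('http://www.pythonscraping.com/pages/page1.html'); this uses the default 'lxml' parser, which must be installed, as it evidently was on the system that produced the warning above.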


欧尼酱
