Python抓取百度查询结果的方法

yipeiwu_com6年前 (2020-03-06)Python爬虫

本文实例讲述了Python抓取百度查询结果的方法。分享给大家供大家参考。具体实现方法如下：

#win python 2.7.x
import re,sys,urllib,codecs
xh = urllib.urlopen("http://www.baidu.com/s?q1=123&rn=100").read().decode('utf-8') 
rc = re.compile(r'<td class=f><h3 class="t"><a.*?>(?P<first>.*?)</h3>',re.I)
match = rc.finditer(xh)
rcr = re.compile(r'<[^>]+>',re.I)
f = codecs.open("xiaohei.txt", "w", "utf-8")
for i in rc.finditer(xh):
  ss = i.group(0)
  s1 = rcr.sub('',ss)
  print (s1)
  f.write(s1)
f.close()

希望本文所述对大家的Python程序设计有所帮助。

返回列表

上一篇：Python字符转换

下一篇：PHP生成静态页面详解

python批量爬取下载抖音视频

本文实例为大家分享了python批量爬取下载抖音视频的具体代码，供大家参考，具体内容如下 import os import requests import re import sys...

Python爬虫获取图片并下载保存至本地的实例

1、抓取煎蛋网上的图片。 2、代码如下： import urllib.request import os #to open the url def url_open(url): r...

python爬虫中get和post方法介绍以及cookie作用

首先确定你要爬取的目标网站的表单提交方式，可以通过开发者工具看到。这里推荐使用chrome。这里我用163邮箱为例打开工具后再Network中，在Name选中想要了解的网站，右侧...

Python网络爬虫实例讲解

聊一聊Python与网络爬虫。 1、爬虫的定义爬虫：自动抓取互联网数据的程序。 2、爬虫的主要框架爬虫程序的主要框架如上图所示，爬虫调度端通过URL管理器获取待爬取的URL链接，若...

python&MongoDB爬取图书馆借阅记录

直接上需求和代码首先是需要爬取的链接和网页：http://211.81.31.34/uhtbin/cgisirsi/x/0/0/57/49?user_id=LIBSCI_ENGI&pa...

宜配屋

Python抓取百度查询结果的方法

相关文章

python批量爬取下载抖音视频

Python爬虫获取图片并下载保存至本地的实例

python爬虫中get和post方法介绍以及cookie作用

Python网络爬虫实例讲解

python&MongoDB爬取图书馆借阅记录

© YiPeiWu.com 【宜配屋】粤ICP备17031333号

Powered By Z-BlogPHP. Theme by TOYEAN.

宜配屋

Python抓取百度查询结果的方法

相关文章

python批量爬取下载抖音视频

Python爬虫获取图片并下载保存至本地的实例

python爬虫中get和post方法介绍以及cookie作用

Python网络爬虫实例讲解

python&MongoDB爬取图书馆借阅记录

© YiPeiWu.com 【宜配屋】 粤ICP备17031333号 var _hmt = _hmt || [];(function() { var hm = document.createElement("script"); hm.src = "https://hm.baidu.com/hm.js?8aa60ae04b767b2af31903508928acc0"; var s = document.getElementsByTagName("script")[0]; s.parentNode.insertBefore(hm, s);})();

Powered By Z-BlogPHP. Theme by TOYEAN.

© YiPeiWu.com 【宜配屋】粤ICP备17031333号