宜配屋

Python爬虫爬取一个网页上的图片地址实例代码

yipeiwu_com6年前 (2020-03-06)Python爬虫

本文实例主要是实现爬取一个网页上的图片地址，具体如下。

读取一个网页的源代码：

import urllib.request
def getHtml(url):
  html=urllib.request.urlopen(url).read()
  return html
print(getHtml(http://image.baidu.com/search/flip?tn=baiduimage&ie=utf-8&word=%E5%A3%81%E7%BA%B8&ct=201326592&lm=-1&v=flip))

利用正则表达式爬取一个网页上的图片地址：

import re
import urllib.request
def getHtml(url):
  html=urllib.request.urlopen(url).read()
  return html
def getImg(html):
  r=r'"thumbURL":"(http://img.+?\.jpg)"' #定义正则
  imglist=re.findall(r,html)
  return imglist
html=str(getHtml("http://image.baidu.com/search/flip?tn=baiduimage&ie=utf-8&word=%E5%A3%81%E7%BA%B8&ct=201326592&lm=-1&v=flip"))
print(getImg(html))

运行结果：

总结

以上就是本文关于Python爬虫爬取一个网页上的图片地址实例代码的全部内容，希望对大家有所帮助。感兴趣的朋友可以继续参阅本站其他相关专题，如有不足之处，欢迎留言指出。感谢朋友们对本站的支持！

Python爬虫爬取一个网页上的图片地址实例代码

相关文章

零基础写python爬虫之爬虫编写全记录

Python3.4编程实现简单抓取爬虫功能示例

Python实现的爬取百度贴吧图片功能完整示例

详解python 爬取12306验证码

用python3 urllib破解有道翻译反爬虫机制详解

© YiPeiWu.com 【宜配屋】粤ICP备17031333号

Powered By Z-BlogPHP. Theme by TOYEAN.

宜配屋

Python爬虫爬取一个网页上的图片地址实例代码

相关文章

零基础写python爬虫之爬虫编写全记录

Python3.4编程实现简单抓取爬虫功能示例

Python实现的爬取百度贴吧图片功能完整示例

详解python 爬取12306验证码

用python3 urllib破解有道翻译反爬虫机制详解

© YiPeiWu.com 【宜配屋】 粤ICP备17031333号 var _hmt = _hmt || [];(function() { var hm = document.createElement("script"); hm.src = "https://hm.baidu.com/hm.js?8aa60ae04b767b2af31903508928acc0"; var s = document.getElementsByTagName("script")[0]; s.parentNode.insertBefore(hm, s);})();

Powered By Z-BlogPHP. Theme by TOYEAN.

© YiPeiWu.com 【宜配屋】粤ICP备17031333号