宜配屋

python 获取网页编码方式实现代码

yipeiwu_com6年前 (2020-03-06)Python基础

python 获取网页编码方式实现代码

<span style="font-family: Arial, Helvetica, sans-serif; background-color: rgb(255, 255, 255);">
  </span><span style="font-family: Arial, Helvetica, sans-serif; background-color: rgb(255, 255, 255);">
python开发，自动化获取网页编码方式用到了chardet库，字符集检测，这个类在python2.7中没有，需要在官网上下载。
这里我下载好了chardet-2.3.0.tar.gz压缩包文件，只需要将压缩包文件解压后的chardet文件放到python安装包下的
python27/lib/site-packages/下，就可以了。</span>

然后import chardet

下面写了一个自动化检测的函数供检测Url连接，然后返回网页url的编码方式。

import chardet #字符集检测 
import urllib 
 
url="http://www.jd.com" 
 
 
def automatic_detect(url): 
  content=urllib.urlopen(url).read() 
  result=chardet.detect(content) 
 
  encoding=result['encoding'] 
 
  return encoding 
 
urls=['http://www.baidu.com','http://www.163.com','http://dangdang.com'] 
for url in urls: 
  print url,automatic_detect(url)

上面用到了chardet类的detect方法，返回字典，然后取出编码方式encoding

感谢阅读，希望能帮助到大家，谢谢大家对本站的支持！

python 获取网页编码方式实现代码

相关文章

python opencv读mp4视频的实例

Python实现小数转化为百分数的格式化输出方法示例

python web框架 django wsgi原理解析

Python实现TCP协议下的端口映射功能的脚本程序示例

python中判断文件编码的chardet(实例讲解)

© YiPeiWu.com 【宜配屋】粤ICP备17031333号

Powered By Z-BlogPHP. Theme by TOYEAN.

宜配屋

python 获取网页编码方式实现代码

相关文章

python opencv读mp4视频的实例

Python实现小数转化为百分数的格式化输出方法示例

python web框架 django wsgi原理解析

Python实现TCP协议下的端口映射功能的脚本程序示例

python中判断文件编码的chardet(实例讲解)

© YiPeiWu.com 【宜配屋】 粤ICP备17031333号 var _hmt = _hmt || [];(function() { var hm = document.createElement("script"); hm.src = "https://hm.baidu.com/hm.js?8aa60ae04b767b2af31903508928acc0"; var s = document.getElementsByTagName("script")[0]; s.parentNode.insertBefore(hm, s);})();

Powered By Z-BlogPHP. Theme by TOYEAN.

© YiPeiWu.com 【宜配屋】粤ICP备17031333号