宜配屋

Python抓取手机号归属地信息示例代码

yipeiwu_com6年前 (2020-03-06)Python爬虫

前言

本文给大家介绍的是利用Python抓取手机归属地信息，文中给出了详细的示例代码，相信对大家的理解和学习很有帮助，以下为Python代码，较为简单，供参考。

示例代码

# -*- coding:utf-8 -*-
import requests,re
o = open('data.txt','a')
e = open('error.txt','a')
baseUrl = 'http://www.iluohe.com/'
r = requests.get('http://www.iluohe.com/all.shtml',)
links = re.findall('<a href="(city/.*?/.*?)" target',r.content.decode("gbk").encode("utf-8"))
for link in links:
 link = baseUrl+link
 cityData = requests.get(link)
 if cityData.status_code >= 300 :
 e.writelines(link+"\n")
 else:
 cityData = cityData.content.decode("gbk").encode("utf-8")
 provinceTemp = re.findall('<div class="NameSzu"><a href=".*?">(.*?)</a></div>',cityData)
 if provinceTemp:
  province = provinceTemp[0]
  city = re.findall('<meta name="description" content="(.*?)共有',cityData)[0]
  tempData = re.findall('<div class="ab_menu.*?</span>(.*?) \(.*?</div>.*?<ul>(.*?)</ul>',cityData)
  for temp in tempData:
  carrier = temp[0]
  numbers = re.findall('">(.*?)</a></li>',temp[1])
  for number in numbers:
   text = number + "," + carrier + "," + city + "," + province
   o.writelines(text)
   o.writelines('\n')
 else:
  e.writelines(link+"\n")
o.close()
print "over!"

总结

以上就是这篇文章的全部内容了，希望本文的内容对大家的学习或者工作能带来一定的帮助，如果有疑问大家可以留言交流。

Python抓取手机号归属地信息示例代码

相关文章

利用Python爬虫给孩子起个好名字

教你用python3根据关键词爬取百度百科的内容

Python基于分析Ajax请求实现抓取今日头条街拍图集功能示例

在Python中使用cookielib和urllib2配合PyQuery抓取网页信息

Python爬虫实现验证码登录代码实例

© YiPeiWu.com 【宜配屋】粤ICP备17031333号

Powered By Z-BlogPHP. Theme by TOYEAN.

宜配屋

Python抓取手机号归属地信息示例代码

相关文章

利用Python爬虫给孩子起个好名字

教你用python3根据关键词爬取百度百科的内容

Python基于分析Ajax请求实现抓取今日头条街拍图集功能示例

在Python中使用cookielib和urllib2配合PyQuery抓取网页信息

Python爬虫实现验证码登录代码实例

© YiPeiWu.com 【宜配屋】 粤ICP备17031333号 var _hmt = _hmt || [];(function() { var hm = document.createElement("script"); hm.src = "https://hm.baidu.com/hm.js?8aa60ae04b767b2af31903508928acc0"; var s = document.getElementsByTagName("script")[0]; s.parentNode.insertBefore(hm, s);})();

Powered By Z-BlogPHP. Theme by TOYEAN.

© YiPeiWu.com 【宜配屋】粤ICP备17031333号