Python课程-w73班
Louyj's Blog
Toggle navigation
Python课程-w73班
Home
班级课件
课堂代码
课后作业
学习资料
Archives
Tags
2022-08-07课堂代码
2022-08-07 08:30:28
39
0
0
python-w73
``` import json import os import urllib.request import re response = urllib.request.urlopen( 'https://image.baidu.com/search/albumsdetail?tn=albumsdetail&word=%E5%9F%8E%E5%B8%82%E5%BB%BA%E7%AD%91%E6%91%84%E5%BD%B1%E4%B8%93%E9%A2%98&fr=searchindex_album%20&album_tab=%E5%BB%BA%E7%AD%91&album_id=7&rn=30') html = response.read().decode() # print(html) # https://t7.baidu.com/it/u=1595072465,3644073269&fm=193&f=GIF ''' - 目标明确: 要爬什么东西 - 抓取网页到本地进行分析(规律) - 爬取 result = re.findall('src="(.+?)"', content) print(result) ''' # linkData开头, 单引号中间的字符串, linkData: ('要找的内容') result = re.findall("linkData: ('.+?')", html) image_list = json.loads(eval(result[0])) if not os.path.exists("images"): os.mkdir("images") index = 0 for image_item in image_list: image_url = image_item['thumbnailUrl'].replace('\/', '/') urllib.request.urlretrieve(image_url, "images/image-" + str(index) + '.gif') index = index + 1 print(image_url) # json.dumps() # python对象(list,dict)转换成json字符串 # json.loads() # 把json字符串转换成python对象(list,dict) ```
Pre:
2022-08-27课件
Next:
2022-08-07课件
0
likes
39
Weibo
Wechat
Tencent Weibo
QQ Zone
RenRen
Submit
Sign in
to leave a comment.
No Leanote account?
Sign up now.
0
comments
More...
Table of content
No Leanote account? Sign up now.