当前位置:网站首页>网站某个按钮样式爬取片段

网站某个按钮样式爬取片段

2022-04-23 15:15:00 萱泽科技

有兴趣的可以试一下。参考学习!

      
      
import requests
import os


class Code:
def __init__(self):
self.url = [
"https://api.uiverse.io/buttons?type=button&orderBy=popular",
"https://api.uiverse.io/buttons?type=button-of-the-day&orderBy=popular",
"https://api.uiverse.io/buttons?type=card&orderBy=popular",
"https://api.uiverse.io/buttons?type=checkbox&orderBy=popular",
"https://api.uiverse.io/buttons?type=input&orderBy=popular",
"https://api.uiverse.io/buttons?type=spinner&orderBy=popular",
"https://api.uiverse.io/buttons?type=switch&orderBy=popular",
]
self.headers = {
"user-agent": "Mozilla / 5.0(Windows NT 10.0;WOW64) AppleWebKit / 537.36(KHTML, likeGecko) Chrome / 86.0.4240.198Safari / 537.36"
}

def send_request(self, url):
return requests.get(url, headers=self.headers, timeout=3).json()

def run(self):
for url in self.url:
html_str = self.send_request(url)
for val in html_str['buttons']:
css = val['scopedCss']
html = val['html']
first = css.index('.')
last = css.find(' ', first)
print(first)
print(css[first+1:last])

content = (
'<html lang="en">'
'<head>'
'<meta charset="UTF-8">'
'<meta name="viewport" content="width=device-width, initial-scale=1.0">'
'<title > Document </title>'
'<style >' + css + '</style>'
'</head>'
'<body><div class="' + css[first+1:last] + '">' + html + '</div></body>'
'</html>'
)
# break
path = "C:/Users/Administrator/Downloads/Documents"+url[36:-16]
if not os.path.exists(path=path):
os.makedirs(path)
with open(path + '/' + val['id'] + '.html', 'w',
encoding='utf-8') as ht:
ht.write(content)


def main():
code = Code()
code.run()


if __name__ == '__main__':
main()
  • 1.
  • 2.
  • 3.
  • 4.
  • 5.
  • 6.
  • 7.
  • 8.
  • 9.
  • 10.
  • 11.
  • 12.
  • 13.
  • 14.
  • 15.
  • 16.
  • 17.
  • 18.
  • 19.
  • 20.
  • 21.
  • 22.
  • 23.
  • 24.
  • 25.
  • 26.
  • 27.
  • 28.
  • 29.
  • 30.
  • 31.
  • 32.
  • 33.
  • 34.
  • 35.
  • 36.
  • 37.
  • 38.
  • 39.
  • 40.
  • 41.
  • 42.
  • 43.
  • 44.
  • 45.
  • 46.
  • 47.
  • 48.
  • 49.
  • 50.
  • 51.
  • 52.
  • 53.
  • 54.
  • 55.
  • 56.
  • 57.
  • 58.
  • 59.
  • 60.

版权声明
本文为[萱泽科技]所创,转载请带上原文链接,感谢
https://blog.51cto.com/u_15612978/5248579