当前位置:网站首页>爬取蝉妈妈数据平台商品数据
爬取蝉妈妈数据平台商品数据
2022-04-23 05:46:00 【圆滚滚的程序员】
本文旨在交流学习,勿作他用,否则后果自负
环境 linux+pycharm+anaconda
import json
import csv
import requests
from usere_agent import UA
from requests.packages.urllib3.exceptions import InsecureRequestWarning
requests.packages.urllib3.disable_warnings(InsecureRequestWarning)
headers = {
'Accept': '*/*',
'Accept-Encoding': 'gzip, deflate, br',
'Accept-Language': 'zh-CN,zh;q=0.9',
'Connection': 'keep-alive',
'Content-Length': '238',
'Content-Type': 'application/x-www-form-urlencoded',
'Host': 'api-service.chanmama.com',
'Origin': 'https://www.chanmama.com',
'Referer': 'https://www.chanmama.com/promotionRank',
'User-Agent': UA
}
cl_url=["女装","男装","美妆护理","鞋包饰品","日用百货","母婴玩具","食品生鲜","运动户外","鲜花家纺","宠物农资","汽车配件","手机数码","生活家电","家装建材","其他",]
url = 'https://api-service.chanmama.com/v1/product/search'
for i in cl_url:
for j in range(1, 1000):
data1 = {
'keyword': '', 'keyword_type': '', 'page': '{}'.format(j), 'price': '', 'size': '100',
'filter_coupon': '0', 'is_aweme_goods': '0', 'tb_max_commission_rate': '', 'day_pv_count': '',
'day_order_count': '', 'cat': "{}".format(i), 'platform': '', 'sort': "day_order_count",
'order_by': "desc",
}
d = requests.post(url=url, headers=headers, data=(json.dumps(data1)), verify=False).json()
# print(d)
try:
if d:
data_list = d['data']['list']
#print(data_list)
if data_list:
for k in data_list:
img = k['image'] # 图片
#print(img)
title = k['title'] # 商品名
brand = k["brand"] # 品牌
u = k["url"] # 链接
market_price = k["market_price"] # 市场价格
cat = k['cat'] # 分类
shop_name = k["shop_name"] # 店铺名
sales = k["sales"] # 全网月销量
tb_max_commission_rate = k["tb_max_commission_rate"] # 佣金比例
day_pv_count = k["day_pv_count"] # 昨日浏览量
conversion_rate = k["conversion_rate"] # 昨日转化率
day_order_count = k["day_order_count"] # 昨日销量
tb_coupon_price = k["tb_coupon_price"] # 优惠价
print(cat, shop_name, brand, title, img, market_price, sales, tb_max_commission_rate,
day_pv_count, conversion_rate, day_order_count, tb_coupon_price, u)
with open('/media/liu/_dde_data/project/spider/供应商/cmm_data/' + cat + '.csv', 'a+') as f:
f_csv = csv.writer(f)
f_csv.writerow([cat,shop_name,brand,title,img,market_price,sales,tb_max_commission_rate,day_pv_count,conversion_rate,day_order_count,tb_coupon_price,u])
else:
continue
except Exception as e:
continue
版权声明
本文为[圆滚滚的程序员]所创,转载请带上原文链接,感谢
https://blog.csdn.net/qq_39483957/article/details/106268765
边栏推荐
- Rust 中的 Cell 共享可变指针
- 4. Print form
- Programming training
- What is the difference between the basic feasible solution and the basic feasible solution in linear programming?
- Rainbow (DP)
- Best practices for MySQL storage time
- 20 excellent plug-ins recommended by idea
- 基于pygame库编写的五子棋游戏
- Protected (members modified by protected are visible to this package and its subclasses)
- -- SQL query and return limit rows
猜你喜欢
![[leetcode 401] binary Watch](/img/a5/538caf3a1a6143a47d79d947717554.png)
[leetcode 401] binary Watch

SQL -- data definition

A sharp tool to improve work efficiency

Framework analysis 1 Introduction to system architecture

Kalman filter and inertial integrated navigation

Substring Inversion (Easy Version)

Algèbre linéaire chapitre 2 - matrice et son fonctionnement

从源代码到可执行文件的过程

小区房价可视化

Techniques et principes de détection
随机推荐
Qthread simple test understanding
Cf6d lizards and fundamentals 2 problem solving
Advanced operation of idea debug
[leetcode 401] binary Watch
List segmentation best practices
Use of multithreaded executors
从源代码到可执行文件的过程
IO multiplexing of 09 redis
SVN简单操作命令
[leetcode 67] sum of two binary numbers
Generate excel template (drop-down selection, multi-level linkage)
Substring Inversion (Easy Version)
解决ArcGIS分区统计显示太多唯一值执行失败
线程和进程的关系和区别是什么
Troubleshooting of data deleted and reappeared problems
Example of ticket selling with reentrant lock
Option的正确打开方式
POJ - 2955 brackets interval DP
[leetcode 290] word rules
H. Are You Safe? Convex hull naked problem