在线观看不卡亚洲电影_亚洲妓女99综合网_91青青青亚洲娱乐在线观看_日韩无码高清综合久久

鍍金池/ 問(wèn)答/Python/ 怎么用python爬取京東電子產(chǎn)品參數(shù)?

怎么用python爬取京東電子產(chǎn)品參數(shù)?

1 比如想爬取華碩游戲本的每一臺(tái)筆記本的參數(shù),如截圖(1)(2):
圖片描述
圖片描述

鏈接:https://list.jd.com/list.html...

該怎么爬取呢?自己找不到類(lèi)似的例子。

回答
編輯回答
怪痞
#!/usr/bin/env python3.6
# -*- coding: utf-8 -*-
import re
import requests
from bs4 import BeautifulSoup
headers = {'user-agent':'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/62.0.3202.75 Safari/537.36'}
def urls():
    url = 'https://list.jd.com/list.html?cat=670,671,1105&ev=exbrand_8551&page=1&delivery=1&delivery_daofu=0&sort=sort_totalsales15_desc&trans=1&JL=4_11_0#J_main'
    html = requests.get(url,headers=headers)
    find_urls = re.findall('a target="_blank" href="(.*?)" ',html.text)[1:]
    for find_url in find_urls:
        find_text('http:'+find_url)

def find_text(find_url):
    html =requests.get(find_url,headers=headers).text
    soup = BeautifulSoup(html,'html5lib')
    find_texts = soup.find('div',class_='Ptable')
    print(find_texts.get_text('\n',strip=True))
    print(100*'*')

if __name__ == '__main__':
    urls()

2018年7月14日 13:34