在线观看不卡亚洲电影_亚洲妓女99综合网_91青青青亚洲娱乐在线观看_日韩无码高清综合久久

鍍金池/ 問(wèn)答/HTML5  Python  HTML/ 同樣一個(gè)下載地址,用python爬蟲(chóng)爬取的種子文件大小為0,而用瀏覽器是可以正常

同樣一個(gè)下載地址,用python爬蟲(chóng)爬取的種子文件大小為0,而用瀏覽器是可以正常下載下來(lái)的?

1.訪(fǎng)問(wèn)某個(gè)網(wǎng)頁(yè),用瀏覽器可以下載其中嵌入的種子文件,種子文件大小是正常的,用迅雷工具也可以正常下載,但是用python爬蟲(chóng)爬取,并且下載下來(lái)的數(shù)據(jù)大小為0?
2.這是我自己寫(xiě)的代碼。

url = 'http://www.gawu88.space/thread-9431970-1-1.html'
headers = {
    'Cookie':'__cfduid=d15f7eb39310b0301f07e1f744ca70a3d1526800937; _ga=GA1.2.942865751.1526800940; A8tI_2132_saltkey=njU69xqb; A8tI_2132_lastvisit=1526797339; A8tI_2132_adult_warn=1; A8tI_2132_auth=7d44BRr5TCxDGN9zYzcgtvgTYZzopZtEOJjzAO323fO%2BdvFoIjRzKH31yzmid2IjzmB9bQ5PLK%2B1iWLRV%2BnD6zp8PwkV; A8tI_2132_lastcheckfeed=7589318%7C1526800977; A8tI_2132_smile=2D1; A8tI_2132_atarget=1; _gid=GA1.2.849215201.1527331040; cus_cookie=5; A8tI_2132_adv_gid=18; A8tI_2132_self_unique_code=6357ea0d-3640-91bf-a290-cdc483f40ded; A8tI_2132_ignore_notice=1; __insp_wid=1484672786; __insp_nv=true; __insp_targlpu=aHR0cDovL3d3dy5nYXd1ODguc3BhY2UvcG9ydGFsLmh0bWw%3D; __insp_targlpt=6K665Z2b6Zeo5oi3X_adj_WQp_iuuuWdm1%2FmgKflkKfmiJDkurrorrrlnZs%3D; __insp_norec_sess=true; A8tI_2132_sign_close=1; A8tI_2132_notification_readed_ids=57457151; A8tI_2132_noticeTitle=1; A8tI_2132_notification_unread_tips=1527519801; A8tI_2132_credit_max_num=0; A8tI_2132_credit_remain_num=0; A8tI_2132_sendmail=1; A8tI_2132_st_t=7589318%7C1527520644%7C1dc26593f0230c7c6b43bde6c98103c9; A8tI_2132_forum_lastvisit=D_180_1526811032D_181_1527427919D_815_1527520227D_798_1527520644; A8tI_2132_visitedfid=798D815D181D307D791D216D11D180D142D27; A8tI_2132_ulastactivity=1527520644%7C0; A8tI_2132_self_uid=7589318; A8tI_2132_self_fid=798; A8tI_2132_st_p=7589318%7C1527520650%7C570a2893a0834543f205c6bc2090a236; A8tI_2132_viewid=tid_9478918; A8tI_2132_self_tid=9478918; A8tI_2132_lastact=1527520651%09misc.php%09seccode; A8tI_2132_seccode=129607798.bd627f2e523f8c47f4; __insp_slim=1527520653270',
    'Host':'www.gawu88.space',
    'Referer':'http://www.gawu88.space/forum-798-1.html',
    'Accept-Encoding':'',
    'User-Agent':'Mozilla/5.0 (Windows NT 6.1; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/64.0.3282.140 Safari/537.36',
}
response = requests.get(url,headers=headers)
html = etree.HTML(response.text)
print(response.text)
hrefs ='http://www.gawu88.space/'+ html.xpath('//span[@style="white-space: nowrap"]/a/@href')[0]
req = requests.get(hrefs,headers=headers)
file_name = "f:/1.torrent"
with open(file_name,"wb") as f:
    f.write(req.content)
    f.close()

3.如果我不加入headers,雖然下載下來(lái)的種子數(shù)據(jù)不再為0,但是下載的種子文件是一個(gè)空文件,里面沒(méi)有下載數(shù)據(jù)。
4.我想知道的是為什么不能夠下載種子文件,有沒(méi)有什么解決方法?還是我的請(qǐng)求頭headers構(gòu)造有問(wèn)題?希望各位朋友能夠幫忙解決一下。謝謝。

回答
編輯回答
扯不斷

為什么不試試萬(wàn)能的wireshark呢?抓個(gè)包,把所有header照抄過(guò)來(lái),再一個(gè)一個(gè)去掉,看看是哪個(gè)header有影響咯。當(dāng)然也有可能是服務(wù)器要求你必須先對(duì)你的referer發(fā)送一次get請(qǐng)求,還有可能是文件的下載和報(bào)錯(cuò)方式不對(duì)。反正抓個(gè)包看看就知道啦

2018年9月15日 19:39