[Question] Web scraping question

Board: Python | Author: xm3fu0 (你爸爸的蛋) | Time: 6 years ago (2018/09/29 00:27) | Push/boo: 1 (1 push, 0 boo, 8 →)

9 comments, 3 participants, thread 5/5

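Here is the situation: I have a csv containing n URLs. The URLs all share a similar format, and for each one my goal is to extract the table on the page. How should I write this? The code I wrote myself is below:

    import requests
    from bs4 import BeautifulSoup

    f = open(r"C:\python\scripts\xxx.csv","r")
    lines=f.readlines()
    lens=len(lines)
    list = []
    for index in range(lens):
        temp = lines[index]
        res = requests.get(temp)
        soup = BeautifulSoup(res.text)
        list.append(soup.select('table')[0])

I tried putting I+=1 inside the loop and found that execution never gets all the way through temp = lines[index].

Note: the xxx.csv file contains only URLs, in a single column, and every entry has the form http:\\......

Thanks in advance to all the experts here (kneels)

--
※ Origin: 批踢踢實業坊 (ptt.cc), From: 101.15.82.53
※ Article URL: https://www.ptt.cc/bbs/Python/M.1538152032.A.3F3.html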
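For readers with the same problem, here is a minimal sketch of one way the loop could be tightened up. It is not taken from the replies in this thread, and it only addresses things visible in the posted code: readlines() keeps the trailing newline on every line, so each URL handed to requests.get() ends in "\n"; BeautifulSoup(res.text) without a parser argument emits a warning; soup.select('table')[0] raises IndexError on a page with no table; and the name list shadows the built-in. If the file really contains http:\\ with backslashes, those entries would also need to be corrected to http:// first. The helper names read_urls and fetch_first_tables, the utf-8-sig encoding, and the 10-second timeout are assumptions made for illustration.

```python
import csv


def read_urls(csv_path):
    """Return the non-empty URLs from a one-column CSV, whitespace stripped."""
    urls = []
    # Assumption: the file is UTF-8 (with or without a BOM); adjust if not.
    with open(csv_path, newline="", encoding="utf-8-sig") as f:
        for row in csv.reader(f):
            # Skip blank lines; strip() removes stray spaces around the URL.
            if row and row[0].strip():
                urls.append(row[0].strip())
    return urls


def fetch_first_tables(urls, timeout=10):
    """Fetch each URL and return its first <table> element, or None."""
    # Third-party imports live here so read_urls() works without them.
    import requests
    from bs4 import BeautifulSoup

    tables = []  # not named `list`, which would shadow the built-in
    for url in urls:
        res = requests.get(url, timeout=timeout)
        res.raise_for_status()  # stop on HTTP errors instead of parsing them
        soup = BeautifulSoup(res.text, "html.parser")
        # select_one returns None when the page has no <table>.
        tables.append(soup.select_one("table"))
    return tables
```

With the path from the post, usage would look like tables = fetch_first_tables(read_urls(r"C:\python\scripts\xxx.csv")). Printing each url inside the loop is a simple way to see which entry, if any, makes a request fail.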

→

s860134

09/29 03:11, 6 years ago, 1F

→

s860134

09/29 03:12, 6 years ago, 2F

→

s860134

09/29 03:12, 6 years ago, 3F