如何用scrapy提取不在标签内的文字

 我来答

1个回答

#热议# 网上掀起『练心眼子』风潮，真的能提高情商吗？

折柳成萌

高粉答主

2017-09-30 · 繁杂信息太多，你要学会辨别

知道顶级答主

回答量：4.4万

采纳率：96%

帮助的人：6290万

我也去答题访问个人页

关注

展开全部

代码如下

def parse(self,response):
states = {}
list1 = []
list2 = []

for row in response.xpath("//*[@id='info']/*"):
if row.xpath("span[@class='pl']/text()"):
title = row.xpath("span[@class='pl']/text()").extract()[0].strip()
text = row.xpath("a/text()").extract()[0].strip()
states[title]=text
elif row.xpath("text()"):
list1.append(row.xpath("text()").extract()[0].strip()[:-1])

for row in response.xpath("//*[@id='info']/text()").extract():
if row.strip():
list2.append(row.strip())

for i in range(len(list1)):
states[list1[i]]=list2[i]

for n in states:
print n,states[n]

本回答由提问者推荐

已赞过 已踩过<

评论收起

推荐律师服务：若未解决您的问题，请您详细描述您的问题，通过百度律临进行免费专业咨询

如何用scrapy提取不在标签内的文字

其他类似问题

为你推荐：