怎么使用beautifulsoup获取指定div标签内容

 我来答

2个回答

#合辑# 机票是越早买越便宜吗？

dayinspring

高粉答主

2016-02-21 · 繁杂信息太多，你要学会辨别

知道大有可为答主

回答量：2.3万

采纳率：92%

帮助的人：3552万

我也去答题访问个人页

关注

展开全部

代码如下：
print(soup.prettify())
# <html>
# <head>
# <title>
# The Dormouse's story
# </title>
# </head>
# <body>
# 
# 
# The Dormouse's story
# 
# 
# 
# Once upon a time there were three little sisters; and their names were
# <a class="sister" href="http://example.com/elsie" id="link1">
# Elsie
# </a>
# ,
# <a class="sister" href="http://example.com/lacie" id="link2">
# Lacie
# </a>
# and
# <a class="sister" href="http://example.com/tillie" id="link2">
# Tillie
# </a>
# ; and they lived at the bottom of a well.
# 
# 
# ...
# 
# </body>
# </html>

Here are some simple ways to navigate that data structure:
soup.title
# <title>The Dormouse's story</title>

soup.title.name
# u'title'

soup.title.string
# u'The Dormouse's story'

soup.title.parent.name
# u'head'

soup.p
# The Dormouse's story

soup.p['class']
# u'title'

soup.a
# <a class="sister" href="http://example.com/elsie" id="link1">Elsie</a>

soup.find_all('a')
# [<a class="sister" href="http://example.com/elsie" id="link1">Elsie</a>,
# <a class="sister" href="http://example.com/lacie" id="link2">Lacie</a>,
# <a class="sister" href="http://example.com/tillie" id="link3">Tillie</a>]

soup.find(id="link3")
# <a class="sister" href="http://example.com/tillie" id="link3">Tillie</a>

已赞过 已踩过<

评论收起

匿名用户
推荐于2016-08-04

展开全部

f = urllib2.urlopen(url)
req = f.read()
  
soup = BeautifulSoup(req)
content = soup.findAll(attrs={"name":"readonlycounter2"})
subId = content[0].string.split(',')[1]
subName = soup.html.body.h1.span.string
  
content = soup.findAll(attrs={"class":"subdes_td"})
subType = content[0].string
subLeg = content[1].string
  
content = soup.findAll(attrs={"colspan":"3"})
subTime = content[2].string
subFile = content[7].div.string

本回答被提问者和网友采纳

已赞过已踩过<

你对这个回答的评价是？
评论收起

推荐律师服务：若未解决您的问题，请您详细描述您的问题，通过百度律临进行免费专业咨询

怎么使用beautifulsoup获取指定div标签内容

其他类似问题

为你推荐：