版本信息:
python 2.7.12
lxml 3.8.0
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
|
from lxml import etree html_str = """ <div id="box1">this from blog.csdn.net/lncxydjq , DO NOT COPY! <div id="box2">***** <!--can u get me, bitch?--> </div> </div> """ html = etree.HTML(html_str) print html.xpath( '//div[@id="box1"]/div/node()' )[ 1 ] print type (html.xpath( '//div[@id="box1"]/div/node()' )[ 1 ]) print html.xpath( '//div[@id="box1"]/div/node()' )[ 1 ].text """output: <!--can u get me, bitch?--> <type 'lxml.etree._Comment'> can u get me, bitch? """ |
以上这篇python xpath获取页面注释的方法就是小编分享给大家的全部内容了,希望能给大家一个参考,也希望大家多多支持服务器之家。
原文链接:https://blog.csdn.net/lncxydjq/article/details/77880824