Today I had a task that needed both selenium and beautifulsoup: grab the HTML source of a single node and convert it into a soup object.
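from selenium.webdriver.common.by import By
# find_element_by_id was removed in Selenium 4; `self` is assumed to be a WebDriver (or a subclass of one)
ele = self.find_element(By.ID, "id")
html = ele.get_attribute("innerHTML")  # use "outerHTML" to include the node's own tag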
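The node-to-soup step could look roughly like the sketch below. It assumes BeautifulSoup 4 is installed and uses the built-in `html.parser`. The names `element_to_soup` and `FakeElement` are hypothetical, introduced only for illustration; the stub stands in for a real Selenium WebElement so the snippet runs without a browser.

```python
from bs4 import BeautifulSoup


def element_to_soup(element, attribute="outerHTML"):
    """Parse one element's markup into a BeautifulSoup object.

    `element` is anything with a Selenium-style get_attribute() method.
    """
    html = element.get_attribute(attribute)
    # Fall back to an empty document if the attribute is missing (None)
    return BeautifulSoup(html or "", "html.parser")


class FakeElement:
    """Hypothetical stand-in for a Selenium WebElement, for offline runs."""

    def __init__(self, inner, outer):
        self._attrs = {"innerHTML": inner, "outerHTML": outer}

    def get_attribute(self, name):
        return self._attrs.get(name)


fake = FakeElement(
    inner="<span>span1 inside div2</span>",
    outer='<div id="div2"><span>span1 inside div2</span></div>',
)
soup = element_to_soup(fake)
print(soup.div["id"])        # div2
print(soup.span.get_text())  # span1 inside div2
```

With a real driver, passing the element returned by `find_element` in place of `fake` should work the same way, since only `get_attribute` is called on it.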
innerHTML
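Returns the content inside the current tag. If the tag contains child tags, the child tags themselves and their text are returned as well; the current tag's own opening and closing tags are not included.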
<body>
<div id="div1">This is a div tag</div>
<span id="span1">This is a span tag</span>
<div id="div2">
<span>span1 inside div2</span>
<span>span2 inside div2</span>
<img src="" alt="">
</div>
<script>
var div1 = document.getElementById("div1").innerHTML;
var span1 = document.getElementById("span1").innerHTML;
var div2 = document.getElementById("div2").innerHTML;
console.log(div1);
console.log(span1);
console.log(div2);
</script>
</body>
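On the BeautifulSoup side, a rough counterpart to innerHTML is `Tag.decode_contents()`, which serializes only an element's children. The sketch below assumes BeautifulSoup 4 with the built-in `html.parser` and uses a trimmed, single-line version of the sample markup so the output is easy to read.

```python
from bs4 import BeautifulSoup

page = (
    '<div id="div1">This is a div tag</div>'
    '<div id="div2"><span>span1 inside div2</span>'
    '<span>span2 inside div2</span></div>'
)
soup = BeautifulSoup(page, "html.parser")

# decode_contents() serializes only the children, much like innerHTML
inner_div1 = soup.find(id="div1").decode_contents()
inner_div2 = soup.find(id="div2").decode_contents()

print(inner_div1)  # This is a div tag
print(inner_div2)  # <span>span1 inside div2</span><span>span2 inside div2</span>
```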
outerHTML
Returns the current tag itself together with its content. If the tag has child tags, those child tags and their text are included as well.
<body>
<div id="div1">This is a div tag</div>
<span id="span1">This is a span tag</span>
<div id="div2">
<span>span1 inside div2</span>
<span>span2 inside div2</span>
<img src="" alt="">
</div>
<script>
var div1 = document.getElementById("div1").outerHTML;
var span1 = document.getElementById("span1").outerHTML;
var div2 = document.getElementById("div2").outerHTML;
console.log(div1);
console.log(span1);
console.log(div2);
</script>
</body>
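In BeautifulSoup, `str(tag)` plays a role similar to outerHTML: it serializes the element's own tag along with everything inside it. The sketch below assumes BeautifulSoup 4 with the built-in `html.parser` and a trimmed version of the sample markup. It places the outer and inner forms side by side to show that they differ only by the wrapping tag.

```python
from bs4 import BeautifulSoup

page = '<div id="div2"><span>span1 inside div2</span></div>'
soup = BeautifulSoup(page, "html.parser")
div2 = soup.find(id="div2")

outer = str(div2)               # the element's own tag plus its children
inner = div2.decode_contents()  # children only

print(outer)  # <div id="div2"><span>span1 inside div2</span></div>
print(inner)  # <span>span1 inside div2</span>
```

Which form to hand to BeautifulSoup depends on whether the node's own attributes (such as its `id`) are needed after parsing; if they are, the outer form is the one that keeps them.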