[Java] 14-3 Parsing HTML with jsoup's DOM methods - 給你魚竿

DOM stands for Document Object Model.

In other words, the whole HTML file is treated as a tree made up of many nodes, each with its own content.
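To make the tree idea concrete, here is a minimal sketch that parses a small HTML string and prints the tag name of every element, indented by its depth. The class name `DomTreeSketch` and the helper `outline` are made up for this illustration; only `Jsoup.parse`, `child`, `children` and `tagName` come from jsoup.

```java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;

public class DomTreeSketch {
    // Recursively append each element's tag name, indented by its depth in the tree.
    static void outline(Element element, int depth, StringBuilder out) {
        for (int i = 0; i < depth; i++) {
            out.append("  ");
        }
        out.append(element.tagName()).append('\n');
        for (Element child : element.children()) {
            outline(child, depth + 1, out);
        }
    }

    // Parse the HTML and return the indented outline, starting at the <html> element.
    static String outline(String html) {
        Document doc = Jsoup.parse(html);
        StringBuilder out = new StringBuilder();
        outline(doc.child(0), 0, out);
        return out.toString();
    }

    public static void main(String[] args) {
        String html = "<html><head><title>First parse</title></head>"
                + "<body><p>Line 1.</p><p>Line 2.</p></body></html>";
        System.out.print(outline(html));
    }
}
```

Each level of indentation in the printed outline corresponds to one parent-child step in the tree.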

Here is how to use it.

1. Official site: https://jsoup.org/cookbook/extracting-data/dom-navigation

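2. This example pulls each line out of the HTML

The results can be held in either an Elements (a list of matches) or a single Element.

They are obtained with methods such as getElementsByTag("tag name").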

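import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;
import org.jsoup.select.Elements;

String html = "<html><head><title>First parse</title></head>"
        + "<body>"
        + "<p>Line 1.</p>"
        + "<p>Line 2.</p>"
        + "<p>Line 3.</p>"
        + "</body></html>";
// Parse the HTML string into a DOM tree
Document doc = Jsoup.parse(html);

// Collect every <p> element and print its text
Elements contents = doc.getElementsByTag("p");
for (Element content : contents) {
    System.out.println(content.text());
}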
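The difference between Elements and Element can be sketched as follows: a tag lookup returns a list that may hold zero or more matches, while an id lookup returns one element or null. The class `ElementVsElements`, its helper methods and the `id="intro"` attribute are assumptions added for this illustration.

```java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Element;
import org.jsoup.select.Elements;

public class ElementVsElements {
    // Sample markup for this sketch; the id attribute is not in the original example.
    static final String HTML = "<html><body>"
            + "<p id=\"intro\">Line 1.</p><p>Line 2.</p><p>Line 3.</p>"
            + "</body></html>";

    // getElementsByTag returns an Elements list; it is empty when nothing matches.
    static int countByTag(String html, String tag) {
        Elements matches = Jsoup.parse(html).getElementsByTag(tag);
        return matches.size();
    }

    // getElementById returns a single Element, or null when the id is absent.
    static String textById(String html, String id) {
        Element element = Jsoup.parse(html).getElementById(id);
        return element == null ? null : element.text();
    }

    public static void main(String[] args) {
        System.out.println(countByTag(HTML, "p"));        // 3
        System.out.println(textById(HTML, "intro"));      // Line 1.
        System.out.println(textById(HTML, "missing"));    // null
    }
}
```

Checking for null after an id lookup avoids a NullPointerException when the page does not contain the expected element.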

3. The related methods are listed below

Finding elements

getElementById(String id)
getElementsByTag(String tag)
getElementsByClass(String className)
getElementsByAttribute(String key) (and related methods)
Element siblings: siblingElements(), firstElementSibling(), lastElementSibling(), nextElementSibling(), previousElementSibling()
Graph: parent(), children(), child(int index)
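A few of the lookup and navigation methods above can be tried together in a small sketch. The markup, the class name `FindElementsSketch` and its helpers are hypothetical and exist only for this illustration.

```java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;

public class FindElementsSketch {
    // Hypothetical sample markup, used only for this illustration.
    static final String HTML = "<div id=\"menu\">"
            + "<a class=\"item\" href=\"/a\">A</a>"
            + "<a class=\"item\" href=\"/b\">B</a>"
            + "<span>C</span>"
            + "</div>";

    // Count the elements that carry the given CSS class.
    static int countByClass(String className) {
        return Jsoup.parse(HTML).getElementsByClass(className).size();
    }

    // Count the elements that have the given attribute, whatever its value.
    static int countByAttribute(String key) {
        return Jsoup.parse(HTML).getElementsByAttribute(key).size();
    }

    // Start at the first child of #menu and look at its parent and siblings.
    static String describeFirstChild() {
        Document doc = Jsoup.parse(HTML);
        Element first = doc.getElementById("menu").child(0);
        return "parent=" + first.parent().id()
                + " next=" + first.nextElementSibling().text()
                + " last=" + first.lastElementSibling().text()
                + " siblings=" + first.siblingElements().size();
    }

    public static void main(String[] args) {
        System.out.println(countByClass("item"));      // 2
        System.out.println(countByAttribute("href"));  // 2
        System.out.println(describeFirstChild());      // parent=menu next=B last=C siblings=2
    }
}
```

Note that siblingElements() does not include the element it is called on, which is why the count is 2 rather than 3.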
Element data
- attr(String key) to get and attr(String key, String value) to set attributes
- attributes() to get all attributes
- id(), className() and classNames()
- text() to get and text(String value) to set the text content
- html() to get and html(String value) to set the inner HTML content
- outerHtml() to get the outer HTML value
- data() to get data content (e.g. of script and style tags)
- tag() and tagName()
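The getters above can be compared side by side on one element. This is only a sketch: the markup and the class `ElementDataSketch` are invented for the illustration, and the exact whitespace printed by html() and outerHtml() may depend on jsoup's output settings.

```java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Element;

public class ElementDataSketch {
    // Hypothetical sample markup: one link plus a script block.
    static final String HTML =
            "<a id=\"home\" class=\"nav main\" href=\"/index.html\">Go <b>home</b></a>"
            + "<script>var x = 1;</script>";

    // Look up the link element by its id.
    static Element link() {
        return Jsoup.parse(HTML).getElementById("home");
    }

    // data() returns the raw content of a script (or style) element.
    static String scriptData() {
        return Jsoup.parse(HTML).getElementsByTag("script").first().data();
    }

    public static void main(String[] args) {
        Element link = link();
        System.out.println(link.tagName());      // a
        System.out.println(link.id());           // home
        System.out.println(link.className());    // nav main
        System.out.println(link.classNames());   // the same classes as a Set
        System.out.println(link.attr("href"));   // /index.html
        System.out.println(link.text());         // Go home
        System.out.println(link.html());         // inner HTML, including the <b> tag
        System.out.println(link.outerHtml());    // the <a> element itself plus its content
        System.out.println(scriptData());        // var x = 1;
    }
}
```

text() strips the markup and keeps only the visible text, while html() keeps the nested tags; data() is the one to use for script and style content, where text() comes back empty.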
Manipulating HTML or text
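The post does not list the manipulation methods, so the following is only a sketch using a few jsoup Element methods that modify the tree in place: append(String html), prepend(String html), text(String value) and attr(String key, String value). The markup and the class `ManipulateSketch` are assumptions made for this illustration.

```java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;

public class ManipulateSketch {
    // Build a small document, modify the #box element, and return it.
    static Element build() {
        Document doc = Jsoup.parse("<div id=\"box\"><p>Line 1.</p></div>");
        Element box = doc.getElementById("box");

        box.append("<p>Line 2.</p>");     // add HTML after the existing children
        box.prepend("<h1>Title</h1>");    // add HTML before the existing children
        box.child(1).text("Line one.");   // replace the text of the original <p>
        box.attr("class", "content");     // set (or overwrite) an attribute
        return box;
    }

    public static void main(String[] args) {
        Element box = build();
        for (Element child : box.children()) {
            System.out.println(child.tagName() + ": " + child.text());
        }
        System.out.println(box.outerHtml());
    }
}
```

These calls change the parsed tree only; the original HTML string is left untouched, and outerHtml() gives the updated markup.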

RX1226
