xref: /aosp_15_r20/external/jsoup/src/main/javadoc/overview.html (revision 6da8f8c4bc310ad659121b84dd089062417a2ce2)
1*6da8f8c4SAndroid Build Coastguard Worker<!DOCTYPE html>
2*6da8f8c4SAndroid Build Coastguard Worker<html lang="en">
3*6da8f8c4SAndroid Build Coastguard Worker<head>
4*6da8f8c4SAndroid Build Coastguard Worker  <title>jsoup Javadoc overview</title>
5*6da8f8c4SAndroid Build Coastguard Worker</head>
6*6da8f8c4SAndroid Build Coastguard Worker<body>
7*6da8f8c4SAndroid Build Coastguard Worker<h1>jsoup: Java HTML parser that makes sense of real-world HTML soup.</h1>
8*6da8f8c4SAndroid Build Coastguard Worker
9*6da8f8c4SAndroid Build Coastguard Worker<p><b>jsoup</b> is a Java library for working with real-world HTML. It provides a very convenient API for fetching URLs
10*6da8f8c4SAndroid Build Coastguard Worker  and extracting and manipulating data, using the best of HTML5 DOM methods and CSS selectors.</p>
11*6da8f8c4SAndroid Build Coastguard Worker
12*6da8f8c4SAndroid Build Coastguard Worker<p>jsoup implements the <a href="https://html.spec.whatwg.org/multipage/">WHATWG HTML</a> specification, and parses HTML to the same DOM
13*6da8f8c4SAndroid Build Coastguard Worker  as modern browsers do.</p>
14*6da8f8c4SAndroid Build Coastguard Worker
15*6da8f8c4SAndroid Build Coastguard Worker<ul>
16*6da8f8c4SAndroid Build Coastguard Worker  <li>parse HTML from a URL, file, or string
17*6da8f8c4SAndroid Build Coastguard Worker  <li>find and extract data, using DOM traversal or CSS selectors
18*6da8f8c4SAndroid Build Coastguard Worker  <li>manipulate the HTML elements, attributes, and text
19*6da8f8c4SAndroid Build Coastguard Worker  <li>clean user-submitted content against a safelist, to prevent XSS
20*6da8f8c4SAndroid Build Coastguard Worker  <li>output tidy HTML
21*6da8f8c4SAndroid Build Coastguard Worker</ul>
22*6da8f8c4SAndroid Build Coastguard Worker
23*6da8f8c4SAndroid Build Coastguard Worker<p>jsoup is designed to deal with all varieties of HTML found in the wild; from pristine and validating,
24*6da8f8c4SAndroid Build Coastguard Worker  to invalid tag-soup; jsoup will create a sensible parse tree.</p>
25*6da8f8c4SAndroid Build Coastguard Worker
26*6da8f8c4SAndroid Build Coastguard Worker<p>See <a href="https://jsoup.org/"><b>jsoup.org</b></a> for downloads, documentation, and examples.</p>
27*6da8f8c4SAndroid Build Coastguard Worker
28*6da8f8c4SAndroid Build Coastguard Worker@author <a href="https://jonathanhedley.com/">Jonathan Hedley</a>
29*6da8f8c4SAndroid Build Coastguard Worker
30*6da8f8c4SAndroid Build Coastguard Worker</body>
31*6da8f8c4SAndroid Build Coastguard Worker</html>
32