1*6da8f8c4SAndroid Build Coastguard Worker<!DOCTYPE html> 2*6da8f8c4SAndroid Build Coastguard Worker<html lang="en"> 3*6da8f8c4SAndroid Build Coastguard Worker<head> 4*6da8f8c4SAndroid Build Coastguard Worker <title>jsoup Javadoc overview</title> 5*6da8f8c4SAndroid Build Coastguard Worker</head> 6*6da8f8c4SAndroid Build Coastguard Worker<body> 7*6da8f8c4SAndroid Build Coastguard Worker<h1>jsoup: Java HTML parser that makes sense of real-world HTML soup.</h1> 8*6da8f8c4SAndroid Build Coastguard Worker 9*6da8f8c4SAndroid Build Coastguard Worker<p><b>jsoup</b> is a Java library for working with real-world HTML. It provides a very convenient API for fetching URLs 10*6da8f8c4SAndroid Build Coastguard Worker and extracting and manipulating data, using the best of HTML5 DOM methods and CSS selectors.</p> 11*6da8f8c4SAndroid Build Coastguard Worker 12*6da8f8c4SAndroid Build Coastguard Worker<p>jsoup implements the <a href="https://html.spec.whatwg.org/multipage/">WHATWG HTML</a> specification, and parses HTML to the same DOM 13*6da8f8c4SAndroid Build Coastguard Worker as modern browsers do.</p> 14*6da8f8c4SAndroid Build Coastguard Worker 15*6da8f8c4SAndroid Build Coastguard Worker<ul> 16*6da8f8c4SAndroid Build Coastguard Worker <li>parse HTML from a URL, file, or string 17*6da8f8c4SAndroid Build Coastguard Worker <li>find and extract data, using DOM traversal or CSS selectors 18*6da8f8c4SAndroid Build Coastguard Worker <li>manipulate the HTML elements, attributes, and text 19*6da8f8c4SAndroid Build Coastguard Worker <li>clean user-submitted content against a safelist, to prevent XSS 20*6da8f8c4SAndroid Build Coastguard Worker <li>output tidy HTML 21*6da8f8c4SAndroid Build Coastguard Worker</ul> 22*6da8f8c4SAndroid Build Coastguard Worker 23*6da8f8c4SAndroid Build Coastguard Worker<p>jsoup is designed to deal with all varieties of HTML found in the wild; from pristine and validating, 24*6da8f8c4SAndroid Build Coastguard Worker to invalid tag-soup; jsoup will create a sensible parse tree.</p> 25*6da8f8c4SAndroid Build Coastguard Worker 26*6da8f8c4SAndroid Build Coastguard Worker<p>See <a href="https://jsoup.org/"><b>jsoup.org</b></a> for downloads, documentation, and examples.</p> 27*6da8f8c4SAndroid Build Coastguard Worker 28*6da8f8c4SAndroid Build Coastguard Worker@author <a href="https://jonathanhedley.com/">Jonathan Hedley</a> 29*6da8f8c4SAndroid Build Coastguard Worker 30*6da8f8c4SAndroid Build Coastguard Worker</body> 31*6da8f8c4SAndroid Build Coastguard Worker</html> 32