jsoup 1.9.1 发布,HTML 解析器_html/css_WEB-ITnose
jsoup 1.9.1 发布。
更新日志:
改进:
-
Added support for HTTP and SOCKS request proxies, specifiable per connection. See Connection.proxy(String, int).
-
Added support for sending plain HTTP request bodies in POST and PUT requests, with Connection.requestBody(String).
-
Added support in Jsoup.Connect() for HEAD, OPTIONS, and TRACE.
-
Added support for HTTP 307 Temporary Redirect (replays posts, if applicable).
-
Performance improvements when parsing HTML, particularly on Android Dalvik.
-
Added support for writing HTML into Appendable objects (like OutputStreamWriter), to enable stream serialization. See Node.html(T)
-
Added support for XML namespaces when converting jsoup documents to W3C documents.
-
Added support for UTF-16 and UTF-32 character set detection from byte-order-marks (BOM).
-
Added support for tags with non-ascii (unicode) letters.
-
Added Connection.data(String) to retrieve a data KeyVal by its key. Useful to update form data before submission.
Bug 修复
-
Fixed an issue in the Parent selector where it would not match against the root element it was applied to.
-
Fix an issue where Elements.select(String) would not return every matching element if they had the same content.
-
Added not-null validators to Element.appendText() and Element.prependText()
-
Fixed an issue when moving moving nodes using Element.insert(int, Collection) where the sibling index would be set incorrectly, leading to the original loads being lost.
-
Reverted Node.equals() and Node.hashCode() back to identity (object) comparisons, as deep content inspection had negative performance impacts and hashkey stability problems. Functionality replaced with Node.hasSameValue().
-
In Connection, if the same header key is seen multiple times, combine their values with a comma per the HTTP RFC, instead of keeping just one value. Also fixes an issue where header values could be out of order.
下载地址:
-
Source code (zip)
-
Source code (tar.gz)
jsoup 是一款 Java 的HTML 解析器,可直接解析某个URL地址、HTML文本内容。它提供了一套非常省力的API,可通过DOM,CSS以及类似于 JQuery 的操作方法来取出和操作数据。
jsoup的主要功能如下:
-
从一个URL,文件或字符串中解析HTML;
-
使用DOM或CSS选择器来查找、取出数据;
-
可操作HTML元素、属性、文本;
jsoup是基于MIT协议发布的,可放心使用于商业项目。

Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

Video Face Swap
Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics

HTML is suitable for beginners because it is simple and easy to learn and can quickly see results. 1) The learning curve of HTML is smooth and easy to get started. 2) Just master the basic tags to start creating web pages. 3) High flexibility and can be used in combination with CSS and JavaScript. 4) Rich learning resources and modern tools support the learning process.

HTML defines the web structure, CSS is responsible for style and layout, and JavaScript gives dynamic interaction. The three perform their duties in web development and jointly build a colorful website.

WebdevelopmentreliesonHTML,CSS,andJavaScript:1)HTMLstructurescontent,2)CSSstylesit,and3)JavaScriptaddsinteractivity,formingthebasisofmodernwebexperiences.

GiteePages static website deployment failed: 404 error troubleshooting and resolution when using Gitee...

AnexampleofastartingtaginHTMLis,whichbeginsaparagraph.StartingtagsareessentialinHTMLastheyinitiateelements,definetheirtypes,andarecrucialforstructuringwebpagesandconstructingtheDOM.

To achieve the effect of scattering and enlarging the surrounding images after clicking on the image, many web designs need to achieve an interactive effect: click on a certain image to make the surrounding...

The Y-axis position adaptive algorithm for web annotation function This article will explore how to implement annotation functions similar to Word documents, especially how to deal with the interval between annotations...

HTML, CSS and JavaScript are the three pillars of web development. 1. HTML defines the web page structure and uses tags such as, etc. 2. CSS controls the web page style, using selectors and attributes such as color, font-size, etc. 3. JavaScript realizes dynamic effects and interaction, through event monitoring and DOM operations.
