Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Upstream changes: https://github.com/sparklemotion/nokogiri/releases/tag/v1.15.3 https://github.com/sparklemotion/nokogiri/releases/tag/v1.15.2 https://github.com/sparklemotion/nokogiri/releases/tag/v1.15.1 https://github.com/sparklemotion/nokogiri/releases/tag/v1.15.0 1.15.3 / 2023-07-05 Fixed * Passing an object that is not a kind of XML::Node as the first parameter to CDATA.new now raises a TypeError. Previously this would result in either a segfault (CRuby) or a Java exception (JRuby). [#2920] * Passing an object that is not a kind of XML::Node as the first parameter to Schema.from_document now raises a TypeError. Previously this would result in either a segfault (CRuby) or a Java exception (JRuby). [#2920] * [CRuby] Passing an object that is not a kind of XML::Node as the second parameter to Text.new now raises a TypeError. Previously this would result in a segfault. [#2920] * [CRuby] Replacing a node's children via methods like Node#inner_html=, # children=, and #replace no longer defensively dups the node's next sibling if it is a Text node. This behavior was originally adopted to work around libxml2's memory management (see #283 and #595) but should not have included operations involving xmlAddChild(). [#2916] * [JRuby] Fixed NPE when serializing an unparented HTML node. [#2559, #2895] (Thanks, @cbasguti!) 1.15.2 / 2023-05-24 Dependencies * [JRuby] Vendored org.nokogiri:nekodtd is updated to v0.1.11.noko2. This is functionally equivalent to v0.1.11.noko1 but restores support for Java 8. Fixed * [JRuby] Java 8 support is restored, fixing a regression present in v1.14.0..v1.14.4 and v1.15.0..v1.15.1. [#2887] 1.15.1 / 2023-05-19 Dependencies * [CRuby] Vendored libxml2 is updated to v2.11.4 from v2.11.3. For details please see https://gitlab.gnome.org/GNOME/libxml2/-/releases/v2.11.4 Fixed * [CRuby] The libxml2 update fixes an encoding regression when push-parsing UTF-8 sequences. [#2882, upstream issue and commit] 1.15.0 / 2023-05-15 Notes Ability to opt into system malloc and free Since 2009, Nokogiri has configured libxml2 to use ruby_xmalloc et al for memory management. This has provided benefits for memory management, but comes with a performance penalty. Users can now opt into using system malloc for libxml2 memory management by setting an environment variable: # "default" here means "libxml2's default" which is system malloc NOKOGIRI_LIBXML_MEMORY_MANAGEMENT=default Benchmarks show that this setting will significantly improve performance, but be aware that the tradeoff may involve poorer memory management including bloated heap sizes and/or OOM conditions. You can read more about this in the decision record at adr/ 2023-04-libxml-memory-management.md. Dependencies * [CRuby] Vendored libxml2 is updated to v2.11.3 from v2.10.4. For details please see: + https://gitlab.gnome.org/GNOME/libxml2/-/releases/v2.11.0 + https://gitlab.gnome.org/GNOME/libxml2/-/releases/v2.11.1 + https://gitlab.gnome.org/GNOME/libxml2/-/releases/v2.11.2 + https://gitlab.gnome.org/GNOME/libxml2/-/releases/v2.11.3 * [CRuby] Vendored libxslt is updated to v1.1.38 from v1.1.37. For details please see: + https://gitlab.gnome.org/GNOME/libxslt/-/releases/v1.1.38 Added * Encoding objects may now be passed to serialization methods like #to_xml, # to_html, #serialize, and #write_to to specify the output encoding. Previously only encoding names (strings) were accepted. [#2774, #2798] (Thanks, @ellaklara!) * [CRuby] Users may opt into using system malloc for libxml2 memory management. For more detail, see note above or adr/ 2023-04-libxml-memory-management.md. Changed * [CRuby] Schema.from_document now makes a defensive copy of the document if it has blank text nodes with Ruby objects instantiated for them. This prevents unsafe behavior in libxml2 from causing a segfault. There is a small performance cost, but we think this has the virtue of being "what the user meant" since modifying the original is surprising behavior for most users. Previously this was addressed in v1.10.9 by raising an exception. Fixed * [CRuby] XSLT.transform now makes a defensive copy of the document if it has blank text nodes with Ruby objects instantiated for them and the template uses xsl:strip-spaces. This prevents unsafe behavior in libxslt from causing a segfault. There is a small performance cost, but we think this has the virtue of being "what the user meant" since modifying the original is surprising behavior for most users. Previously this would allow unsafe memory access and potentially segfault. [#2800] Improved * Nokogiri::XML::Node::SaveOptions#inspect now shows the names of the options set in the bitmask, similar to ParseOptions. [#2767] * #inspect and pretty-printing are improved for AttributeDecl, ElementContent, ElementDecl, and EntityDecl. * [CRuby] The C extension now uses Ruby's TypedData API for managing all the libxml2 structs. Write barriers may improve GC performance in some extreme cases. [#2808] (Thanks, @etiennebarrie and @byroot!) * [CRuby] ObjectSpace.memsize_of reports a pretty good guess of memory usage when called on Nokogiri::XML::Document objects. [#2807] (Thanks, @etiennebarrie and @byroot!) * [CRuby] Users installing the "ruby" platform gem and compiling libxml2 and libxslt from source will now be using a modern config.guess and config.sub that supports new architectures like loongarch64. [#2831] (Thanks, @zhangwenlong8911!) * [CRuby] HTML5 parser: + adjusts the specified attributes, adding xlink:arcrole and removing xml:base [#2841, #2842] + allows <hr> in <select> [whatwg/html#3410, whatwg/html#9124] * [JRuby] Node#first_element_child now returns nil if there are only non-element children. Previously a null pointer exception was raised. [# 2808, #2844] * Documentation for Nokogiri::XSLT now has usage examples including custom function handlers. Deprecated * Passing a Nokogiri::XML::Node as the first parameter to CDATA.new is deprecated and will generate a warning. This parameter should be a kind of Nokogiri::XML::Document. This will become an error in a future version of Nokogiri. * Passing a Nokogiri::XML::Node as the first parameter to Schema.from_document is deprecated and will generate a warning. This parameter should be a kind of Nokogiri::XML::Document. This will become an error in a future version of Nokogiri. * Passing a Nokogiri::XML::Node as the second parameter to Text.new is deprecated and will generate a warning. This parameter should be a kind of Nokogiri::XML::Document. This will become an error in a future version of Nokogiri. * [CRuby] Calling a custom XPath function without the nokogiri namespace is deprecated and will generate a warning. Support for non-namespaced functions will be removed in a future version of Nokogiri. (Note that JRuby has never supported non-namespaced custom XPath functions.)
- Loading branch information