xpath() around 10x slower in JRuby #741

ghostganz · 2012-08-01T15:43:38Z

For some reason the xpath() method is a lot slower in nokogiri-java 1.5.5 than the MRI version.

I have code that processes around 1.5 MB of XML, using a lot of nested xpath loops. It takes only 1.7 seconds in MRI but around 150 seconds in JRuby 1.6.7.2 on Java 7. Running with VisualVM shows that almost all time is spent in XmlXpathContext.tryGetNodeSet().

I think this is a regression, because I don't recall this being so slow with older versions of Nokogiri.

jvshahid · 2012-08-01T16:20:52Z

Can you send an example document and code sample that does xpath queries similar to yours.

brianjriddle · 2012-08-02T07:42:06Z

Here is an example repo that exposes the problem

https://github.com/TV4/nokogiri-xpath-issue-741

The repo includes an example xml file and code that exposes the problem.

ghostganz · 2012-08-02T08:10:36Z

Using the example code above, I found that this is a regression introduced with nokogiri 1.5.0. It runs much faster with nokogiri 1.4.7

jvshahid · 2012-08-02T10:23:51Z

thanks, I'll take a look.

jvshahid · 2012-08-02T13:12:46Z

This isn't really a regression, Nokogiri used FFI prior to 1.5.0. Newer versions of Nokogiri have a pure java implementation which looks to be awfully slower than the C implementation.

@yokolet this looks like a performance issue with Java's JAXP XPath implementation, see this bug report. What do you think ?

yokolet · 2012-08-02T15:15:25Z

hmmm, I agree with Java's XPath implementation is very slow.

I did some research. There might be a couple of ways to improve performance.
For example, http://stackoverflow.com/questions/6340802/java-xpath-apache-jaxp-implementation-performance .

Also, we can cache compiled XPathExpression. Nokogiri's xpath method calls XmlXpathContext.evaluate method, which compiles a given xpath.

The example code repeatedly calls xpath method with the same xpath string. Our implementation compiles the same xpath every time outer each gives a next element. We can avoid this.

jvshahid · 2012-08-02T19:31:33Z

did you mean XmlXpathContext.compile ? if that's the case then i don't
think compile is the bottle neck. most of the time is spent in evaluate
which uses the compiled expression.

yokolet · 2012-08-03T04:44:55Z

I know the real bottle neck is xpath evaluation, which is on line 143 in XmlXpathContext.java. Do you think we can improve xpath evaluation itself?

I think that's very hard. So, I thought of other options we might improve performance.

Slight possibility is Xerces/JAXP options such as om.sun.org.apache.xml.internal.dtm.DTMManage might help performance.

jvshahid · 2012-09-13T17:40:17Z

i looked further into this problem, one of the problems is that we're creating a new XPathContext every time we evaluate the xpath expression which is very expensive. I looked at a couple of libraries and Xalan seems to be the best option compatible with DOM (i.e. doesn't use an internal implementation).

One solution is to use CachedXpathAPI which has the side effect of not exposing setter functions for function and variable resolvers. The other option is to use the internal api (may be subclass CachedXpathAPI). The only drawback is that Nokogiri have to maintain the XPathContext and invalidates it when the document changes, otherwise xpath expressions may return invalid results.

I tested the CachedXpathContext on my machine and ran the script in 23 seconds on JRuby as opposed to 0.7 seconds on MRI. This is better than the current state of the java implementation of Nokogiri which runs the script in 130 seconds on my machine.

what do you guys think ?

evanworley · 2012-10-04T01:08:58Z

I've seen this in other platforms/languages, and providing the ability to pre-compile and use cached xpath expressions can greatly speed up processing time. I'm currently working on a project where we are running 10-20 xpath expressions per document, over 1M documents. The penalty for compiling the xpath expression string every time is sever (20M invocations). I'd propose exposing an interface to allow xpath compilation, and then updating the element.xpath method to take either strings or pre-compiled xpath expressions.

jrochkind · 2013-07-09T17:20:10Z

The much slower performance of jruby nokogiri is a problem for cross-platform use.

Moving from FFI to native java, one would have thought, would make nokogiri work more reliably and higher performance on jruby -- but it's had the opposite effect, making me not want to use nokogiri on jruby due to performance, and making nokogiri even less suitable for a cross-platform solution.

Any hope for any work on this?

flavorjones · 2013-07-09T18:34:16Z

Hi Jonathan,

Are you trolling? I honestly can't tell. The FFI port was deprecated a few
years ago. If leaving that behind made Nokogiri unusable, I would have
expected to hear from you sooner on this topic.

The state of the world is, that there's one feature, admittedly a
commonly-used one, that's slower in native Java than in C, and there's a
proposed solution under review.

-mike

On Tue, Jul 9, 2013 at 1:20 PM, Jonathan Rochkind
[email protected]:

The much slower performance of jruby nokogiri is a problem for
cross-platform use.

Moving from FFI to native java, one would have thought, would make
nokogiri work more reliably and higher performance on jruby -- but it's had
the opposite effect, making me not want to use nokogiri on jruby due to
performance, and making nokogiri even less suitable for a cross-platform
solution.

Any hope for any work on this?

—
Reply to this email directly or view it on GitHubhttps://github.com//issues/741#issuecomment-20690360
.

flavorjones · 2013-07-09T18:47:04Z

In any case, @jvshahid and @yokolet, can we make this a priority for an upcoming release?

ghostganz · 2013-07-10T08:54:46Z

Leaving FFI for native Java did make nokogiri unusable for us, that's why I posted this bug in the first place.

These days we no longer need to parse XML documents, otherwise we'd still be stuck at 1.4.

jvshahid · 2013-07-15T20:11:05Z

I made some major modifications to the JRuby xpath code. The code on the issue-741 branch uses the internal xpath api and caches the XPathContext so we don't have to create the expensive DTMManager every time we evaluate a xpath expression. There are two drawbacks of this approach:

We're using an internal api
We now have a cached XPathContext. That means unless we have to be very careful to clear the cache everytime the dom is modified, xpath expressions might return nodes that don't exist anymore in the tree or fail to find new nodes added to the dom if we didn't clear the cache.

Currently the entire test suite passes. We had good test coverage on most of the operations that modify the dom, but I'm worried I missed a couple of them. @yokolet can you help eyeball the changes.

By the way, the test time went down from 80 seconds to 6 seconds. It's still slower than MRI (which takes less than a second), but I think it's good enough for the first pass.

Cheers,

ryanmt · 2013-11-15T18:42:39Z

For future reference, would just using css instead of xpath be faster? How does that leave jruby vs MRI?

flavorjones · 2013-11-18T12:12:15Z

On Fri, Nov 15, 2013 at 1:42 PM, Ryan Taylor [email protected]:

For future reference, would just using css instead of xpath be faster? How
does that leave jruby vs MRI?

No. Nokogiri compiles CSS queries to XPath before execution, as underlying
XML libraries don't support CSS natively.

—
Reply to this email directly or view it on GitHubhttps://github.com//issues/741#issuecomment-28592918
.

knu · 2014-04-09T09:49:54Z

@jvshahid,

test_slow_jruby_xpath is failing too often recently.
Looks like 10 seconds is not enough for Travis to complete the test.
https://travis-ci.org/sparklemotion/nokogiri/jobs/22595529#L178

Was there any regression or is it just because Travis has become slow in performance?

flavorjones · 2014-04-09T10:53:00Z

I believe this is Travis running on slow machines. The test suite takes
anywhere from 20 seconds to run (good) to 50 seconds (which causes this
test to fail).
On Apr 9, 2014 5:49 AM, "Akinori MUSHA" [email protected] wrote:

@jvshahid https://github.com/jvshahid,

test_slow_jruby_xpath is failing too often recently.
Looks like 10 seconds is not enough for Travis to complete the test.
https://travis-ci.org/sparklemotion/nokogiri/jobs/22595529#L178

Was there any regression or is it just because Travis has become slow in
performance?

Reply to this email directly or view it on GitHubhttps://github.com//issues/741#issuecomment-39946359
.

knu · 2014-04-09T15:56:09Z

OK, let's see if extending the time limit to 20 on Travis CI works... (b094730)

jvshahid · 2014-04-09T15:57:40Z

We should probably disable the test. I'm not really happy with the current way of measuring performance, it's very brittle and will break on slower machines. Is there a better way to do this ?

flavorjones · 2014-04-09T16:20:42Z

@zenspider wrote minitest/benchmark to do something similar to this,
where you can assert {constant,linear,exponential} performance
characteristics of an algorithm.

http://www.zenspider.com/projects/minitest.html#benchmarks

http://docs.seattlerb.org/minitest/Minitest/Benchmark.html

Would that be appropriate?

On Wed, Apr 9, 2014 at 11:57 AM, John Shahid [email protected]:

We should probably disable the test. I'm not really happy with the current
way of measuring performance, it's very brittle and will break on slower
machines. Is there a better way to do this ?

Reply to this email directly or view it on GitHubhttps://github.com//issues/741#issuecomment-39981422
.

jvshahid · 2014-04-09T16:41:03Z

I actually prefer this over timing the operation. The test will have to modified slightly to generate a xml document that's proportional to n. I'll try to get this done over the weekend and go through the backlog of the JRuby stories.

knu · 2014-04-10T03:25:09Z

Sounds great. Obviously, an absolute time limit wouldn't be the way to go.

I tweaked the value specially for Travis CI for the moment, so unless this is a regression we can move forward to the upcoming release.

flavorjones · 2017-01-27T19:57:20Z

@jvshahid this test is failing in the concourse pipelines, because of noisy neighbor problems when all the tests are running at the same time.

Is there a real worry about regression here? If not, maybe we just delete it rather than make it more complicated? I'm sensitive to the amount of work we'd need to put in to make this not-brittle.

jvshahid · 2017-01-27T20:49:59Z

I'm not happy with that test either, but I don't know if there's a better way to test performance regression. I'm okay with deleting it. That code doesn't change frequently to justify having the brittle tests.

flavorjones · 2017-01-27T22:12:04Z

Roger, gonna skip it for all platforms.

flavorjones · 2017-01-27T22:23:20Z

See 2d2893a

jvshahid added a commit that referenced this issue Jul 15, 2013

fix issue #741.

e03e5c7

This was referenced Nov 19, 2013

Make xpath on JRuby faster #1004

Closed

#xpath performance #760

Closed

jvshahid closed this as completed in 2e2b204 Nov 26, 2013

peoplesmeat mentioned this issue Apr 3, 2018

JRuby XPATH Memory Usage #1749

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

xpath() around 10x slower in JRuby #741

xpath() around 10x slower in JRuby #741

ghostganz commented Aug 1, 2012

jvshahid commented Aug 1, 2012

brianjriddle commented Aug 2, 2012

ghostganz commented Aug 2, 2012

jvshahid commented Aug 2, 2012

jvshahid commented Aug 2, 2012

yokolet commented Aug 2, 2012

jvshahid commented Aug 2, 2012

yokolet commented Aug 3, 2012

jvshahid commented Sep 13, 2012

evanworley commented Oct 4, 2012

jrochkind commented Jul 9, 2013

flavorjones commented Jul 9, 2013

flavorjones commented Jul 9, 2013

ghostganz commented Jul 10, 2013

jvshahid commented Jul 15, 2013

ryanmt commented Nov 15, 2013

flavorjones commented Nov 18, 2013

knu commented Apr 9, 2014

flavorjones commented Apr 9, 2014

knu commented Apr 9, 2014

jvshahid commented Apr 9, 2014

flavorjones commented Apr 9, 2014

jvshahid commented Apr 9, 2014

knu commented Apr 10, 2014

flavorjones commented Jan 27, 2017

jvshahid commented Jan 27, 2017

flavorjones commented Jan 27, 2017

flavorjones commented Jan 27, 2017

xpath() around 10x slower in JRuby #741

xpath() around 10x slower in JRuby #741

Comments

ghostganz commented Aug 1, 2012

jvshahid commented Aug 1, 2012

brianjriddle commented Aug 2, 2012

ghostganz commented Aug 2, 2012

jvshahid commented Aug 2, 2012

jvshahid commented Aug 2, 2012

yokolet commented Aug 2, 2012

jvshahid commented Aug 2, 2012

yokolet commented Aug 3, 2012

jvshahid commented Sep 13, 2012

evanworley commented Oct 4, 2012

jrochkind commented Jul 9, 2013

flavorjones commented Jul 9, 2013

flavorjones commented Jul 9, 2013

ghostganz commented Jul 10, 2013

jvshahid commented Jul 15, 2013

ryanmt commented Nov 15, 2013

flavorjones commented Nov 18, 2013

knu commented Apr 9, 2014

flavorjones commented Apr 9, 2014

knu commented Apr 9, 2014

jvshahid commented Apr 9, 2014

flavorjones commented Apr 9, 2014

jvshahid commented Apr 9, 2014

knu commented Apr 10, 2014

flavorjones commented Jan 27, 2017

jvshahid commented Jan 27, 2017

flavorjones commented Jan 27, 2017

flavorjones commented Jan 27, 2017