Improve XML Scanner performance #444

angelozerr · 2019-06-17T07:50:07Z

In XML Scanner, it seems that consumed CPU time is spent in 3 regex to extract element name, attribute name and attribute value. This issue is to change just thoses regexp into a Java code to improve performance.

After testing that, a large file like nasa.xml is parsed 2-3 times faster

Fix #444 This PR improve XMLScanner performance by replacing regex with java code for the 3 regexp which are the most used (element name, attribute name, attribute value). After testing that, a large file like nasa.xml is parsed 2-3 times faster. You can see this time when you start XMLScannerPerformance and DOMParserPerformance. Signed-off-by: azerr <[email protected]>

angelozerr self-assigned this Jun 17, 2019

angelozerr added the performance This issue or enhancement is related to performance concerns label Jun 17, 2019

angelozerr mentioned this issue Jun 17, 2019

Improve XMLScanner performance #445

Merged

angelozerr added this to the v0.8.0 milestone Jun 17, 2019

angelozerr closed this as completed in #445 Jun 18, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve XML Scanner performance #444

Improve XML Scanner performance #444

angelozerr commented Jun 17, 2019 •

edited

Loading

Improve XML Scanner performance #444

Improve XML Scanner performance #444

Comments

angelozerr commented Jun 17, 2019 • edited Loading

angelozerr commented Jun 17, 2019 •

edited

Loading