Detecting empty elements with XML::Reader #262

jfieber · 2010-04-22T15:41:38Z

The Reader pull-parser doesn't expose a way to determine if an element is empty when there is no separate end tag (that is, a self-closing tag), since no end element is encountered (or synthesized). I presume some call to xmlTextReaderIsEmptyElement is in order.

flavorjones · 2010-04-22T18:06:38Z

Can you please write up a (failing) unit test expressing exactly what you'd like to see? Linking to a gist from this ticket would be perfect.

flavorjones · 2010-05-04T11:08:26Z

Ping: jfieber -- is this still needed?

jfieber · 2010-05-04T13:58:37Z

Yes, I was pulled in another direction for a while but can get back to this.

jfieber · 2010-05-04T14:44:14Z

I've dug into this and clumsy handling of empty elements appears to be a "feature" of the libxml2 reader. See http://xmlsoft.org/xmlreader.html#Extracting

Exposing IsEmptyElement via nokogiri would be a nice-to-have -- to know that an end element won't be in the pipeline for a given start element -- but empty elements can be identified by the depth of whatever follows the start element.

So, exposing the IsEmptyElement is a wishlist item, but not a bug.

flavorjones · 2010-05-04T15:06:08Z

Can you please write up a (failing) unit test expressing exactly what you'd like to see? Linking to a gist from this ticket would be perfect.

vladzloteanu · 2010-05-05T17:16:59Z

I also believe this would be nice to have: http://stackoverflow.com/questions/2775307/nokogiri-pull-parser-nokogirixmlreader-issue-with-self-closing-tag. Basically, when using the Reader#read method, I would like to know whether I'm on a <tag> or on a <tag/> node.

flavorjones · 2010-05-06T10:54:12Z

I'm glad everyone agrees this is a nice feature to have. High five!

P.S. Can you please write up a (failing) unit test expressing exactly what you'd like to see? Linking to a gist from this ticket would be perfect.

P.P.S. Srsly, peepul. Failing unit test.

vladzloteanu · 2010-05-06T12:07:13Z

http://gist.github.com/392060

flavorjones · 2010-05-06T13:05:07Z

Awesome! Thank you.

What would everyone think about making the method name be "empty?" instead of "empty_element?" ? The precedent is in Nokogiri::HTML::ElementDescription (as well as String, Array, Hash, etc.). Plus it means less typing. :)

vladzloteanu · 2010-05-06T13:15:43Z

You're right. I'm OK with it. Q: because it's more generic, what will it return for other node types (false or undefined)? E.g., is </tag> empty?

flavorjones · 2010-05-06T19:44:27Z

Hum. Maybe we should go for a more semantic method name, like self_closing??

johndouthat · 2010-05-06T21:44:37Z

I second self_closing? .

tenderlove · 2010-06-23T20:03:29Z

adding Reader#empty_element? and Reader#self_closing? closed by 52a2473

This issue was closed.
