Skip to content

Commit

Permalink
parser pull: Add support for reusing parser (#220)
Browse files Browse the repository at this point in the history
GitHub: Fix GH-214 

This is for parsing XML documents stream. We can use one parser to parse
multiple XML documents with this feature.

Co-authored-by: Dmitry Pogrebnoy <[email protected]>
  • Loading branch information
DmitryPogrebnoy and Dmitry Pogrebnoy authored Nov 8, 2024
1 parent ed9168e commit 20562ec
Show file tree
Hide file tree
Showing 3 changed files with 41 additions and 0 deletions.
4 changes: 4 additions & 0 deletions lib/rexml/parsers/baseparser.rb
Original file line number Diff line number Diff line change
Expand Up @@ -181,6 +181,10 @@ def add_listener( listener )

def stream=( source )
@source = SourceFactory.create_from( source )
reset
end

def reset
@closed = nil
@have_root = false
@document_status = nil
Expand Down
4 changes: 4 additions & 0 deletions lib/rexml/parsers/pullparser.rb
Original file line number Diff line number Diff line change
Expand Up @@ -93,6 +93,10 @@ def pull
def unshift token
@my_stack.unshift token
end

def reset
@parser.reset
end
end

# A parsing event. The contents of the event are accessed as an +Array?,
Expand Down
33 changes: 33 additions & 0 deletions test/test_pullparser.rb
Original file line number Diff line number Diff line change
Expand Up @@ -156,6 +156,39 @@ def test_peek
assert_equal( 0, names.length )
end

def test_reset
xml_chunks = [
"<message>First valid and complete message</message>",
"<message>Second valid and complete message</message>",
"<message>Third valid and complete message</message>"
]

messages = []

IO.pipe do |reader, writer|
xml_chunks.each do |chunk|
writer.write(chunk)
end
writer.close

parser = REXML::Parsers::PullParser.new(reader)
while parser.has_next?
parser.pull
message_text = parser.pull
messages << message_text[0]
parser.pull
parser.reset
end
end

assert_equal(
messages,
["First valid and complete message",
"Second valid and complete message",
"Third valid and complete message"]
)
end

class EntityExpansionLimitTest < Test::Unit::TestCase
class GeneralEntityTest < self
def test_have_value
Expand Down

0 comments on commit 20562ec

Please sign in to comment.