Stack overflow when parsing huge XML file


  • Type: Defect
  • Status: Resolved
  • Priority: Major
  • Resolution: Completed
  • Affects Version/s: None
  • Fix Version/s: None
  • Component/s: None
  • Labels:
  • Environment:
    OS X
  • Patch:
    Code and Test


This is using Ryan Senior's new 0.0.3-SNAPSHOT.

While trying to parse a huge XML file (7.5 GB compressed, a dump of Wikipedia), I got a stack overflow error. Some digging turned up this bug:

Modifying the code to disable the IS_COALESCING property got rid of the error.

The old lazy-xml contrib code worked, although it used far more memory.

Attached is a patch that adds keyword options to source-seq, parse, and parse-str, allowing the consumer to disable coalescing and sidestep the upstream bug.
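
For reference, IS_COALESCING is a standard StAX property on javax.xml.stream.XMLInputFactory. The following is only a rough Java sketch of turning it off at the StAX level; it is not the attached patch and does not show the keyword options the patch adds. The class name CoalescingSketch and the helper readText are made up for illustration. With coalescing off, a reader may deliver one run of text as several character events, so the consumer has to join them itself.

```java
import java.io.StringReader;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLStreamConstants;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamReader;

// Hypothetical illustration class; not part of any library.
final class CoalescingSketch {

    // Reads all character data from the given XML string, joining the
    // individual text events by hand so the result does not depend on
    // whether the parser coalesces adjacent character data.
    static String readText(String xml, boolean coalescing) {
        XMLInputFactory factory = XMLInputFactory.newInstance();
        // Standard StAX property; false asks the parser not to merge
        // adjacent character data into a single event.
        factory.setProperty(XMLInputFactory.IS_COALESCING, coalescing);

        StringBuilder text = new StringBuilder();
        try {
            XMLStreamReader reader =
                factory.createXMLStreamReader(new StringReader(xml));
            try {
                while (reader.hasNext()) {
                    int event = reader.next();
                    if (event == XMLStreamConstants.CHARACTERS
                            || event == XMLStreamConstants.CDATA
                            || event == XMLStreamConstants.SPACE) {
                        text.append(reader.getText());
                    }
                }
            } finally {
                reader.close();
            }
        } catch (XMLStreamException e) {
            throw new IllegalStateException("XML parsing failed", e);
        }
        return text.toString();
    }

    public static void main(String[] args) {
        String xml = "<r>a &amp; b</r>";
        System.out.println(readText(xml, false));
        System.out.println(readText(xml, true));
    }
}
```

Both calls should yield the same joined text; what differs is how many events the parser emits along the way.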


Alan Malloy made changes -
Field Original Value New Value
Assignee Alan Malloy [ amalloy ] Ryan Senior [ ryansenior ]
Ryan Senior made changes -
Status Open [ 1 ] In Progress [ 3 ]
Ryan Senior made changes -
Status In Progress [ 3 ] Resolved [ 5 ]
Resolution Completed [ 1 ]




  • Created: