apache spark - Parse complex XML file to get object and its fields -
is there way parse complex xml file child element object fields/members accessible in 1 read?
i'm working on code passed me uses dom4j , developer did read each node. object fields , members, read file 4 or 5 times each one. build object have make these calls? takes 15 secs run.
maybe dom4j can more 1 element in 1 read of file?
is jaxb more that, giving reference object , fields in 1 read?
is spark or hadoop better/faster performance wise?
you see i'm looking read xml file , extract element child elements, etc build objects in 1 read.
let me know work best!
since dom4j reads entire file , builds object graph in memory, there shouldn't need "read" file more once.
on other hand, if have schema type of document processing, jaxb better alternative.
Comments
Post a Comment