apache spark - Parse complex XML file to get an object and its fields
Is there a way to parse a complex XML file so that a child element's object fields/members are accessible in one read? I'm working on code that was passed to me. It uses dom4j, and the developer did a separate read for each node: for the object's fields and members, it reads the file 4 or 5 times, once for each one. To build the object, do I have to make all these calls? It takes 15 seconds to run. Maybe dom4j can get more than one element in one read of the file? Is JAXB better suited to that, giving a reference to the object and its fields in one read? Is Spark or Hadoop better/faster performance-wise? As you can see, I'm looking to read the XML file and extract an element, its child elements, etc. to build objects in one read. Let me know what would work best!

Since dom4j reads the entire file and builds an object graph in memory, there shouldn't be any need to "read" the file more than once. On the other hand, if you have a schema for the type of document you are processing, JAXB is a better alternative.
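To illustrate the "parse once, then navigate the in-memory tree" idea, here is a minimal sketch. It uses the JDK's built-in DOM parser rather than dom4j so that it runs without extra dependencies; with dom4j the single parse would be the `SAXReader.read(...)` call, followed by navigation from the returned `Document`. The `Order` class, the element and attribute names (`order`, `id`, `customer`, `item`) and the sample XML are all hypothetical, made up for the example, and not taken from the original code.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

public class SingleParseSketch {

    // Hypothetical target object; the fields are illustrative only.
    static final class Order {
        final String id;
        final String customer;
        final List<String> items;

        Order(String id, String customer, List<String> items) {
            this.id = id;
            this.customer = customer;
            this.items = items;
        }
    }

    // Parses the XML exactly once, then fills every field from the in-memory tree.
    static Order parseOrder(String xml) throws Exception {
        DocumentBuilder builder = DocumentBuilderFactory.newInstance().newDocumentBuilder();
        // The only parse: after this call the whole document is held in memory.
        Document doc = builder.parse(
                new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
        Element root = doc.getDocumentElement();

        // Everything below walks the tree already in memory; nothing is re-read.
        String id = root.getAttribute("id");
        String customer = root.getElementsByTagName("customer").item(0).getTextContent();

        List<String> items = new ArrayList<>();
        NodeList itemNodes = root.getElementsByTagName("item");
        for (int i = 0; i < itemNodes.getLength(); i++) {
            items.add(itemNodes.item(i).getTextContent());
        }
        return new Order(id, customer, items);
    }

    public static void main(String[] args) throws Exception {
        String xml = "<order id=\"42\"><customer>Ada</customer>"
                + "<items><item>pen</item><item>ink</item></items></order>";
        Order order = parseOrder(xml);
        System.out.println(order.id + " " + order.customer + " " + order.items);
        // prints: 42 Ada [pen, ink]
    }
}
```

If the existing code calls the reader once per field, moving that call out so the resulting document is parsed once and passed to each field extractor should give the same effect.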