Extending the XMLWorker
Depending on the nature of your XML file, you can either write your own Pipeline implementations, or you can extend the HtmlPipeline by adding your own TagProcessor classes.
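To make the two options concrete, here is a minimal, self-contained sketch of the idea in plain Java. It does not use the XML Worker library: every type in it (SketchPipeline, SketchHtmlPipeline, SketchTagProcessor, ExtensionSketch) is a hypothetical stand-in invented for illustration, and the real Pipeline, HtmlPipeline and TagProcessor types have different, richer signatures. The sketch only shows the difference between replacing the whole processing step and plugging a new tag handler into an existing one.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical demo class; none of these types belong to the XML Worker API.
class ExtensionSketch {

    // Option 1: write your own pipeline that decides how every tag is handled.
    static String viaCustomPipeline() {
        SketchPipeline custom = (tagName, content) -> "[" + tagName + "] " + content;
        return custom.handle("note", "hello");
    }

    // Option 2: keep the existing HTML-oriented pipeline and add a processor for a new tag.
    static String viaTagProcessor() {
        SketchHtmlPipeline pipeline = new SketchHtmlPipeline();
        pipeline.addProcessor("shout", content -> content.toUpperCase(Locale.ROOT) + "!");
        return pipeline.handle("p", "hello") + pipeline.handle("shout", "hello");
    }

    public static void main(String[] args) {
        System.out.println(viaCustomPipeline());
        System.out.println(viaTagProcessor());
    }
}

// Stand-in for a pipeline: receives a tag and its content, returns the output.
interface SketchPipeline {
    String handle(String tagName, String content);
}

// Stand-in for a tag processor: handles the content of one kind of tag.
interface SketchTagProcessor {
    String process(String content);
}

// Stand-in for an HTML pipeline that dispatches each tag to a registered processor.
class SketchHtmlPipeline implements SketchPipeline {
    private final Map<String, SketchTagProcessor> processors = new HashMap<>();

    SketchHtmlPipeline() {
        // Built-in behaviour for a tag this sketch already "understands".
        processors.put("p", content -> content + "\n");
    }

    // Extension point: register (or override) the processor for a tag name.
    void addProcessor(String tagName, SketchTagProcessor processor) {
        processors.put(tagName, processor);
    }

    @Override
    public String handle(String tagName, String content) {
        SketchTagProcessor processor = processors.get(tagName);
        // Tags without a registered processor pass through unchanged in this sketch.
        return processor == null ? content : processor.process(content);
    }
}
```

In this model, the second option is the lighter one when your input is mostly HTML plus a few custom tags, because the existing handling of known tags stays untouched.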
Let's start by extending the HtmlPipeline.