Devsphere.com


com.devsphere.xml.saxdomix
Class SDXBuilder

java.lang.Object
  |
  +--org.xml.sax.helpers.DefaultHandler
        |
        +--com.devsphere.xml.saxdomix.SDXBuilder
All Implemented Interfaces:
org.xml.sax.ContentHandler, org.xml.sax.DTDHandler, org.xml.sax.EntityResolver, org.xml.sax.ErrorHandler, org.xml.sax.ext.LexicalHandler
Direct Known Subclasses:
BuilderSupport

public class SDXBuilder
extends org.xml.sax.helpers.DefaultHandler
implements org.xml.sax.ext.LexicalHandler

This class provides the mixed SAX 2.0 - DOM Level 2 parsing functionality. It needs JAXP 1.1 support. TrAX support isn't necessary.

Each builder object needs from the application a SAX content handler (org.xml.sax.ContentHandler) and a controller (com.devsphere.xml.saxdomix.SDXController). Any builder also has an internal wantDOM flag.

The application receives the SAX events via the content handler as long as the wantDOM flag is false. During the SAX parsing, the builder object invokes the controller's wantDOM() method for each startElement() parsing event. If wantDOM() returns false the SAX parsing mode continues. When wantDOM() returns true the value of the internal wantDOM flag is switched to true and the builder starts creating a DOM sub-tree from the SAX events, which aren't forwarded to the application anymore.

The building of the sub-tree ends after the endElement() event that corresponds to the startElement() event whose associated wantDOM() call returned true. The DOM sub-tree, whose root is an org.w3c.dom.Element object is passed to the controller's handleDOM() method. After this, the wantDOM flag is switched automatically to false re-enabling the SAX parsing. The application will receive SAX events until the controller's wantDOM() returns true for another element or the end of the parsed document is reached.

A DOM sub-tree is made of five types of DOM objects:

These DOM objects are created with a helper object. In most cases you'll let the framework to use its own DefaultSDXHelper, but you may provide your own implementation of the SDXHelper interface, which is declared in the com.devsphere.xml.saxdomix.helpers package.

The entity references are expanded.

If the class of the content handler object implements org.xml.sax.ext.LexicalHandler, the application will also receive comment(), startCDATA() and endCDATA() SAX lexical events during the SAX parsing mode.

In conclusion, the builder is initially in SAX parsing mode, which means that SAX events are forwarded to a content handler provided by the application. During the SAX parsing, the wantDOM() method of a controller object is called for each element start. When wantDOM() returns true, the builder retains the SAX events and uses them to build a DOM sub-tree, which is then passed for handling to the handleDOM() method of the controller object. During the DOM building, the wantDOM() method is NOT called. The builder switches to the SAX parsing mode automatically when the DOM sub-tree is completed. This process is repeated until the end of the parsed document, which means that multiple DOM sub-trees can be constructed during the parsing of a single XML document. All SAX events that aren't used to build the sub-trees are passed to the application's content handler.

SDXBuilder extends the DefaultHandler SAX helper class but overrides only the methods of the ContentHandler SAX interface. It also implements the LexicalHandler SAX extension. A SAX parser is needed in order to feed the builder with SAX events. Some of these events will be forwarded to the application's content handler and the others will be used to build the DOM sub-trees as described above.

There are two ways to use SDXBuilder: you may pass it to the parse() method of a JAXP SAXParser or you may register it as content handler to a SAX XMLReader and then call its parse() method. In the second case you could create a JAXP SAXParser and then obtain the SAX XMLReader using the getXMLReader() method of the SAXParser object.

Usage 1:

    org.xml.sax.ContentHandler handler = ...
    SDXController controller = ...
    org.xml.sax.helpers.DefaultHandler builder
        = new SDXBuilder(handler, controller);
    javax.xml.parsers.SAXParser saxParser = ...
    saxParser.getXMLReader().setProperty(
        "http://xml.org/sax/properties/lexical-handler", builder);
    java.io.InputStream in = ...
    saxParser.parse(in, builder);

If the default error handling or entity resolving mechanisms don't suit the needs of your application or you need information about unparsed entities and notations, then you should subclass SDXBuilder and override the methods defined by the ErrorHandler, EntityResolver and DTDHandler SAX interfaces.

Usage 2:

    org.xml.sax.ContentHandler handler = ...
    SDXController controller = ...
    org.xml.sax.ContentHandler builder
        = new SDXBuilder(handler, controller);
    org.xml.sax.XMLReader xmlReader = ...
    xmlReader.setContentHandler(builder);
    xmlReader.setProperty(
        "http://xml.org/sax/properties/lexical-handler", builder);
    xmlReader.setErrorHandler(...);
    org.xml.sax.InputSource in = ...
    xmlReader.parse(in);

In this case, you should always register an error handler. (If you don't have one then register a DefaultHandler as error handler.)

If the default entity resolving mechanism doesn't suit the needs of your application and you need information about unparsed entities and notations, then register an entity resolver and a DTD handler to the XML reader.

See Also:
SDXController, SDXHelper, DefaultSDXHelper

Field Summary
protected  SDXController controller
          The controller provided by the application
protected  org.xml.sax.ContentHandler handler
          The SAX content handler provided by the application
protected  SDXHelper helper
          The helper object used as factory for DOM nodes
protected  boolean inCDATA
          A flag indicating that we are inside a CDATA section
protected  org.xml.sax.ext.LexicalHandler lexicalHandler
          A cast to the content handler or null if its class doesn't implement LexicalHandler
protected  boolean newCDATA
          A flag indicating that we just entered inside a CDATA section
protected  ElementStack stack
          A stack of DOM Element nodes
protected  boolean wantDOM
          The flag that indicates the current parsing mode (false means SAX; true means DOM)
 
Constructor Summary
SDXBuilder(org.xml.sax.ContentHandler handler, SDXController controller)
          Creates a builder that forwards the SAX events to the given handler, switches to DOM when the wantDOM() method of the given controller returns true and also passes the DOM sub-trees to the controller's handleDOM() method.
SDXBuilder(org.xml.sax.ContentHandler handler, SDXController controller, SDXHelper helper)
          Creates a builder that forwards the SAX events to the given handler, switches to DOM when the wantDOM() method of the given controller returns true and also passes the DOM sub-trees to the controller's handleDOM() method.
 
Method Summary
 void characters(char[] ch, int start, int length)
          Calls the method with the same name of the handler object or creates a DOM character data node depending on the value of the wantDOM flag.
 void comment(char[] ch, int start, int length)
          Calls the method with the same name of the handler object or creates a DOM Comment node depending on the value of the wantDOM flag.
 void endCDATA()
          Calls the method with the same name of the handler object if the value of the wantDOM flag if false.
 void endDocument()
          Calls the method with the same name of the handler object.
 void endDTD()
          Does nothing.
 void endElement(java.lang.String namespaceURI, java.lang.String localName, java.lang.String qualifiedName)
          Calls the method with the same name of the handler object or passes the DOM sub-trees to the controller's handleDOM() method depending on the value of the wantDOM flag.
 void endEntity(java.lang.String name)
          Does nothing.
 void processingInstruction(java.lang.String target, java.lang.String data)
          Calls the method with the same name of the handler object or creates a DOM ProcessingInstruction node depending on the value of the wantDOM flag.
 void setDocumentLocator(org.xml.sax.Locator locator)
          Calls the method with the same name of the handler object passing the given locator as parameter.
 void startCDATA()
          Calls the method with the same name of the handler object if the value of the wantDOM flag if false.
 void startDocument()
          Calls the method with the same name of the handler object and initializes some of the fields of this object.
 void startDTD(java.lang.String name, java.lang.String publicId, java.lang.String systemId)
          Does nothing.
 void startElement(java.lang.String namespaceURI, java.lang.String localName, java.lang.String qualifiedName, org.xml.sax.Attributes attributes)
          Calls the method with the same name of the handler object or creates a DOM Element node depending on the value of the wantDOM flag.
 void startEntity(java.lang.String name)
          Does nothing.
 
Methods inherited from class org.xml.sax.helpers.DefaultHandler
endPrefixMapping, error, fatalError, ignorableWhitespace, notationDecl, resolveEntity, skippedEntity, startPrefixMapping, unparsedEntityDecl, warning
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

handler

protected org.xml.sax.ContentHandler handler
The SAX content handler provided by the application


controller

protected SDXController controller
The controller provided by the application


lexicalHandler

protected org.xml.sax.ext.LexicalHandler lexicalHandler
A cast to the content handler or null if its class doesn't implement LexicalHandler


helper

protected SDXHelper helper
The helper object used as factory for DOM nodes


wantDOM

protected boolean wantDOM
The flag that indicates the current parsing mode (false means SAX; true means DOM)


stack

protected ElementStack stack
A stack of DOM Element nodes


newCDATA

protected boolean newCDATA
A flag indicating that we just entered inside a CDATA section


inCDATA

protected boolean inCDATA
A flag indicating that we are inside a CDATA section

Constructor Detail

SDXBuilder

public SDXBuilder(org.xml.sax.ContentHandler handler,
                  SDXController controller)
Creates a builder that forwards the SAX events to the given handler, switches to DOM when the wantDOM() method of the given controller returns true and also passes the DOM sub-trees to the controller's handleDOM() method. The created builder will use a DefaultSDXHelper to create the DOM nodes.

Parameters:
handler - The application's SAX ContentHandler
controller - The application's SDXController
Throws:
java.lang.NullPointerException - If one of the parameters is null

SDXBuilder

public SDXBuilder(org.xml.sax.ContentHandler handler,
                  SDXController controller,
                  SDXHelper helper)
Creates a builder that forwards the SAX events to the given handler, switches to DOM when the wantDOM() method of the given controller returns true and also passes the DOM sub-trees to the controller's handleDOM() method. The created builder will use the given helper to create the DOM nodes.

Parameters:
handler - The application's SAX ContentHandler
controller - The application's SDXController
helper - The application's optional SDXHelper
Throws:
java.lang.NullPointerException - If one of the parameters is null
Method Detail

setDocumentLocator

public void setDocumentLocator(org.xml.sax.Locator locator)
Calls the method with the same name of the handler object passing the given locator as parameter.

Specified by:
setDocumentLocator in interface org.xml.sax.ContentHandler
Overrides:
setDocumentLocator in class org.xml.sax.helpers.DefaultHandler
Parameters:
locator - The SAX Locator object

startDocument

public void startDocument()
                   throws org.xml.sax.SAXException
Calls the method with the same name of the handler object and initializes some of the fields of this object.

Specified by:
startDocument in interface org.xml.sax.ContentHandler
Overrides:
startDocument in class org.xml.sax.helpers.DefaultHandler
Throws:
org.xml.sax.SAXException - Error reported by the handler's method

endDocument

public void endDocument()
                 throws org.xml.sax.SAXException
Calls the method with the same name of the handler object.

Specified by:
endDocument in interface org.xml.sax.ContentHandler
Overrides:
endDocument in class org.xml.sax.helpers.DefaultHandler
Throws:
org.xml.sax.SAXException - Error reported by the handler's method

startElement

public void startElement(java.lang.String namespaceURI,
                         java.lang.String localName,
                         java.lang.String qualifiedName,
                         org.xml.sax.Attributes attributes)
                  throws org.xml.sax.SAXException
Calls the method with the same name of the handler object or creates a DOM Element node depending on the value of the wantDOM flag.

Specified by:
startElement in interface org.xml.sax.ContentHandler
Overrides:
startElement in class org.xml.sax.helpers.DefaultHandler
Parameters:
namespaceURI - The element's namespace URI
localName - The element's local name
qualifiedName - The element's qualified name
attributes - The element's attributes
Throws:
org.xml.sax.SAXException - Error reported by the handler's method

endElement

public void endElement(java.lang.String namespaceURI,
                       java.lang.String localName,
                       java.lang.String qualifiedName)
                throws org.xml.sax.SAXException
Calls the method with the same name of the handler object or passes the DOM sub-trees to the controller's handleDOM() method depending on the value of the wantDOM flag.

Specified by:
endElement in interface org.xml.sax.ContentHandler
Overrides:
endElement in class org.xml.sax.helpers.DefaultHandler
Parameters:
namespaceURI - The element's namespace URI
localName - The element's local name
qualifiedName - The element's qualified name
Throws:
org.xml.sax.SAXException - Error reported by the handler's method

characters

public void characters(char[] ch,
                       int start,
                       int length)
                throws org.xml.sax.SAXException
Calls the method with the same name of the handler object or creates a DOM character data node depending on the value of the wantDOM flag.

Specified by:
characters in interface org.xml.sax.ContentHandler
Overrides:
characters in class org.xml.sax.helpers.DefaultHandler
Parameters:
ch - The characters from the XML document
start - The start position in the array
length - The number of characters to get from the array
Throws:
org.xml.sax.SAXException - Error reported by the handler's method

processingInstruction

public void processingInstruction(java.lang.String target,
                                  java.lang.String data)
                           throws org.xml.sax.SAXException
Calls the method with the same name of the handler object or creates a DOM ProcessingInstruction node depending on the value of the wantDOM flag.

Specified by:
processingInstruction in interface org.xml.sax.ContentHandler
Overrides:
processingInstruction in class org.xml.sax.helpers.DefaultHandler
Parameters:
target - The target of the processing instruction.
data - The data of the processing instruction.
Throws:
org.xml.sax.SAXException - Error reported by the handler's method

startDTD

public void startDTD(java.lang.String name,
                     java.lang.String publicId,
                     java.lang.String systemId)
              throws org.xml.sax.SAXException
Does nothing.

Specified by:
startDTD in interface org.xml.sax.ext.LexicalHandler
Parameters:
name - The name of the document's root element.
publicId - The public identifier of the external DTD
systemId - The system identifier of the external DTD
Throws:
org.xml.sax.SAXException - Cannot be thrown since the method does nothing

endDTD

public void endDTD()
            throws org.xml.sax.SAXException
Does nothing.

Specified by:
endDTD in interface org.xml.sax.ext.LexicalHandler
Throws:
org.xml.sax.SAXException - Cannot be thrown since the method does nothing

startEntity

public void startEntity(java.lang.String name)
                 throws org.xml.sax.SAXException
Does nothing.

Specified by:
startEntity in interface org.xml.sax.ext.LexicalHandler
Parameters:
name - The name of an entity
Throws:
org.xml.sax.SAXException - Cannot be thrown since the method does nothing

endEntity

public void endEntity(java.lang.String name)
               throws org.xml.sax.SAXException
Does nothing.

Specified by:
endEntity in interface org.xml.sax.ext.LexicalHandler
Parameters:
name - The name of an entity
Throws:
org.xml.sax.SAXException - Cannot be thrown since the method does nothing

startCDATA

public void startCDATA()
                throws org.xml.sax.SAXException
Calls the method with the same name of the handler object if the value of the wantDOM flag if false. The inCDATA and newCDATA flags are set to true.

Specified by:
startCDATA in interface org.xml.sax.ext.LexicalHandler
Throws:
org.xml.sax.SAXException - Error reported by the handler's method

endCDATA

public void endCDATA()
              throws org.xml.sax.SAXException
Calls the method with the same name of the handler object if the value of the wantDOM flag if false. The inCDATA flag is set to false.

Specified by:
endCDATA in interface org.xml.sax.ext.LexicalHandler
Throws:
org.xml.sax.SAXException - Error reported by the handler's method

comment

public void comment(char[] ch,
                    int start,
                    int length)
             throws org.xml.sax.SAXException
Calls the method with the same name of the handler object or creates a DOM Comment node depending on the value of the wantDOM flag.

Specified by:
comment in interface org.xml.sax.ext.LexicalHandler
Parameters:
ch - An array holding the characters in the comment
start - The start position in the array
length - The number of characters to get from the array
Throws:
org.xml.sax.SAXException - Error reported by the handler's method

 Devsphere.com