Class AbstractCharStreamFilter

  • All Implemented Interfaces:
    IXMLConfigurable, IDocumentFilter, IOnMatchFilter, IImporterHandler
    Direct Known Subclasses:
    AbstractStringFilter

    public abstract class AbstractCharStreamFilter
    extends AbstractDocumentFilter

    Base class for filters dealing with the body of text documents only. Subclasses can safely be used as either pre-parse or post-parse handlers restricted to text documents only (see AbstractImporterHandler).

    When used as a pre-parse handler, this class uses the detected or previously set content character encoding unless the character encoding was specified using setSourceCharset(String). Since document parsing converts content to UTF-8, UTF-8 is always assumed when used as a post-parse handler.

    XML configuration usage:

    
    sourceCharset="(character encoding)"
      onMatch="[include|exclude]"

    Subclasses inherit the above IXMLConfigurable attribute(s), in addition to <restrictTo>.

    Since:
    2.0.0
    Author:
    Pascal Essiembre