Package org.htmlcleaner

Interface Summary
BaseToken Base token interface.
HtmlNode Marker interface denoting nodes of the document tree.
ITagInfoProvider Provides a set of TagInfo instances.
TagNode.ITagNodeCondition Used as a base for different node checkers.
TagNodeVisitor Defines the action to be performed on TagNodes.

Class Summary
BrowserCompactXmlSerializer Browser compact XML serializer - creates resulting XML by stripping whitespace wherever possible, but preserving a single whitespace character where at least one exists.
CleanerProperties Properties defining the cleaner's behaviour.
CleanerTransformations Contains transformation collection.
CommandLine Command line usage class.
CommentNode HTML comment token.
CompactHtmlSerializer Compact HTML serializer - creates resulting HTML by stripping whitespace wherever possible.
CompactXmlSerializer Compact XML serializer - creates resulting XML by stripping whitespace.
ConfigFileTagProvider Default tag provider - reads an XML file in the specified format and creates tag infos.
ContentNode HTML text token.
DefaultTagProvider This class is generated automatically from ConfigFileTagProvider, which reads the default XML configuration file with tag descriptions.
DoctypeToken HTML doctype token.
DomSerializer DOM serializer - creates an XML DOM.
EndTagToken HTML tag end token.
HtmlCleaner Main HtmlCleaner class.
HtmlCleanerForAnt Support for ANT.
HtmlSerializer Abstract HTML serializer - contains common logic for descendants.
HtmlTokenizer Main HTML tokenizer.
JDomSerializer JDOM serializer - creates an XML JDOM instance from a TagNode.
PrettyHtmlSerializer Pretty HTML serializer - creates resulting HTML with indenting lines.
PrettyXmlSerializer Pretty XML serializer - creates resulting XML with indenting lines.
Serializer Basic abstract serializer - contains common logic for descendants (the writeXXX() methods).
SimpleHtmlSerializer Simple HTML serializer - creates resulting HTML without indenting and/or compacting.
SimpleXmlSerializer Simple XML serializer - creates resulting XML without indenting lines.
SpecialEntity This class contains a map of special entities used in HTML and their Unicode code points.
TagInfo Contains information about a single HTML tag, including the rules for tag balancing.
TagNode XML node tag - basic node of the cleaned HTML tree.
TagToken HTML tag token - descendants are the start token (TagNode) and the end token (EndTagToken).
TagTransformation Describes how a specified tag is transformed to another one, or is ignored during parsing.
Utils Common utilities.
XmlSerializer Abstract XML serializer - contains common logic for descendants.
XPather Utility for searching cleaned document tree with XPath expressions.
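
The whitespace rule in the BrowserCompactXmlSerializer summary above (keep a single whitespace character wherever at least one exists) can be illustrated with a small standalone sketch. This is not the library's implementation and uses no org.htmlcleaner classes. The class name WhitespaceCompaction and both method names are hypothetical, and treating "stripping" as removal of leading and trailing whitespace is an assumption made only for contrast.

```java
import java.util.regex.Pattern;

// Hypothetical illustration only; not part of org.htmlcleaner.
public class WhitespaceCompaction {
    // One or more consecutive whitespace characters (spaces, tabs, newlines).
    private static final Pattern WS_RUN = Pattern.compile("\\s+");

    // Replace every run of whitespace with exactly one space, so a single
    // whitespace character survives wherever at least one existed.
    public static String collapseRuns(String text) {
        return WS_RUN.matcher(text).replaceAll(" ");
    }

    // Assumed contrast case: collapse runs, then also drop the leading and
    // trailing whitespace entirely.
    public static String stripEdges(String text) {
        return collapseRuns(text).trim();
    }

    public static void main(String[] args) {
        String raw = "  Hello,\n\t world  ";
        System.out.println("[" + collapseRuns(raw) + "]");
        System.out.println("[" + stripEdges(raw) + "]");
    }
}
```

With the sample input, collapseRuns keeps one space at each edge and between the two words, while stripEdges keeps only the space between the words.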

Exception Summary
HtmlCleanerException General HtmlCleaner runtime exception.
XPatherException Exception that can occur during XPather evaluation.

Copyright © 2006-2011. All Rights Reserved.