Parser that uses PHP 5's DOM extension (part of the core).
In PHP 5, the DOM XML extension was revamped into DOM and added to the core. It gives us a forgiving HTML parser, which we use to transform the HTML into a DOM, and then into the tokens. It is blazingly fast (for large documents, it performs twenty times faster than HTMLPurifier_Lexer_DirectLex,and is the default choice for PHP 5.
Located in /lib/core/Parsers/htmlpurifier/HTMLPurifier.standalone.php (line 13564)
HTMLPurifier_Lexer | --HTMLPurifier_Lexer_DOMLex
| Class | Description |
|---|---|
| HTMLPurifier_Lexer_PH5P | Experimental HTML5-based parser using Jeroen van der Meer's PH5P library. |
Inherited from HTMLPurifier_Lexer
HTMLPurifier_Lexer::$tracksLineNumbers
HTMLPurifier_Lexer::$_special_entity2str
Callback function that entity-izes ampersands in comments so that
callbackUndoCommentSubst doesn't clobber them
Callback function for undoing escaping of stray angled brackets
in comments
An error handler that mutes all errors
Recursive function that tokenizes a node, putting it into an accumulator.
Converts a DOMNamedNodeMap of DOMAttr objects into an assoc array.
Wraps an HTML fragment in the necessary HTML
Inherited From HTMLPurifier_Lexer
HTMLPurifier_Lexer::__construct()
HTMLPurifier_Lexer::CDATACallback()
HTMLPurifier_Lexer::create()
HTMLPurifier_Lexer::escapeCDATA()
HTMLPurifier_Lexer::escapeCommentedCDATA()
HTMLPurifier_Lexer::extractBody()
HTMLPurifier_Lexer::normalize()
HTMLPurifier_Lexer::parseData()
HTMLPurifier_Lexer::removeIEConditional()
HTMLPurifier_Lexer::tokenizeHTML()
Documentation generated on Sun, 06 Mar 2011 00:24:10 -0500 by phpDocumentor 1.4.3