RTidyHTML

RTidyHTML_0.2-1.tar.gz (18 Jul 2013)

This package interfaces to the libtidy library for "correcting" HTML documents that are not well-formed. This library corrects common errors in HTML documents and

This makes better HTML documents, but it also improves the processing of these documents with tools such as htmlParse() (in the XML package) and XPath queries.

At present the package provides a single function - tidyHTML() - that accepts either the name of an HTML file or the contents of an HTML document. It returns the corrected document and can also return a description of the problems it contained.


Duncan Temple Lang <duncan@wald.ucdavis.edu>
Last modified: Tue Oct 13 16:32:57 PDT 2009