Skip to content

A simple toolkit to take garbage HTML tagsoup pages and generate well-formed XHTML5 output

License

Notifications You must be signed in to change notification settings

UVicHCMC/rescueTagSoup

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

34 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

rescueTagSoup

A simple toolkit to take garbage HTML tagsoup pages and generate well-formed XHTML5 output

A common task for us these days is to take a site created and hosted on a CMS system such as WordPress, or generated from a CMS or authoring system, and turn it into valid static XHTML5 suitable for long-term hosting on a plain web server. This toolkit will provide process a scrape of the original site and generate something as close to valid XHTML5 as possible, through a collection of fixes known to be commonly required, reducing the remaining manual fixes to a minimum.

About

A simple toolkit to take garbage HTML tagsoup pages and generate well-formed XHTML5 output

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published