Repair old HTML content
Some old (e.g. 2004 era) HTML content contains corrupted HTML, where the quotes have been HTML-encoded, for example:
<a href="https://www.indybay.org/">
instead of <a href="https://www.indybay.org/">
When sanitized by the HTML Purifier library, such links are broken (pointing at the page you are on, rather than the linked URL).
We might be able to write a script to repair the markup?