HTML/XML escaping/unescaping


#1

hi,

while trying to display some RSS feed I got some escaped characters like & < … but also … &ldquo … etc

In juce, I’ve only found the escapeIllegalXmlChars and readEntity so far.
but they only deal with XML reserved characters, not HTML.

I’ve looked for HTML translation tables but didn’t manage to find a complete solution.

Links and suggestions are welcome.


#2

This Java code has quite a list, not sure if it’s comprehensive:

http://www.koders.com/java/fidE0475B82B2CE9F01137C34F0795138B995F9D198.aspx

…it does have … and “ though.