If everyone involved is happy to waive their copyright on this stuff (sure, I know these aren't huge amounts of code, but gotta be careful about that kind of thing..), then I'd be glad to merge it in!
Totally fine for me. In fact it makes me a little proud that my code becomes part of JUCE :)
Perhaps you can alter the colour settings for it to make it consistent within the different tokenizers (I don't know about the Lua tokenizer but I chose the colours quite randomly)
Thanks chaps - I've added this now, though TBH very little (maybe none!) of the original code remains after I'd cleaned it up..
The XML parser was pretty far off the mark and I just rewrote that completely, but part of the problem is that the CodeTokeniser base class has to be used statelessly, and doing XML does involves some statefulness. Anyway, I quickly hacked it into a half-decent version that works statelessly, but don't have the time/inclination to add support for tokeniser state that'd be needed to do a more complete implementation.
This is all cool stuff to have though - thanks for getting it rolling!