OutputStream::writeText breaks text if asUTF16 is set to true

RolandMR · November 15, 2025, 4:45pm

OutputStream::writeText breaks text if asUTF16 is set to true

        for (;;)
        {
            auto c = src.getAndAdvance();

            if (c == 0)
                break;

            if (! writeShort ((short) c))
                return false;
        }

You can’t just cast a wchar_t to a short and call that utf16. There are utf16 conversion functions in juce that could be used.

reuk · November 19, 2025, 12:44pm

Thanks for reporting, that should be fixed here:

RolandMR · November 19, 2025, 1:46pm

Thanks. String::createStringFromData needs to be updated as well.

            for (int i = 0; i < numChars; ++i)
                builder.write ((juce_wchar) ByteOrder::swapIfBigEndian (src[i]));

Each utf16 pair is getting converted to a wchar, then when encoded to utf8 it’s actually becoming cesu8 instead. Seems that cesu8 ‘works’ on macOS because it print’s to the terminal ok.

My unit test looks like this:

    void testOverwriteWithTextUnicode()
    {
        beginTest ("Overwrite with Text - Unicode");

        auto tempFile = juce::File::getSpecialLocation (juce::File::tempDirectory)
            .getChildFile ("gin_test_" + juce::String::toHexString (juce::Random::getSystemRandom().nextInt()) + ".txt");

        auto testText = juce::String::fromUTF8 ("Hello 世界! Emoji: 🎵");

        expect (overwriteWithText (tempFile, testText, true, true, nullptr),
                "Should write Unicode text successfully");
        expect (tempFile.existsAsFile(), "File should exist");

        juce::String readBack = tempFile.loadFileAsString();
        DBG(testText);
        DBG(readBack);
        expectEquals (readBack, testText, "Unicode text should be preserved");

        tempFile.deleteFile();
    }

In the terminal I see:

Hello 世界! Emoji: 🎵
Hello 世界! Emoji: 🎵

But when I inspect the strings in the debugger I see:

(lldb) p a
(juce::String) $0 = {
  text = (data = "Hello 世界! Emoji: \xed\xa0\xbc\xed\xbe\xb5")
}
(lldb) p b
(juce::String) $1 = {
  text = (data = "Hello 世界! Emoji: 🎵")
}

reuk · November 20, 2025, 1:15pm

Thanks again, that should be fixed here:

Topic		Replies	Views
String::createStringFromData is really slow for unicode General JUCE discussion	2	355	May 12, 2017
JUCE bugs out when reading unicode files General JUCE discussion	4	156	March 5, 2026
Unicode Rendering Bug (Mac & Windows) General JUCE discussion	6	1100	February 23, 2022
Build failed with JUCE_STRING_UTF_TYPE=16 General JUCE discussion	3	584	December 21, 2015
juce_CharPointer_UTF16.h! Windows	12	591	January 28, 2011

OutputStream::writeText breaks text if asUTF16 is set to true

Purchase

Discover

Learn

Support

About

Events

OutputStream::writeText breaks text if asUTF16 is set to true

Related topics

Purchase

Discover

Learn

Support

About

Events