If you write to a file with asUnicode set to true (using File::replaceWithText() or similar) and then read from the file later, JUCE misbehaves.
When reading from the affected file in any manner (File::loadFileAsString(), File::readLines(), FileInputStream::readEntireStreamAsString(), etc.), only the first character is returned.
The only way I have found to get the whole file is to loop with FileInputStream::readNextLine(), but that returns one character per call instead of whole lines.
Is there a different way to read from files that supports Unicode?
The name of the asUnicode parameter is misleading: it actually causes the file to be written as UTF-16. If you omit the byte-order mark by passing false for the third parameter, loadFileAsString() has no way of determining the file's text encoding and will assume UTF-8. I recommend ensuring that the asUnicode and writeHeaderBytes arguments match. This should produce the expected results.
I see, thanks! But is there a reason that asUnicode and writeHeaderBytes are separate parameters? I can’t think of a use-case that would need one but not the other.
In the case of appendText(), the file might already exist, in which case it's reasonable to want to append more UTF-16 text to the end of the file without including the header bytes (the byte-order mark).
I’m not sure about replaceWithText(), but I can imagine situations where a JUCE program must write a UTF-16 file that doesn’t include a byte-order mark, perhaps to interoperate with another program that always assumes a UTF-16 encoding for input files.