Time‑stretching with JUCE + signalsmith‑stretch results in pitch drop instead of tempo change

agrex · January 17, 2026, 8:49am

Hello,
I’m trying to implement a time‑stretching feature using JUCE and signalsmith‑stretch, but I haven’t been able to get it working correctly yet.

I’ve extracted the parts of the implementation that seem relevant and included them below.
stretchRatio is a predefined value (smaller = slower, larger = faster).
My expectation is that setting stretchRatio to 2 would give “same pitch, double speed,” and setting it to 0.5 would give “same pitch, half speed.”
However, in both cases the pitch drops while the tempo stays the same.
If anyone has any insights or suggestions, I would greatly appreciate your help.

void ClipAudioSourceModel::prepareToPlay(int samplesPerBlockExpected, double sampleRate) {
    sr = sampleRate;
    if (readerSource) {
        readerSource->prepareToPlay(samplesPerBlockExpected, sampleRate);
    }
    if (resampler) {
        const double ratio = (sourceSampleRate > 0.0 && sr > 0.0) ? (sourceSampleRate / sr) : 1.0;
        resampler->setResamplingRatio(ratio);
        resampler->prepareToPlay(samplesPerBlockExpected, sampleRate);
    }

    if (!stretcher) {
        int numChannels = readerSource && readerSource->getAudioFormatReader()
            ? readerSource->getAudioFormatReader()->numChannels
            : 2;
        stretcher = std::make_unique<signalsmith::stretch::SignalsmithStretch<float>>();
        stretcher->reset();
        stretcher->presetDefault(numChannels, sampleRate);
    }
    if (stretcher) {
        stretcher->setTransposeFactor(1.0);
    }

    int numChannels = readerSource && readerSource->getAudioFormatReader()
        ? readerSource->getAudioFormatReader()->numChannels
        : 2;
    stretchInputBuffer.setSize(numChannels, samplesPerBlockExpected * 4);
    stretchOutputBuffer.setSize(numChannels, samplesPerBlockExpected * 4);
}

void ClipAudioSourceModel::getNextAudioBlock(const juce::AudioSourceChannelInfo& bufferToFill) {
    int outputSamples = bufferToFill.numSamples;
    int inputSamples = static_cast<int>(outputSamples * stretchRatio);
    
    juce::AudioSourceChannelInfo inputInfo;
    inputInfo.buffer = &stretchInputBuffer;
    inputInfo.startSample = 0;
    inputInfo.numSamples = inputSamples;

    resampler->getNextAudioBlock(inputInfo);

    int numChannels = bufferToFill.buffer->getNumChannels();
    float* const* inputChannels = stretchInputBuffer.getArrayOfWritePointers();
    float* const* outputChannels = stretchOutputBuffer.getArrayOfWritePointers();

    stretcher->process(inputChannels, inputSamples, outputChannels, outputSamples);

    for (int ch = 0; ch < numChannels; ++ch) {
        bufferToFill.buffer->copyFrom(ch, bufferToFill.startSample,
                                       stretchOutputBuffer, ch, 0, outputSamples);
    }
}

icebreakeraudio · January 17, 2026, 11:31am

This is a weird one. There’s nothing obviously wrong to me with the use of the SignalsmithStretch class from what I’m seeing here, but I’m also not seeing the whole picture.
The weird part is that you get a drop in pitch whether the ratio is 2 or 0.5, which suggests that something else is off. Are there any other artefacts in the audio?

One thing making me a little nervous is that the number of channels in bufferToFill is not checked against the number of channels in the stretchOutputBuffer, which could cause issues if the stretchOutputBuffer is mono and bufferToFill is stereo. But that could be nothing depending on the greater context of the project.

agrex · January 18, 2026, 1:41am

icebreakeraudio, thank you for your reply.

Since the copy of signalsmith-stretch had originally been added by the AI and I wasn’t sure where it came from, I re-added it via git submodule and checked again, but the issue didn’t improve.
I’m using signalsmith-stretch with tag 1.1.0, and linear is set to 0.3.0.

Are there any other artefacts in the audio?

You mean whether there are any other sources, right.

I’ll paste the entire source code here.

header

#pragma once

#include <JuceHeader.h>

#include "../../ThirdParty/signalsmith-stretch/signalsmith-stretch.h"

class ClipAudioSourceModel : public juce::PositionableAudioSource {
  public:
    ClipAudioSourceModel(std::unique_ptr<juce::AudioFormatReaderSource> readerSrc,
        const juce::String&                                             sourceName,
        double                                                          startTimeSec,
        double                                                          cropStartSec,
        double                                                          cropEndSec,
        juce::AudioFormatManager&                                       formatManager);

    void        prepareToPlay(int samplesPerBlockExpected, double sampleRate) override;
    void        releaseResources() override;
    void        getNextAudioBlock(const juce::AudioSourceChannelInfo& bufferToFill) override;
    void        setNextReadPosition(juce::int64 newPosition) override;
    juce::int64 getNextReadPosition() const override;
    juce::int64 getTotalLength() const override;
    bool        isLooping() const override;

    double              getStartTimeSec() const;
    void                setStartTimeSec(double newStartTime);
    double              getDurationSec() const;
    double              getCropStart() const;
    double              getCropEnd() const;
    void                setCropStart(double newCropStart);
    void                setCropEnd(double newCropEnd);
    const juce::String& getSourceName() const;

    juce::String getClipId() const;
    void         setClipId(const juce::String& id);

    void                  setThumbnailSource(juce::InputSource* source);
    juce::AudioThumbnail& getThumbnail() {
        return thumbnail;
    }
    const juce::AudioThumbnail& getThumbnail() const {
        return thumbnail;
    }

    double getSourceSampleRate() const;

    void setStretchRatio(double ratio);
    double getStretchRatio() const;

  private:
    std::unique_ptr<juce::AudioFormatReaderSource> readerSource;
    juce::String                                   sourceLabel;
    double                                         startTime = 0.0, cropStart = 0.0, cropEnd = 0.0;
    double                                         sr               = 44100.0;
    double                                         sourceSampleRate = 44100.0;
    juce::String                                   clipId;

    std::unique_ptr<juce::ResamplingAudioSource> resampler;
    juce::AudioThumbnailCache                    thumbnailCache;
    juce::AudioThumbnail                         thumbnail;

    double stretchRatio = 1.0;
    std::unique_ptr<signalsmith::stretch::SignalsmithStretch<float>> stretcher;
    juce::AudioBuffer<float> stretchInputBuffer;
    juce::AudioBuffer<float> stretchOutputBuffer;
};

lcapozzi · January 18, 2026, 6:47am

You are using setTransposeFactor

agrex · January 18, 2026, 7:06am

I forgot pasting source.

#include "ClipAudioSourceModel.h"

ClipAudioSourceModel::ClipAudioSourceModel(std::unique_ptr<juce::AudioFormatReaderSource> readerSrc,
    const juce::String&                                                                   sourceName,
    double                                                                                startTimeSec,
    double                                                                                cropStartSec,
    double                                                                                cropEndSec,
    juce::AudioFormatManager&                                                             formatManager)
    : readerSource(std::move(readerSrc)),
      sourceLabel(sourceName),
      startTime(startTimeSec),
      cropStart(cropStartSec),
      cropEnd(cropEndSec),
      thumbnailCache(5),
      thumbnail(512, formatManager, thumbnailCache) {
    if (readerSource && readerSource->getAudioFormatReader() != nullptr) {
        sourceSampleRate = readerSource->getAudioFormatReader()->sampleRate;
    }
    resampler = std::make_unique<juce::ResamplingAudioSource>(readerSource.get(), false, 2);
}

void ClipAudioSourceModel::prepareToPlay(int samplesPerBlockExpected, double sampleRate) {
    sr = sampleRate;
    if (readerSource) {
        readerSource->prepareToPlay(samplesPerBlockExpected, sampleRate);
    }
    if (resampler) {
        const double ratio = (sourceSampleRate > 0.0 && sr > 0.0) ? (sourceSampleRate / sr) : 1.0;
        resampler->setResamplingRatio(ratio);
        resampler->prepareToPlay(samplesPerBlockExpected, sampleRate);
    }

    if (!stretcher) {
        int numChannels = readerSource && readerSource->getAudioFormatReader()
            ? readerSource->getAudioFormatReader()->numChannels
            : 2;
        stretcher = std::make_unique<signalsmith::stretch::SignalsmithStretch<float>>();
        stretcher->reset();
        stretcher->presetDefault(numChannels, sampleRate);
    }
    if (stretcher) {
        stretcher->setTransposeFactor(1.0);
    }

    int numChannels = readerSource && readerSource->getAudioFormatReader()
        ? readerSource->getAudioFormatReader()->numChannels
        : 2;
    stretchInputBuffer.setSize(numChannels, samplesPerBlockExpected * 4);
    stretchOutputBuffer.setSize(numChannels, samplesPerBlockExpected * 4);
}

void ClipAudioSourceModel::releaseResources() {
    if (resampler)
        resampler->releaseResources();
    if (readerSource)
        readerSource->releaseResources();
}

void ClipAudioSourceModel::getNextAudioBlock(const juce::AudioSourceChannelInfo& bufferToFill) {
    if (std::abs(stretchRatio - 1.0) < 0.001) {
        DBG("[DEBUG] ClipAudioSourceModel::getNextAudioBlock No time stretch");
        if (resampler) {
            resampler->getNextAudioBlock(bufferToFill);
            return;
        }
        if (readerSource) {
            readerSource->getNextAudioBlock(bufferToFill);
            return;
        } else {
            bufferToFill.clearActiveBufferRegion();
            return;
        }
    }

    if (!stretcher || !resampler) {
        bufferToFill.clearActiveBufferRegion();
        return;
    }

    int outputSamples = bufferToFill.numSamples;
    int inputSamples = static_cast<int>(outputSamples * stretchRatio);
    
    double timeFactor = static_cast<double>(outputSamples) / static_cast<double>(inputSamples);
    DBG("[DEBUG] ClipAudioSourceModel::getNextAudioBlock - stretchRatio=" << stretchRatio 
        << ", outputSamples=" << outputSamples << ", inputSamples=" << inputSamples
        << ", timeFactor=" << timeFactor
        << " (if >1: slower, if <1: faster)");

    juce::AudioSourceChannelInfo inputInfo;
    inputInfo.buffer = &stretchInputBuffer;
    inputInfo.startSample = 0;
    inputInfo.numSamples = inputSamples;

    resampler->getNextAudioBlock(inputInfo);

    int numChannels = bufferToFill.buffer->getNumChannels();
    float* const* inputChannels = stretchInputBuffer.getArrayOfWritePointers();
    float* const* outputChannels = stretchOutputBuffer.getArrayOfWritePointers();

    stretcher->process(inputChannels, inputSamples, outputChannels, outputSamples);

    for (int ch = 0; ch < numChannels; ++ch) {
        bufferToFill.buffer->copyFrom(ch, bufferToFill.startSample,
                                       stretchOutputBuffer, ch, 0, outputSamples);
    }
}

void ClipAudioSourceModel::setNextReadPosition(juce::int64 newPosition) {
    if (readerSource)
        readerSource->setNextReadPosition(newPosition);
    if (resampler)
        resampler->flushBuffers();
}

juce::int64 ClipAudioSourceModel::getNextReadPosition() const {
    return readerSource ? readerSource->getNextReadPosition() : 0;
}

juce::int64 ClipAudioSourceModel::getTotalLength() const {
    return readerSource ? readerSource->getTotalLength() : 0;
}

bool ClipAudioSourceModel::isLooping() const {
    return false;
}

double ClipAudioSourceModel::getStartTimeSec() const {
    return startTime;
}

void ClipAudioSourceModel::setStartTimeSec(double newStartTime) {
    startTime = newStartTime;
}

double ClipAudioSourceModel::getDurationSec() const {
    return juce::jmax(0.0, cropEnd - cropStart);
}

double ClipAudioSourceModel::getCropStart() const {
    return cropStart;
}

double ClipAudioSourceModel::getCropEnd() const {
    return cropEnd;
}

void ClipAudioSourceModel::setCropStart(double newCropStart) {
    cropStart = newCropStart;
}

void ClipAudioSourceModel::setCropEnd(double newCropEnd) {
    cropEnd = newCropEnd;
}

const juce::String& ClipAudioSourceModel::getSourceName() const {
    return sourceLabel;
}

void ClipAudioSourceModel::setThumbnailSource(juce::InputSource* source) {
    thumbnail.setSource(source);
}

double ClipAudioSourceModel::getSourceSampleRate() const {
    return sourceSampleRate;
}

juce::String ClipAudioSourceModel::getClipId() const {
    return clipId;
}

void ClipAudioSourceModel::setClipId(const juce::String& id) {
    clipId = id;
}

void ClipAudioSourceModel::setStretchRatio(double ratio) {
    if (ratio <= 0.0)
        return;

    double oldRatio = stretchRatio;
    stretchRatio = ratio;
    
    DBG("[DEBUG] ClipAudioSourceModel::setStretchRatio - oldRatio=" << oldRatio << ", newRatio=" << ratio);

    if (stretcher && sr > 0.0) {
        stretcher->setTransposeFactor(1.0);
        DBG("[DEBUG] ClipAudioSourceModel::setStretchRatio - stretcher reset and transposeFactor set to 1.0");
    } else {
        DBG("[DEBUG] ClipAudioSourceModel::setStretchRatio - stretcher not yet initialized, will be configured in prepareToPlay");
    }
}

double ClipAudioSourceModel::getStretchRatio() const {
    return stretchRatio;
}

agrex · January 18, 2026, 7:18am

lcapozzi

I’m passing 1.0 as the argument to setTransposeFactor,
but does that also affect the pitch?

lcapozzi · January 18, 2026, 7:35am

Are you processing samples or real time audio? If you are processing incoming signal, you have to account for the different buffer length. As far as I remember, i had many headaches trying to use stretch on realtime audio, while it works like a charm for samples

agrex · January 18, 2026, 7:51am

I’m processing samples, but I’m still running into problems…

icebreakeraudio · January 19, 2026, 10:59am

I meant audio artifacts, like glitches, distortion, noise, that sort of thing. You say it plays at a lower pitch, but does it play perfectly at a lower pitch?

I can tell you now that putting DBG messages in an audio callback will cause glitches. I also find that any time/pitch effect running in realtime will suffer from glitches if you’re running a debug build, or if you haven’t got the settings right. Try running a release build and see how things go.

Ultimately how you approach this problem will depend heavily on your needs and end goals. When processing samples it might be better to render the time-stretching of the audio file in a background thread, then swap it out when it’s finished - however this has many pitfalls that AI coding assistants tend to miss; it also isn’t great for long audio files, but there are ways to deal with that

agrex · January 19, 2026, 12:18pm

The audio isn’t just playing at a lower pitch. It sounds slightly distorted, and it also feels like the pitch isn’t quite correct.
As you pointed out, I’m running it in a debug build. However, what concerns me is that when time‑stretching is disabled — meaning when the stretchRatio is 1.0 — I don’t hear any noticeable artifacts, despite it being a debug build.
I also feel that the fact that the tempo doesn’t change is something that can’t really be explained.
I’ll try checking it in a release build for now.

xenakios · January 19, 2026, 5:23pm

I got your code working in an offline processing context. I did have to change this in your clip source’s prepareToPlay, though :


stretcher = std::make_unique<signalsmith::stretch::SignalsmithStretch>();

stretcher->presetDefault(numChannels, sampleRate);

stretcher->reset();

That is, I changed the order of reset() and presetDefault. Without that, the code crashes right away for me and doesn’t even get to the audio processing.

The offline processing code I wrote for reference :

gist.github.com

https://gist.github.com/Xenakios/c9aebbb170c7a651d8ce16d553028639

clipsourcetest.cxx

inline void test_clipsource()
{
    juce::AudioFormatManager mana;
    mana.registerBasicFormats();
    auto reader =
        mana.createReaderFor(juce::File(R"(C:\MusicAudio\sourcesamples\ukulele\uku01.wav)"));
    double outsr = 96000.0;
    auto readersrc = std::make_unique<juce::AudioFormatReaderSource>(reader, true);
    auto src =
        new ClipAudioSourceModel(std::move(readersrc), juce::String("foo"), 0.0, 0.0, 1.0, mana);

This file has been truncated. show original

agrex · January 19, 2026, 7:59pm

icebreakeraudio, xenakios
Thank you very much for your replies.

I tried both of the suggestions you gave me — doing a release build, and changing the order of reset() and presetDefault() — each one separately.
Unfortunately, neither of them had any effect.
The behavior looks the same as before the fix.
The release build did make the audio slightly clearer, which is nice, but the major issues remain unchanged: the pitch shifts and the tempo does not.

Next, I plan to look at the code xenakios posted and compare it with mine to find elements that exist in his code but not in mine.

agrex · January 20, 2026, 11:26pm

UPDATE:

The other day, lcapozzi asked me whether I was processing samples or real‑time audio. I answered that I was processing samples, but it turns out that wasn’t correct. It seems I was actually processing real‑time audio.
Real‑time audio processing with signalsmith‑stretch looked difficult, just as lcapozzi mentioned, so I switched to bungee, which claims to support real‑time processing as well. However, the results sounded pretty much the same. Even when I tried to apply time‑stretching, the tempo didn’t change and the pitch dropped, making it sound out of tune.
It looks like there’s still quite a bit more investigation to do.

lcapozzi · January 21, 2026, 1:42pm

Be aware that the output buffer won’t have the same size of the input. Maybe that’s why you are getting artifacts

xipix · January 24, 2026, 12:00pm

Bungee developer here: if you can share your code and/or audio samples I may be able to help.

Topic		Replies	Views
Using SignalsmithStretch inside Juce Development	4	305	March 10, 2025
Audio mutes when using multiple signalsmith-stretch pitch shifters Audio Plugins	1	89	January 27, 2026
FR : Native and affordable Juce time-stretching Feature Requests	6	249	June 15, 2025
Real-time TimeStretching Tracktion Engine	7	1446	January 12, 2024
A lightweight Akai-style time-stretch algorithm (realtime!) General JUCE discussion	7	1448	August 14, 2024

Time‑stretching with JUCE + signalsmith‑stretch results in pitch drop instead of tempo change

Purchase

Discover

Learn

Support

About

Events

Time‑stretching with JUCE + signalsmith‑stretch results in pitch drop instead of tempo change

Related topics

Purchase

Discover

Learn

Support

About

Events