[FR] Vectorized summation

It would be great to have a cross-platform function to sum vectors, something along the lines of float FloatVectorOperations::sum (const float* vec, int n) and its double equivalent. I’ve already run across several uses in DSP and analysis situations.

This is an honest question, not snark: why wouldn’t you write this yourself? I’m coming from the world of Python, where I’m familiar with loops being slow, but I thought vectorization was just a loop created by a more performant language like Cython (C/C++), which we are already using in JUCE.

If you write a regular for-loop yourself, there’s no guarantee the compiler will vectorize it automatically. That’s why a manually written vectorized version would be preferable. (It maybe isn’t that hard to do, but it would be a nice facility to have directly available as a JUCE function.)
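For reference, the hand-written version is just the obvious loop below; whether the compiler turns it into SIMD instructions depends entirely on the compiler and optimisation flags. (The function name sumScalar is just for illustration, not part of JUCE.)

```cpp
// The naive scalar sum. An optimising compiler may auto-vectorize
// this, but there is no guarantee — hence the request for an
// explicit SIMD implementation.
float sumScalar (const float* src, int num)
{
    float total = 0.0f;

    for (int i = 0; i < num; ++i)
        total += src[i];

    return total;
}
```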

I’m confused here. What do you mean by “vectorize”? I thought you were just talking about summing the values in a std::vector. What C++ syntax would accomplish such a thing? You mean like a std::for_each() function? Or…?

I see I had a misunderstanding. I just finished doing a bit of reading, and yes, this is a bit lower level than I’m used to dealing with, based on efficiency gains from SIMD.

There are so many overlaps in terminology between technical fields that it’s easy to think you know something and then turn out not to have any idea.

“Vectorize” as in “use SIMD instructions” in the hopes it will make the execution faster.

And not that it matters, since it’s not what the OP was talking about, but to sum a std::vector you would likely want to use std::accumulate :nerd_face:
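For completeness, a minimal sketch of that std::accumulate approach (the wrapper name sumVector is hypothetical, just to give it a testable shape):

```cpp
#include <numeric>
#include <vector>

// Sums the elements of a std::vector<float> with std::accumulate,
// which folds operator+ over the range starting from the initial value.
// Note the 0.0f: passing a plain 0 would accumulate in int and
// truncate every element.
float sumVector (const std::vector<float>& v)
{
    return std::accumulate (v.begin(), v.end(), 0.0f);
}
```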


Something like that, although I don’t have the JUCE syntax for their SIMD macros down:

float JUCE_CALLTYPE FloatVectorOperations::sum (const float* src, int num) noexcept
{
    float sum = 0;

   #if JUCE_USE_VDSP_FRAMEWORK
    // vDSP_sve sums a vector: stride 1, num elements, result in sum.
    vDSP_sve ((float*) src, 1, &sum, (vDSP_Length) num);
   #else
    assert (0); // no fallback implementation yet for other platforms
   #endif

    return sum;
}

Indeed, SIMD is the more accurate term here. I wrote one myself, both for the Mac (vDSP) and for SSE intrinsics. But it seems a frequent enough function that it would make sense to have a canonical implementation right there with the other functions, i.e. SIMD multiply, add, etc.
My implementation is tailored to my special needs, e.g. it assumes memory is aligned to 16-byte boundaries. A more generalized implementation would be better. And if we wanted to port the code to ARM, we’d have to remember to add a special implementation there.
Both Clang and Visual Studio with optimisation turned all the way up didn’t vectorize the trivial loop automatically, and doing it manually yielded a very measurable difference in performance.
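For illustration, an SSE version along those lines might look like the sketch below. It makes the same simplifying assumptions the poster describes: src is 16-byte aligned and n is a multiple of 4. The function name sumSSE is hypothetical, not JUCE API.

```cpp
#include <xmmintrin.h> // SSE intrinsics

// Sketch of an SSE sum. Assumes src is 16-byte aligned and n is a
// multiple of 4; a general implementation would need an unaligned
// path and a scalar tail loop for the remaining elements.
float sumSSE (const float* src, int n)
{
    __m128 acc = _mm_setzero_ps();

    // Accumulate four floats per iteration.
    for (int i = 0; i < n; i += 4)
        acc = _mm_add_ps (acc, _mm_load_ps (src + i));

    // Horizontal add of the four lanes.
    alignas (16) float lanes[4];
    _mm_store_ps (lanes, acc);
    return lanes[0] + lanes[1] + lanes[2] + lanes[3];
}
```

Note that the summation order differs from a sequential loop (four partial sums), so results can differ from the scalar version in the last bits of precision.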