mirror of
https://github.com/azahar-emu/soundtouch
synced 2025-11-07 07:30:02 +01:00
Refactored FIRfilter and TDStretch hot-spot routines to help compiler perform more efficient autovectorization. Benchmarked: - 2x/3x improvement in gcc-generated x86 SIMD code execution times for SSE2/AVX instruction extensions accordingly, when hand-tuned SSE intrinsics were disabled. Hand-tuned SSE code still is slightly faster than gcc-produced AVX. - 2.4x improvement for cumulative ARM NEON tunings when compared to previous SoundTouch release. Signed-off-by: Olli Parviainen <oparviai'at'iki.fi>