[openal] [PATCH V2] Add some mixer SSE2/4.1 optimisations

Chris Robinson chris.kcat at gmail.com
Tue Jun 3 12:45:52 EDT 2014

On 06/03/2014 06:28 AM, Timothy Arceri wrote:
> Yes that does seem to work (at least in my test) and also seems to
> perform much better. My SSE2 resample code was taking around 4.45% of
> CPU; with this change it's down to 2.22%. For reference, the C code is at
> 6.23% and SSE4.1 1.5%.

Weird that the SSE4.1 linear resampler is performing that much better 
than the SSE2 version. With the _mm_store_ps/_mm_castsi128_ps trick, the 
code for the two becomes exactly the same.
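
For anyone following along, here is a minimal sketch of that trick. It is not the code from the patch. The helper names lanes_to_array and gather4 are made up for illustration, and the union layout is an assumption. The idea is to reinterpret the integer vector as a float vector (a cast that generates no instructions), store it to a 16-byte-aligned buffer, and read the four 32-bit lanes back as integers. That needs only SSE2, whereas an SSE4.1 path might pull the lanes out with _mm_extract_epi32.

```c
#include <emmintrin.h>  /* SSE2 intrinsics (also pulls in the SSE ones) */
#include <stdint.h>

/* Hypothetical helper: copy the four 32-bit lanes of v into out[0..3]
 * using only SSE/SSE2 instructions. */
static void lanes_to_array(__m128i v, uint32_t out[4])
{
    /* Both members share the same 16-byte-aligned storage. */
    union {
        _Alignas(16) float f[4];
        uint32_t i[4];
    } tmp;

    /* _mm_castsi128_ps only changes the vector's type, not its bits;
     * _mm_store_ps then writes all 128 bits to the aligned buffer. */
    _mm_store_ps(tmp.f, _mm_castsi128_ps(v));

    out[0] = tmp.i[0];
    out[1] = tmp.i[1];
    out[2] = tmp.i[2];
    out[3] = tmp.i[3];
}

/* Hypothetical helper: load src[pos[0]] .. src[pos[3]] into one vector,
 * roughly what a resampler needs when fetching samples by position. */
static __m128 gather4(const float *src, __m128i pos)
{
    uint32_t p[4];
    lanes_to_array(pos, p);
    return _mm_setr_ps(src[p[0]], src[p[1]], src[p[2]], src[p[3]]);
}
```

Called as gather4(samples, pos4), this returns the four samples at the given integer positions with lane 0 first.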
