mirror of
https://github.com/Ed94/Odin.git
synced 2026-06-17 03:12:22 -07:00
f61dc7d071
This makes a tremendous difference (2x with SSE2, 3x with AVX2) on big datasets on my system, though the gain may be hardware-dependent (e.g. on instruction cache size). Naturally, it also results in somewhat larger code for the large-data case (~75% larger).