Skip to content

Latest commit

 

History

History
190 lines (154 loc) · 11.7 KB

File metadata and controls

190 lines (154 loc) · 11.7 KB

The latest published Fearless SIMD release is 0.4.0 which was released on 2026-02-13. You can find its changes documented below.

This release has an MSRV of 1.88.

0.4.0 (2026-02-13)

This release has an MSRV of 1.88.

Added

  • All vector types now implement Index and IndexMut. (#112 by @Ralith)
  • 256-bit vector types now use native AVX2 intrinsics on supported platforms. (#115 by @valadaptive)
  • 8-bit integer multiplication is now implemented on x86. (#115 by @valadaptive)
  • New native-width associated types: f64s and mask64s. (#125 by @valadaptive)
  • The bitwise "not" operation on integer vector types. (#130 by @valadaptive)
  • The from_fn method on vector types. (#137 by @valadaptive)
  • The load_interleaved and store_interleaved operations now use native intrinsics on x86, instead of using the fallback implementations. (#140 by @valadaptive)
  • Add support for relaxed_simd operations in WebAssembly. (#143 by @valadaptive)
  • The ceil and round_ties_even operations on floating-point vector types. (Rust's round operation rounds away from zero in the case of ties. Many architectures do not natively implement that behavior, so it's omitted.) (#145 by @valadaptive)
  • A prelude module, which exports all the traits in the library but not the types. (#149 by @valadaptive)
  • The any_true, all_true, any_false, and all_false methods on mask types. (#141 by @valadaptive)
  • Documentation for most traits, vector types, and operations. (#154 by @valadaptive)
  • A "shift left by vector" operation, to go with the existing "shift right by vector". (#155 by @valadaptive)
  • "Precise" float-to-integer conversions, which saturate out-of-bounds results and convert NaN to 0 across all platforms. (#167 by @valadaptive)
  • Add the slide and slide_within_blocks methods for shifting elements within a vector. (#164 by @valadaptive)
  • The Level::is_fallback method, which lets you check if the current SIMD level is the scalar fallback. This works even if Level::Fallback is not compiled in, always returning false in that case. (#168 by @valadaptive)
  • Added store_array methods to store SIMD vectors back to memory explicitly using intrinsics. (#181 by @LaurenzV)

Fixed

  • Improved the performance for load/store operations of vectors. (#185 by @valadaptive)
  • Integer equality comparisons now function properly on x86. Previously, they performed "greater than" comparisons. (#115 by @valadaptive)
  • All float-to-integer and integer-to-float conversions are implemented properly on x86, including the precise versions. (#134 by @valadaptive)
  • The floating-point min_precise and max_precise operations now behave the same way on x86 and WebAssembly as they do on AArch64, returning the non-NaN operand if one operand is NaN and the other is not. Previously, they returned the second operand if either was NaN. (#136 by @valadaptive)

Changed

  • Breaking change: The AVX2 level now requires all features from the x86-64-v3 baseline. (#188 by @Shnatsel)

  • Breaking change: Level::fallback has been removed, replaced with Level::baseline. (#105 by @DJMcNab) This corresponds with a change to avoid compiling in support for the fallback level on compilation targets which don't require it; this is most impactful for binary size on WASM, Apple Silicon Macs or Android. A consequence of this is that the available variants on Level are now dependent on the target features you are compiling with. The fallback level can be restored with the force_support_fallback cargo feature. We don't expect this to be necessary outside of tests.

  • Code generation for select and unzip operations on x86 has been improved. (#115 by @valadaptive)

  • Breaking change: The native-width associated types (f32s, u8s, etc.) for the Avx2 struct have been widened from 128-bit types (like f32x4) to 256-bit types (like f32x8). (#123 by @valadaptive)

  • Breaking change: All the vector types' inherent methods have been removed. Any remaining functionality has been moved to trait methods. (#149 by @valadaptive)

    Some functionality is exposed under different names:

    • Instead of the reinterpret methods, use the bitcast method on the Bytes trait. (e.g. foo.reinterpret_i32() -> foo.bitcast::<i32x4<_>>())
    • Instead of the cvt methods, use the to_int or to_float convenience methods on the SimdFloat and SimdInt traits (e.g. foo.cvt_u32() -> foo.to_int::<u32x4<_>>())

    Some functionality (such as split or combine) is exposed under new traits. You may use the new prelude module to conveniently import all of the traits.

  • Breaking change: The madd and msub methods have been renamed to mul_add and mul_sub, matching Rust's naming conventions. (#158 by @Shnatsel)

  • Breaking change: the val field on SIMD vector types is now private, and vector types are no longer represented as arrays internally. To access a vector type's elements, you can use the Into or Deref traits to obtain an array, or the as_slice/as_mut_slice methods to obtain a slice. (#159 by @valadaptive)

  • Breaking change: the Element type on the SimdBase trait is now an associated type instead of a type parameter. This should make it more pleasant to write code that's generic over different vector types. (#170 by @valadaptive)

  • The WasmSimd128 token type now wraps the new crate::core_arch::wasm32::WasmSimd128 type. This doesn't expose any new functionality as WASM SIMD128 can only be enabled statically, but matches all the other backend tokens. (#176 by @valadaptive)

  • Breaking change: the SimdFrom::simd_from method now takes the SIMD token as the first argument instead of the second. This matches the argument order of the from_slice, splat, and from_fn methods on SimdBase. (#180 by @valadaptive)

Removed

  • Breaking change: The (deprecated) simd_dispatch! macro. (#105 by @DJMcNab)

0.3.0 (2025-10-14)

This release has an MSRV of 1.86.

Added

  • SimdBase::witness to fetch the Simd implementation associated with a generic vector. (#76 by @Ralith)
  • Select is now available on native-width masks. (#77, #83 by @Ralith)
  • Simd::shrv_* preforms a right shift with shift amount specified per-lane. (#79 by @Ralith)
  • The >> operator is implemented for SIMD vectors. (#79 by @Ralith)
  • Assignment operator implementations. (#80 by @Ralith)
  • SimdFrom splatting is available on native-width vectors. (#84 by @Ralith)
  • Left shift by u32. (#86 by @Ralith)
  • Unary negation of signed integers. (#91 by @Ralith)
  • A simpler dispatch macro to replace simd_dispatch. (#96, #99 by @Ralith, @DJMcNab)

Fixed

  • Simd now requires consistent mask types for native-width vectors. (#75 by @Ralith)
  • Simd now requires consistent Bytes types for native-width vectors, enabling Bytes::bitcast in generic code. (#81 by @Ralith)
  • Scalar fallback now uses wrapping integer addition. (#85 by @Ralith)

Changed

  • Breaking: a.madd(b, c) and a.msub(b, c) now correspond to a * b + c and a * b - c for consistency with mul_add in std. (#88 by @Ralith) Previously, madd was a + b * c, and msub was a - b * c. Therefore, if you previously had a.madd(b, c), that's now written as b.madd(c, a). And if you had a.msub(b, c), that's now written b.madd(-c, a).
  • Constructors for static SIMD levels are now const (#93 by @Ralith)

0.2.0 (2025-08-26)

There has been a complete rewrite of Fearless SIMD. For some details of the ideas used, see our blog post Towards fearless SIMD, 7 years later.

The repository has also been moved into the Linebender organisation.

0.1.1 (2018-11-05)

No changelog was kept for this release.

0.1.0 (2018-10-19)

This is the initial release. No changelog was kept for this release.