mirror of https://github.com/rustfs/rustfs.git synced 2026-01-17 01:30:33 +00:00

Files

weisd 4559baaeeb feat: migrate to reed-solomon-simd only implementation

- Remove reed-solomon-erasure dependency and all related code
- Simplify ReedSolomonEncoder from enum to struct with SIMD-only implementation
- Eliminate all conditional compilation (#[cfg(feature = ...)])
- Add instance caching with RwLock-based encoder/decoder reuse
- Implement reset mechanism to avoid unnecessary allocations
- Ensure thread safety with proper cache management
- Update documentation and benchmark scripts for SIMD-only approach
- Apply code formatting across all files

Breaking Changes:
- Removes support for reed-solomon-erasure feature flag
- API remains compatible but implementation is now SIMD-only

Performance Impact:
- Improved encoding/decoding performance through SIMD optimization
- Reduced memory allocations via instance caching
- Enhanced thread safety and concurrency support

2025-06-23 10:00:17 +08:00

3.3 KiB

Raw Blame History

ECStore - Erasure Coding Storage

ECStore provides erasure coding functionality for the RustFS project, using high-performance Reed-Solomon SIMD implementation for optimal performance.

Reed-Solomon Implementation

SIMD Backend (Only)

Performance: Uses SIMD optimization for high-performance encoding/decoding
Compatibility: Works with any shard size through SIMD implementation
Reliability: High-performance SIMD implementation for large data processing
Use case: Optimized for maximum performance in large data processing scenarios

Usage Example

use ecstore::erasure_coding::Erasure;

// Create erasure coding instance
// 4 data shards, 2 parity shards, 1KB block size
let erasure = Erasure::new(4, 2, 1024);

// Encode data
let data = b"hello world from rustfs erasure coding";
let shards = erasure.encode_data(data)?;

// Simulate loss of one shard
let mut shards_opt: Vec<Option<Vec<u8>>> = shards
    .iter()
    .map(|b| Some(b.to_vec()))
    .collect();
shards_opt[2] = None; // Lose shard 2

// Reconstruct missing data
erasure.decode_data(&mut shards_opt)?;

// Recover original data
let mut recovered = Vec::new();
for shard in shards_opt.iter().take(4) { // Only data shards
    recovered.extend_from_slice(shard.as_ref().unwrap());
}
recovered.truncate(data.len());
assert_eq!(&recovered, data);

Performance Considerations

SIMD Implementation Benefits

High Throughput: Optimized for large block sizes (>= 1KB recommended)
CPU Optimization: Leverages modern CPU SIMD instructions
Scalability: Excellent performance for high-throughput scenarios

Implementation Details

`reed-solomon-simd`

Instance Caching: Encoder/decoder instances are cached and reused for optimal performance
Thread Safety: Thread-safe with RwLock-based caching
SIMD Optimization: Leverages CPU SIMD instructions for maximum performance
Reset Capability: Cached instances are reset for different parameters, avoiding unnecessary allocations

Performance Tips

Batch Operations: When possible, batch multiple small operations into larger blocks
Block Size Optimization: Use block sizes that are multiples of 64 bytes for optimal SIMD performance
Memory Allocation: Pre-allocate buffers when processing multiple blocks
Cache Warming: Initial operations may be slower due to cache setup, subsequent operations benefit from caching

Cross-Platform Compatibility

The SIMD implementation supports:

x86_64 with advanced SIMD instructions (AVX2, SSE)
aarch64 (ARM64) with NEON SIMD optimizations
Other architectures with fallback implementations

The implementation automatically selects the best available SIMD instructions for the target platform, providing optimal performance across different architectures.

Testing and Benchmarking

Run performance benchmarks:

# Run erasure coding benchmarks
cargo bench --bench erasure_benchmark

# Run comparison benchmarks
cargo bench --bench comparison_benchmark

# Generate benchmark reports
./run_benchmarks.sh

Error Handling

All operations return Result types with comprehensive error information:

Encoding errors: Invalid parameters, insufficient memory
Decoding errors: Too many missing shards, corrupted data
Configuration errors: Invalid shard counts, unsupported parameters

3.3 KiB Raw Blame History