- 🤖 Master GenAI Engineering - Build Production AI Systems
- 🦀 Learn Professional Rust - Industry-Grade Development
- 📊 AWS AI & Analytics - Scale Your ML in Cloud
- ⚡ Production GenAI on AWS - Deploy at Enterprise Scale
- 🛠️ Rust DevOps Mastery - Automate Everything
- 💼 Production ML Program - Complete MLOps & Cloud Mastery
- 🎯 Start Learning Now - Fast-Track Your ML Career
- 🏢 Trusted by Fortune 500 Teams
Learn end-to-end ML engineering from industry veterans at PAIML.COM
A Rust based deduplication tool
- Build a multiplatform, fast deduplication tool that uses Rust parallelization.
- Added
- Added progress bar
- Add a GUI
- Add a web interface
- Fix GitHub Actions Build process to not fail silently!
- Use Polars DataFrame and include statistics about files and generate a CSV report.
- Store logs about actions performed across multiple runs
- Build: cd into rdedupe and run
make all
- Run:
cargo run -- dedupe --path tests --pattern .txt
- Run tests:
make test
- Install rust via rustup
- Add to
~/.cargo/config
[target.x86_64-apple-darwin]
rustflags = [
"-C", "link-arg=-undefined",
"-C", "link-arg=dynamic_lookup",
]
[target.aarch64-apple-darwin]
rustflags = [
"-C", "link-arg=-undefined",
"-C", "link-arg=dynamic_lookup",
]
- run
make all
in rdedupe directory