Intelligent Machine Learning (IML) targets to setup a full-stack, high-performant and intelligent infrastructure of deep learning for both offline and online, including data processing, model training, model evaluation, and model inferencing, and makes DL real engineering-free and democratic for AI-driven biz.
IML
Pinned Loading
Repositories
Showing 8 of 8 repositories
- flash-attention Public Forked from Dao-AILab/flash-attention
Fast and memory-efficient exact attention
intelligent-machine-learning/flash-attention’s past year of commit activity