- 22 Aug, 2023 2 commits
- 21 Aug, 2023 1 commit
-
-
GAOXinyu authored
-
- 20 Aug, 2023 2 commits
-
-
Xuechen Li authored
* q * add comment.
-
Tri Dao authored
-
- 19 Aug, 2023 2 commits
-
-
Tri Dao authored
-
Xuechen Li authored
* fix name. * set inv function. * add map back function. * handle gqa. * add type annotation to avoid confusion. * fix docstr. * test inverse remap logic.
-
- 18 Aug, 2023 5 commits
-
-
Tri Dao authored
-
Tri Dao authored
-
Xuechen Li authored
* uneql rank. * trim. * enable passing in number of heads for each rank. * simplify. * simplify. * cleanup. * fix col parallel. * fix bug with row parallel. * fit out proj. * refac. * fix sharding logic. * refac sharding. * refac. * support multiple of. * make fn reuseable. * fix bug in dimensions. * scaffold. * test uneven heads. * fix test by adding barrier. * refac. * reuse code. * clean up.
-
Tri Dao authored
-
Tri Dao authored
-
- 17 Aug, 2023 4 commits
- 16 Aug, 2023 2 commits
- 15 Aug, 2023 1 commit
-
-
Xuechen Li authored
* prelim. * add hf convertion fn. * mlp. * change name. * fix bug. * inverse permute. * change comment. * revert style changes. * fix. * add doc. * revert. * enable load safe. * fix safe load. * fix import. * fix typing-related lints. * fix ckpt loading logic. * make single gpu work. * test with parallel. * ckpt format. * enable pretrained state dict. * remove unused imports. * remove unused. * mark idea related.
-
- 14 Aug, 2023 4 commits
-
-
Tri Dao authored
-
Aman Gupta Karmani authored
-
Tri Dao authored
-
Tri Dao authored
-
- 13 Aug, 2023 7 commits
-
-
Tri Dao authored
-
Tri Dao authored
-
Tri Dao authored
* piercefreeman-feature/demo-wheels: (25 commits) Install standard non-wheel package Remove release creation Build wheel on each push Isolate 2.0.0 & cuda12 Clean setup.py imports Remove builder project Bump version Add notes to github action workflow Add torch dependency to final build Exclude cuda erroring builds Exclude additional disallowed matrix params Full version matrix Add CUDA 11.7 Release is actually unsupported echo OS version Temp disable deploy OS version build numbers Restore full build matrix Refactor and clean of setup.py Strip cuda name from torch version ...
-
Tri Dao authored
Merge branch 'feature/demo-wheels' of https://github.com/piercefreeman/flash-attention into piercefreeman-feature/demo-wheels * 'feature/demo-wheels' of https://github.com/piercefreeman/flash-attention: (25 commits) Install standard non-wheel package Remove release creation Build wheel on each push Isolate 2.0.0 & cuda12 Clean setup.py imports Remove builder project Bump version Add notes to github action workflow Add torch dependency to final build Exclude cuda erroring builds Exclude additional disallowed matrix params Full version matrix Add CUDA 11.7 Release is actually unsupported echo OS version Temp disable deploy OS version build numbers Restore full build matrix Refactor and clean of setup.py Strip cuda name from torch version ...
-
Tri Dao authored
-
Tri Dao authored
-
Tri Dao authored
-
- 11 Aug, 2023 4 commits
-
-
Pierce Freeman authored
-
Pierce Freeman authored
-
Pierce Freeman authored
-
Pierce Freeman authored
-
- 10 Aug, 2023 1 commit
-
-
Tri Dao authored
-
- 01 Aug, 2023 5 commits