  1. 22 Aug, 2023 2 commits
  2. 21 Aug, 2023 1 commit
  3. 20 Aug, 2023 2 commits
  4. 19 Aug, 2023 2 commits
  5. 18 Aug, 2023 5 commits
  6. 17 Aug, 2023 4 commits
  7. 16 Aug, 2023 2 commits
  8. 15 Aug, 2023 1 commit
    • enable loading hf llama checkpoints for training (#446) · 0f7853c6
      Xuechen Li authored
      * prelim.
      
      * add hf conversion fn.
      
      * mlp.
      
      * change name.
      
      * fix bug.
      
      * inverse permute.
      
      * change comment.
      
      * revert style changes.
      
      * fix.
      
      * add doc.
      
      * revert.
      
      * enable load safe.
      
      * fix safe load.
      
      * fix import.
      
      * fix typing-related lints.
      
      * fix ckpt loading logic.
      
      * make single gpu work.
      
      * test with parallel.
      
      * ckpt format.
      
      * enable pretrained state dict.
      
      * remove unused imports.
      
      * remove unused.
      
      * mark idea related.
  9. 14 Aug, 2023 4 commits
  10. 13 Aug, 2023 7 commits
    • Bump to v2.0.6 · 67ae6fd7
      Tri Dao authored
    • Fix wheel building · 2ddeaa40
      Tri Dao authored
    • Merge branch 'piercefreeman-feature/demo-wheels' · d8ec6a2f
      Tri Dao authored
      * piercefreeman-feature/demo-wheels: (25 commits)
        Install standard non-wheel package
        Remove release creation
        Build wheel on each push
        Isolate 2.0.0 & cuda12
        Clean setup.py imports
        Remove builder project
        Bump version
        Add notes to github action workflow
        Add torch dependency to final build
        Exclude cuda erroring builds
        Exclude additional disallowed matrix params
        Full version matrix
        Add CUDA 11.7
        Release is actually unsupported
        echo OS version
        Temp disable deploy
        OS version build numbers
        Restore full build matrix
        Refactor and clean of setup.py
        Strip cuda name from torch version
        ...
    • Merge branch 'feature/demo-wheels' of... · 3c458cff
      Tri Dao authored
      Merge branch 'feature/demo-wheels' of https://github.com/piercefreeman/flash-attention into piercefreeman-feature/demo-wheels
      
      * 'feature/demo-wheels' of https://github.com/piercefreeman/flash-attention: (25 commits)
        Install standard non-wheel package
        Remove release creation
        Build wheel on each push
        Isolate 2.0.0 & cuda12
        Clean setup.py imports
        Remove builder project
        Bump version
        Add notes to github action workflow
        Add torch dependency to final build
        Exclude cuda erroring builds
        Exclude additional disallowed matrix params
        Full version matrix
        Add CUDA 11.7
        Release is actually unsupported
        echo OS version
        Temp disable deploy
        OS version build numbers
        Restore full build matrix
        Refactor and clean of setup.py
        Strip cuda name from torch version
        ...
    • Prepare for Cutlass 3.2 · dbd79237
      Tri Dao authored
    • Bump to v2.0.5 · c5e87b11
      Tri Dao authored
    • Update to Cutlass 3.1 · 3524e13c
      Tri Dao authored
  11. 11 Aug, 2023 4 commits
  12. 10 Aug, 2023 1 commit
  13. 01 Aug, 2023 5 commits