1. 26 May, 2024 7 commits
  2. 23 May, 2024 1 commit
  3. 06 May, 2024 1 commit
  4. 26 Apr, 2024 3 commits
  5. 08 Apr, 2024 4 commits
  6. 05 Apr, 2024 1 commit
  7. 28 Mar, 2024 2 commits
  8. 19 Mar, 2024 1 commit
  9. 15 Mar, 2024 3 commits
  10. 14 Mar, 2024 2 commits
  11. 02 Mar, 2024 2 commits
  12. 21 Feb, 2024 4 commits
  13. 20 Feb, 2024 1 commit
  14. 18 Feb, 2024 1 commit
    • Qubitium's avatar
      Optimize compile to 1: avoid oom 2: minimize swap usage 3: avoid threads... · f45bbb4c
      Qubitium authored
      Optimize compile to 1: avoid oom 2: minimize swap usage 3: avoid threads starvation when letting ninja decide how many workers to spawn or manual MAX_JOBS "guesses". Logic is to take the min value of MAX_JOBS auto-calculated by two metrics: 1: cpu cores 2: free memory. This should allow flash-attn to compile close to the most efficient manner under any consumer/server env. (#832)
      
      f45bbb4c
  15. 10 Feb, 2024 4 commits
  16. 08 Feb, 2024 1 commit
  17. 31 Jan, 2024 2 commits