.gitignore · b28ec236df49cb57c213a9a4f29beb727adbf65b · Shenguo Wang / Flash Attention

enable loading hf llama checkpoints for training (#446) · 0f7853c6

Xuechen Li authored Aug 15, 2023

* prelim.

* add hf conversion fn.

* mlp.

* change name.

* fix bug.

* inverse permute.

* change comment.

* revert style changes.

* fix.

* add doc.

* revert.

* enable load safe.

* fix safe load.

* fix import.

* fix typing-related lints.

* fix ckpt loading logic.

* make single gpu work.

* test with parallel.

* ckpt format.

* enable pretrained state dict.

* remove unused imports.

* remove unused.

* mark idea related.


.gitignore 236 Bytes
