    enable loading hf llama checkpoints for training (#446) · 0f7853c6
    Xuechen Li authored
    * prelim.
    
    * add hf conversion fn.
    
    * mlp.
    
    * change name.
    
    * fix bug.
    
    * inverse permute.
    
    * change comment.
    
    * revert style changes.
    
    * fix.
    
    * add doc.
    
    * revert.
    
    * enable load safe.
    
    * fix safe load.
    
    * fix import.
    
    * fix typing-related lints.
    
    * fix ckpt loading logic.
    
    * make single gpu work.
    
    * test with parallel.
    
    * ckpt format.
    
    * enable pretrained state dict.
    
    * remove unused imports.
    
    * remove unused.
    
    * mark idea related.
.gitignore 236 Bytes