Geeta Chauhan
|
fbc513ec47
adding notes how to get the HF models (#151)
|
2 years ago |
Hamid Shojanazeri
|
bcfafd9a0b
adding notes how to get the HF models
|
2 years ago |
Geeta Chauhan
|
cfba150311
adding llama code inference (#144)
|
2 years ago |
Hamid Shojanazeri
|
6105a3f886
clarifying the infilling use-case
|
2 years ago |
Hamid Shojanazeri
|
8b0008433c
fix typos
|
2 years ago |
Hamid Shojanazeri
|
564ef2f628
remove padding logic
|
2 years ago |
Hamid Shojanazeri
|
277a292fbc
adding autotokenizer
|
2 years ago |
Hamid Shojanazeri
|
3f2fb9167e
adding notes to model not supporting infilling
|
2 years ago |
Hamid Shojanazeri
|
c62428b99c
setting defaults of temp and top_p
|
2 years ago |
Hamid Shojanazeri
|
c014ae7cb8
setting BT option to true
|
2 years ago |
Hamid Shojanazeri
|
4fa44e16d9
add note for python llama not suited for llama infilling
|
2 years ago |
Hamid Shojanazeri
|
b18a186385
removing the option to take prompt from cli
|
2 years ago |
Hamid Shojanazeri
|
75991d8795
fix the extra line added and remove take prompt from cli
|
2 years ago |
Hamid Shojanazeri
|
d28fc9898a
addressing doc comments
|
2 years ago |
Hamid Shojanazeri
|
a234d1fe0c
fix typos
|
2 years ago |
Hamid Shojanazeri
|
2d9f4796e8
fixing the output format
|
2 years ago |
Hamid Shojanazeri
|
1e8ea70b26
adding llama code inference
|
2 years ago |
Geeta Chauhan
|
82e05c46e0
fix a bug in the config for use_fast_kernels (#121)
|
2 years ago |
Hamid Shojanazeri
|
971c079aa6
bugfix: remove duplicate load_peft_model (#124)
|
2 years ago |
hongbo.mo
|
fcc817e923
bugfix: remove duplicate load_peft_model
|
2 years ago |
Brian Vaughan
|
3faf005226
fix a bug in the config for use_fast_kernels
|
2 years ago |
Geeta Chauhan
|
03faba661f
Update paddings (#85)
|
2 years ago |
Geeta Chauhan
|
205e5a4b81
save cpu mem by leveraging FSDP rank0 broadcasting (#77)
|
2 years ago |
Hamid Shojanazeri
|
85a4ed1b65
Merge branch 'main' into update_paddings
|
2 years ago |
lchu
|
feaa344af3
resolve conflicts
|
2 years ago |
Geeta Chauhan
|
3f1fef7a00
adding flash attention and xformer memory efficient through PT SDPA (#97)
|
2 years ago |
Hamid Shojanazeri
|
beab5726cc
add notes for padding
|
2 years ago |
Hamid Shojanazeri
|
c3a11c4fbe
update to main
|
2 years ago |
Hamid Shojanazeri
|
51269b816f
moving Bt to the try block
|
2 years ago |
Hamid Shojanazeri
|
8fddaa9966
resolving conflicts
|
2 years ago |