rahul-sarvam
|
0efb8bd31e
Update README.md
|
11 months ago |
rahul-sarvam
|
687c2dc5d8
Update README.md
|
11 months ago |
Rahul A R
|
2fa8e69b62
add new argument: tokenizer_name
|
11 months ago |
Rahul A R
|
f8183b96fe
use new tokenizer_name argument and resize embeddings if required
|
11 months ago |
Rahul A R
|
1e4e3e00fc
adding new multilingual recipe
|
11 months ago |
Kai Wu
|
362cda0fa6
fixing test_gradient_accumulation and test_save_to_json
|
11 months ago |
Kai Wu
|
fa0a389f74
add max_step feature for training and eval
|
11 months ago |
Suraj Subramanian
|
201daff2d1
Add note on CUDA version + remove 'test' from pytorch whl url
|
11 months ago |
Hamid Shojanazeri
|
37c8f72211
Update location and name of llm.py example notebook (#417)
|
11 months ago |
Thomas Robinson
|
79266217ef
Update location and name of llm.py example notebook
|
11 months ago |
Hamid Shojanazeri
|
f7aa02af9f
only save training params on rank 0 (#415)
|
11 months ago |
jpgard
|
6954b16b3b
only save training params on rank 0
|
11 months ago |
Allen
|
525de548aa
version 1.0, inference with h2o on summarization tasks
|
11 months ago |
Allen
|
115e9306d5
Update cache.py
|
11 months ago |
Allen
|
cedb89b064
Update generation.py
|
11 months ago |
Allen
|
428e8e83ed
test
|
11 months ago |
Allen
|
9fb1080e17
test
|
11 months ago |
Allen
|
f2802dd7ee
Update generation.py
|
11 months ago |
Allen
|
9441e0c5cb
Update exp.sh
|
11 months ago |
Allen
|
38cec86026
Update utils_llama.py
|
11 months ago |
Allen
|
5affb02787
Update utils_llama.py
|
11 months ago |
Allen
|
a694fe8b62
Update utils_llama.py
|
11 months ago |
Allen
|
43e85991ab
Update utils_llama.py
|
11 months ago |
Allen
|
b33b68c3f7
Update utils_llama.py
|
11 months ago |
Allen
|
ab07cdcf6c
Update utils_llama.py
|
11 months ago |
Allen
|
84ddf520a7
Update utils_llama.py
|
11 months ago |
Allen
|
5b50ca5687
Update utils_llama.py
|
11 months ago |
Allen
|
0832c0620c
Update utils_llama.py
|
11 months ago |
Allen
|
036620e6d7
Update utils_llama.py
|
11 months ago |
Allen
|
57d1f6d04f
Update utils_llama.py
|
11 months ago |