| File | Commit | Message | Last updated |
| --- | --- | --- | --- |
| __init__.py | 207d2f80e9 | Make code-llama and hf-tgi inference runnable as module | 1 year ago |
| chat_utils.py | 6d9d48d619 | Use apply_chat_template instead of custom functions | 1 year ago |
| checkpoint_converter_fsdp_hf.py | 0e54f5634a | use AutoTokenizer instead of LlamaTokenizer | 11 months ago |
| llm.py | eeb45e5f2c | Updated model names for OctoAI | 11 months ago |
| model_utils.py | d51d2cce9c | adding sdpa for flash attn | 1 year ago |
| prompt_format_utils.py | bcdb5b31fe | Fixing quantization config. Removing prints | 1 year ago |
| safety_utils.py | f63ba19827 | Fixing tokenizer used for llama 3. Changing quantization configs on safety_utils. | 1 year ago |