| File | Last commit | Updated |
|---|---|---|
| deepspeed_zero3.yaml | c88e10fab6 grpo llama 3.2 3b with 3 reward functions | 1 month ago |
| grpo-llama323b-text2sql.yaml | c88e10fab6 grpo llama 3.2 3b with 3 reward functions | 1 month ago |
| grpo_text2sql.py | c88e10fab6 grpo llama 3.2 3b with 3 reward functions | 1 month ago |
| requirements.txt | c88e10fab6 grpo llama 3.2 3b with 3 reward functions | 1 month ago |