|
|
4 meses atrás | |
|---|---|---|
| .. | ||
| data | 4 meses atrás | |
| eval | 4 meses atrás | |
| fine-tuning | 4 meses atrás | |
| quickstart | 4 meses atrás | |
| README.md | 4 meses atrás | |
This folder contains scripts to:
Evaluate Llama (original and fine-tuned) models on the Text2SQL task using the popular BIRD dataset in 3 simple steps;
Generate fine-tuning datasets (both with and without CoT reasoning) and fine-tuning Llama 3.1 8B with the datasets, gaining a 165% (with no reasoning) and 209% (with reasoning) accuracy improvement over the original model.
Our end goal is to maximize the accuracy of Llama models on the Text2SQL task. To do so we need to first evaluate the current state of the art Llama models on the task, then apply fine-tuning, agent and other approaches to evaluate and improve Llama's performance.