Explorar el Código

READMEs update

Jeff Tang hace 1 semana
padre
commit
2cdfbf0593

+ 2 - 2
end-to-end-use-cases/coding/text2sql/eval/README.md

@@ -15,8 +15,8 @@ Below are the results of the Llama models we have evaluated on the BIRD DEV data
 | Llama 4 Maverick       | 44.00%             | 41.46%            |
 
 - Llama 3.1 8b on Hugging Face: quantized 14.02%, non-quantized 39.47%
-- Fine-tuned with no CoT dataset: 39.31%
-- Fine-tuned with CoT dataset: 43.35%
+- Non-quantized FFT with no CoT dataset: 36.31%
+- Non-quantized FFT with CoT dataset: 43.87%
 
 ## Quick Start
 

+ 12 - 12
end-to-end-use-cases/coding/text2sql/fine-tuning/README.md

@@ -10,18 +10,18 @@ This folder contains scripts to:
 
 ## Eval Results of the Fine-tuned Models
 
-The eval results of SFT Llama 3.1 8B with different options (epochs is 3) are summarized below:
-
-| Fine-tuning Combination     | Accuracy |
-|-----------------------------|----------|
-| Non-Quantized, CoT, PEFT    | 43.35%   |
-| Quantized, CoT, PEFT        | 42.89%   |
-| Non-Quantized, CoT, FFT     | 42.44%   |
-| Non-Quantized, No CoT, PEFT | 39.31%   |
-| Quantized, No CoT, PEFT     | 39.31%   |
-| Non-Quantized, No CoT, FFT  | 36.31%   |
-| Quantized, CoT, FFT         | N/A      |
-| Quantized, No CoT, FFT      | N/A      |
+The eval results of SFT Llama 3.1 8B with different options (epochs is 3, with an additional 10 for the two FFT models) are summarized below:
+
+| Fine-tuning Combination     | Accuracy                      |
+|-----------------------------|-------------------------------|
+| Non-Quantized, CoT, PEFT    | 43.35%                        |
+| Quantized, CoT, PEFT        | 42.89%                        |
+| Non-Quantized, CoT, FFT     | 42.44% (43.87% for 10 epochs) |
+| Non-Quantized, No CoT, PEFT | 39.31%                        |
+| Quantized, No CoT, PEFT     | 39.31%                        |
+| Non-Quantized, No CoT, FFT  | 36.31% (38.27% for 10 epochs) |
+| Quantized, CoT, FFT         | N/A                           |
+| Quantized, No CoT, FFT      | N/A                           |
 
 The table above shows that: