1 год назад · 5a18b6b997
--- a/end-to-end-use-cases/coding/text2sql/tool/README.md
+++ b/end-to-end-use-cases/coding/text2sql/tool/README.md
@@ -27,16 +27,16 @@ Llama-3.1-405B: Together: 55.80% - Together: 57.17%
 
																 Llama 4 Scout: 43.94% - Llama API: 44.39%
															
 
																 Llama 4 Maverick: 41.46% - Llama API: 44.00%
															
 
																-### Supported Models
															
 
																+## Supported Models
															
 
																-#### Together AI Models
															
 
																+### Together AI Models
															
 
																 - meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo
															
 
																 - meta-llama/Llama-3.3-70B-Instruct-Turbo
															
 
																 - meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8
															
 
																 - meta-llama/Llama-4-Scout-17B-16E-Instruct
															
 
																 - other Llama models hosted on Together AI
															
 
																-#### Llama API Models
															
 
																+### Llama API Models
															
 
																 - Llama-3.3-8B-Instruct
															
 
																 - Llama-3.3-70B-Instruct
															
 
																 - Llama-4-Maverick-17B-128E-Instruct-FP8
															
@@ -53,31 +53,6 @@ Llama 4 Maverick: 41.46% - Llama API: 44.00%
 
																 4. **Accuracy Calculation**: Accuracy scores are calculated overall and broken down by difficulty levels (simple, moderate, challenging).
															
 
																-## Data Format
															
 
																-
															
 
																-The evaluation data should be in JSON format with the following structure:
															
 
																-
															
 
																-```json
															
 
																-[
															
 
																-  {
															
 
																-    "question": "Natural language question",
															
 
																-    "db_id": "database_name",
															
 
																-    "evidence": "External knowledge (optional)",
															
 
																-    "SQL": "Ground truth SQL query",
															
 
																-    "difficulty": "simple|moderate|challenging"
															
 
																-  },
															
 
																-  ...
															
 
																-]
															
 
																-```
															
 
																-
															
 
																-## Output
															
 
																-
															
 
																-The evaluation produces:
															
 
																-- Generated SQL queries saved to the specified output directory
															
 
																-- Accuracy scores printed to the console, broken down by difficulty level
															
 
																-
															
 
																-
															
 
																-
															
 
																 ## Preparing Fine-tuning Dataset
															
 
																 ### Using the TRAIN to prepare for supervised fine-tuning
															
@@ -100,7 +75,7 @@ This will create `train_text2sql_sft_dataset.json` and `test_text2sql_sft_datase
 
																 {"messages":[{"content":"You are a text to SQL query translator. Using the SQLite DB Schema and the External Knowledge, translate the following text question into a SQLite SQL select statement.","role":"system"},{"content":"-- DB Schema: <DB_SCHEMA>\n\n-- External Knowledge: <KNOWLEDGE_FROM_TRAIN>\n\n-- Question: <TEXT_QUESTION>","role":"user"},{"content":"<GOLD_SQL>","role":"assistant"}]}
															
 
																 ```
															
 
																-3. Supervised Fine-tuning
															
 
																+## Supervised Fine-tuning
															
 
																 First, you need to login to HuggingFace (via running `huggingface-cli login` and enter your [HF token](https://huggingface.co/settings/tokens)) and have been granted access to the [Llama 3.1 8B Instruct](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct) model.
															
@@ -110,6 +85,7 @@ After running `tensorboard --logdir ./llama31-8b-text2sql-fine_tuning` you can o
 
																 ![](fine_tuning/train_loss.png)
															
 
																+
															
 
																 ## Evaluating the fine-tuned model
															
 
																 First, modify `llama_eval.sh` to use the fine-tuned model:
															
@@ -139,8 +115,21 @@ Note that this is using the 4-bit quantized Llama 3.1 8b model to reduce the mem
 
																   )
															
 
																 ```
															
 
																+
															
 
																 ### Creating a reasoning dataset from the TRAIN dataset
															
 
																 In the fine_tuning folder, run:
															
 
																 ```
															
 
																 python create_reasoning_dataset.py --input_json data/train/train.json --db_root_path data/train/train_databases
															
 
																 ```
															
 
																+This will create `text2sql_cot_dataset` dataset in HuggingFace format, which is ready for fine-tuning with the reasoning prompt. Each line in the json file is in the conversation format ready for fine-tuning:
															
 
																+
															
 
																+```
															
 
																+    "messages": [
															
 
																+        {
															
 
																+            "role": "system",
															
 
																+            "content": "You are a text to SQL query translator. Using the SQLite DB Schema and the External Knowledge, generate the step-by-step reasoning and the final SQLite SQL select statement from the text question.",
															
 
																+        },
															
 
																+        {"role": "user", "content": prompt},
															
 
																+        {"role": "assistant", "content": reasoning},
															
 
																+    ]
															
 
																+```