Sanyam Bhutani 80a6fce069 fix toolcall 6 meses atrás
..
configs 84b4a054bb add folders 7 meses atrás
data_prep 5c24996241 add prep scripts 6 meses atrás
scripts 80a6fce069 fix toolcall 6 meses atrás
ReadMe.MD 5c24996241 add prep scripts 6 meses atrás

ReadMe.MD

WIP

The end goal for this effort is to serve as fine-tuning data preparation kit.

Current status:

Currently, I'm (WIP) evaluating the idea to improve tool-calling datasets.

Setup:

  • configs: Has the config prompts for creating synthetic data using 3.3
  • data_prep/scripts: This is what you would like to run to prepare your datasets for annotation
  • scripts/annotation-inference: Script for generating synthetic datasets -> Use the vllm script for inference
  • fine-tuning: configs for FT using TorchTune