This project fine-tunes Falcon 7B for HTML code generation. The model was selected for its performance on complex reasoning benchmarks such as ARC and GSM8K and for its compatibility with the available computational resources.
Dataset: https://huggingface.co/datasets/ttbui/html_alpaca (636 rows), with the following fields:
- Instruction: the user prompt (text)
- Input: additional context required by the instruction, such as HTML code or data points (text + code)
- Response: empty
- Output: the expected HTML code
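A minimal loading sketch; the exact field names are assumptions based on the description above and should be confirmed against the dataset card:

```python
# Minimal sketch: load the html_alpaca dataset and inspect its structure.
from datasets import load_dataset

dataset = load_dataset("ttbui/html_alpaca", split="train")
print(len(dataset))          # expected: 636 rows
print(dataset.column_names)  # instruction / input / response / output fields
print(dataset[0])
```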
The workflow consisted of:
- Model selection
- Dataset preparation and preprocessing (a prompt-formatting sketch follows this list)
- Model fine-tuning script (setting hyperparameters, choosing fine-tuning techniques and regularization)
- Model evaluation
- API development to serve the model
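As referenced in the preprocessing step above, one plausible way to turn each row into a single training string is an Alpaca-style template. The template wording and column names below are assumptions, not the exact ones used:

```python
# Hypothetical Alpaca-style prompt formatting; the actual template used
# during fine-tuning may differ.
from datasets import load_dataset

dataset = load_dataset("ttbui/html_alpaca", split="train")

def format_example(row):
    # Combine the instruction, optional input, and expected HTML output
    # into one training string.
    if row.get("input"):
        return {"text": (
            f"### Instruction:\n{row['instruction']}\n\n"
            f"### Input:\n{row['input']}\n\n"
            f"### Response:\n{row['output']}"
        )}
    return {"text": (
        f"### Instruction:\n{row['instruction']}\n\n"
        f"### Response:\n{row['output']}"
    )}

dataset = dataset.map(format_example)
print(dataset[0]["text"][:200])
```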
Key challenges:
- Understanding and implementing Parameter-Efficient Fine-Tuning (PEFT).
- Managing the computational complexity and memory limitations of large models.
- Ensuring reproducibility and consistency across training runs.
- Dealing with long training times and optimizing model runtime.
Solutions adopted:
- Completed training and evaluation without buying Colab Pro. An out-of-RAM error during model training was solved by loading the saved fine-tuned model directly rather than reloading the base model from scratch (illustrated in the inference sketch below the model link).
- Adopting PEFT techniques such as LoRA.
- Utilizing quantization and model sharding to manage memory usage (a combined LoRA + quantization sketch follows this list).
- Setting a random seed for train-test splitting to ensure reproducibility.
- Implementing mixed-precision training, early stopping, and learning-rate scheduling to improve convergence speed and work around GPU memory limitations.
- Applying regularization techniques such as dropout and the LoRA scaling factor.
- Setting training arguments carefully to balance performance and resource usage (listed below).
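A sketch combining the points above: 4-bit quantization, automatic device placement over a sharded checkpoint, and LoRA with dropout and a scaling factor. The base checkpoint name, target modules, and all numeric values here are assumptions, not confirmed settings:

```python
# Hedged sketch of the PEFT setup: 4-bit quantization plus LoRA adapters.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                      # quantize weights to 4 bits
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model = AutoModelForCausalLM.from_pretrained(
    "ybelkada/falcon-7b-sharded-bf16",      # assumed sharded base checkpoint
    quantization_config=bnb_config,
    device_map="auto",                      # place shards across available devices
    trust_remote_code=True,
)
model = prepare_model_for_kbit_training(model)

lora_config = LoraConfig(
    r=16,                                   # adapter rank (assumed)
    lora_alpha=32,                          # scaling factor (assumed value)
    lora_dropout=0.05,                      # dropout regularization (assumed value)
    target_modules=["query_key_value"],     # Falcon attention projection (assumed)
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()          # only a small fraction is trainable
```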
Training arguments:
1. learning_rate: 0.0002
2. train_batch_size: 2
3. eval_batch_size: 8
4. seed: 42
5. gradient_accumulation_steps: 2
6. total_train_batch_size: 4
7. optimizer: Adam with betas=(0.9, 0.999) and epsilon=1e-08
8. lr_scheduler_type: cosine
9. lr_scheduler_warmup_ratio: 0.03
10. training_steps: 320
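The same settings expressed as a hedged transformers TrainingArguments sketch; the output path and the bf16 flag are assumptions, and per_device_train_batch_size=2 follows from total_train_batch_size=4 with gradient_accumulation_steps=2:

```python
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="falcon-7b-html",       # hypothetical output path
    learning_rate=2e-4,
    per_device_train_batch_size=2,
    per_device_eval_batch_size=8,
    seed=42,
    gradient_accumulation_steps=2,     # effective train batch size: 4
    optim="adamw_torch",               # defaults: betas=(0.9, 0.999), eps=1e-8
    lr_scheduler_type="cosine",
    warmup_ratio=0.03,
    max_steps=320,
    bf16=True,                         # mixed-precision training (assumed)
)
```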
Model link: https://huggingface.co/PrincySinghal991/falcon-7b-sharded-bf16-finetuned-html-code-generation
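A hedged inference sketch that loads the published weights as a PEFT adapter on top of an assumed sharded base checkpoint, following the out-of-RAM fix described above (adjust if the repo stores a fully merged model instead):

```python
# Load the fine-tuned adapter directly instead of rebuilding the base
# model from scratch (the out-of-RAM fix described above).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base = AutoModelForCausalLM.from_pretrained(
    "ybelkada/falcon-7b-sharded-bf16",  # assumed base checkpoint
    torch_dtype=torch.bfloat16,
    device_map="auto",
    trust_remote_code=True,
)
model = PeftModel.from_pretrained(
    base,
    "PrincySinghal991/falcon-7b-sharded-bf16-finetuned-html-code-generation",
)
tokenizer = AutoTokenizer.from_pretrained("ybelkada/falcon-7b-sharded-bf16")

# Hypothetical prompt in the assumed training template.
prompt = "### Instruction:\nCreate a simple HTML login form.\n\n### Response:\n"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```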
- BLEU score: 0.01782
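For reference, a sketch of how a BLEU score of this kind can be computed with the evaluate library; the exact metric configuration behind the 0.01782 figure is not documented here:

```python
# Hedged sketch: BLEU between generated HTML and reference outputs.
import evaluate

bleu = evaluate.load("bleu")
predictions = ["<html><body><h1>Hello</h1></body></html>"]         # model outputs
references = [["<html><body><h1>Hello world</h1></body></html>"]]  # gold HTML
print(bleu.compute(predictions=predictions, references=references))
```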
Future directions:
- Exploring LLMs better suited to code generation.
- Hyperparameter tuning to improve the low evaluation score.