This is the artifact of the submitted ICSE'25 paper: "Search-Based LLMs for Code Optimization".
Requirements:

- Python 3.9.12
- C++17
- GCC 9.4.0
- Linux
Install the Python dependencies by running the following command in the root directory of this repo:

```shell
pip install -r requirements.txt
```
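Before running anything, it can help to confirm the interpreter matches the tested configuration above. A minimal sanity check (nearby versions may also work, but were not evaluated by the authors):

```python
import platform
import sys

# Quick sanity check against the artifact's tested setup (Python 3.9.12 on Linux).
print("python :", platform.python_version())
print("system :", platform.system())
print("matches tested minor version (3.9):", sys.version_info[:2] == (3, 9))
```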
- Download the processed dataset and test cases following the instructions in the `processed_data/` folder.
- Clone the pie-perf repository for evaluation, following the instructions in the `pie/` folder.
- Our code relies on the services of OpenAI (for ChatGPT and GPT-4), Google (for Gemini), and DeepInfra (for CodeLLaMa), so you first need to obtain their API keys and fill them in `baselines/baselines.py` and `sbllm/evol_query.py`.
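One convenient way to manage the keys before pasting them into those two files is to keep them in environment variables. The sketch below is only an illustration; the variable names are our assumption, not identifiers used by the SBLLM codebase:

```python
import os

# Hypothetical helper: gather the three provider keys from environment
# variables. The variable names are assumptions for illustration only.
def load_api_keys():
    keys = {
        "openai": os.environ.get("OPENAI_API_KEY", ""),
        "google": os.environ.get("GOOGLE_API_KEY", ""),
        "deepinfra": os.environ.get("DEEPINFRA_API_KEY", ""),
    }
    missing = [name for name, key in keys.items() if not key]
    return keys, missing

keys, missing = load_api_keys()
print("providers still missing a key:", missing)
```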
- SBLLM acquires its initialization results from the chain-of-thought (COT) prompt, so you first need to obtain the COT prompting results:

  ```shell
  cd baselines
  bash cot.sh
  ```

  You can change the model name in `cot.sh` to experiment with different models (i.e., chatgpt, gpt4, gemini, codellama).
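For intuition, a chain-of-thought optimization prompt generally follows the shape sketched below. The wording here is an assumed illustration only; the exact prompt used by `cot.sh` lives in the repo:

```python
# Illustrative CoT-style prompt template. The wording is an assumption
# for exposition, NOT the exact prompt shipped with cot.sh.
COT_TEMPLATE = (
    "Below is a {lang} program:\n"
    "{code}\n"
    "First, think step by step about why the program is slow.\n"
    "Then write an optimized version that produces the same output."
)

prompt = COT_TEMPLATE.format(lang="python", code="print(sum(range(10)))")
print(prompt)
```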
- Get the initialization solutions for SBLLM by processing the COT predictions:

  ```shell
  cd sbllm
  python initial.py --model_name model_name --lang lang
  ```
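Conceptually, this post-processing step turns raw COT replies into candidate programs. A minimal sketch of such extraction (illustrative only; `initial.py` implements the repo's actual logic):

```python
import re

FENCE = "`" * 3  # literal triple backticks, built here to keep this example readable

def extract_code(reply: str) -> str:
    # Pull the first fenced code block out of an LLM reply, falling back to
    # the raw text. Illustrative sketch, not the repo's initial.py.
    pattern = FENCE + r"(?:\w+)?\n(.*?)" + FENCE
    match = re.search(pattern, reply, re.DOTALL)
    return match.group(1).strip() if match else reply.strip()

reply = "Optimized version:\n" + FENCE + "python\nprint(1 + 1)\n" + FENCE
print(extract_code(reply))  # → print(1 + 1)
```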
- Run SBLLM with:

  ```shell
  bash run.sh      # or run_cpp.sh for the C++ dataset
  ```

  SBLLM will then use the default settings to optimize the code in the test set: `ns=3` and `iteration=4`, consistent with the paper.
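At a high level, a search-based loop with this budget keeps a small population of candidate programs, asks the LLM for improved variants, and retains the fastest ones. The sketch below is a simplified illustration with hypothetical `query_llm` and `measure_runtime` stand-ins; SBLLM's actual pipeline (in `sbllm/`) is more involved:

```python
def search_optimize(code, query_llm, measure_runtime, ns=3, iterations=4):
    # Simplified sketch of a search-based optimization loop with the paper's
    # default budget (ns=3 candidates kept, 4 iterations). query_llm and
    # measure_runtime are hypothetical stand-ins, not SBLLM's real components.
    population = [code]
    for _ in range(iterations):
        candidates = [query_llm(c) for c in population]  # ask the LLM to improve each survivor
        pool = population + candidates
        pool.sort(key=measure_runtime)                   # fastest first
        population = pool[:ns]                           # keep the best ns variants
    return population[0]

# Toy demo: "optimizing" means shortening the string, "runtime" is its length.
best = search_optimize("aaaa", lambda c: c[:-1], len)
print(repr(best))
```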
Please follow the instructions in the `processed_data/` folder to download the dataset for the experiments.

The source code of the other baseline methods is in the `baselines/` folder.
For direct instruction:

```shell
cd baselines
bash direct.sh
```

For in-context learning:

```shell
cd baselines
bash icl.sh
```

For retrieval-augmented generation:

```shell
cd baselines
bash rag.sh
```

For the chain-of-thought prompt:

```shell
cd baselines
bash cot.sh
```
The default scripts run the experiments on all LLMs. You can choose which models to run by commenting out the corresponding code.
We provide more detailed results and case studies in the `results/` folder.