A useful link for USPTO dataset
csyanghan opened this issue · 7 comments
At the same time, could you explain the meaning of Suzuki, or tell me where you collect them.
Thanks for your attention for this project! The suzuki is a specific type of chemical reaction and we have updated the data source in our README: https://github.com/ChemFoundationModels/ChemLLMBench#dataset
This is because the test set are different. In the original paper, they use the whole test set to evaluate. In our work, due the API requesting time and money consumption, we sample 100 examples from the test set to evaluate all models. The 0.482 here is the result for the 100 examples.
Thanks, I see. 100 samples are a bit small (as ~3300 samples in the test set), this may cause bias, and I also suggest you should indicate that in the paper.
Thanks for your suggestion. Actually we have indicated the test set in Table 2 and the "Experiment setup strategy" part in Section 2 (The Evaluation Process and Setting). Due to the high time consumption and fee of the API requesting, we have to reduce the test set to 100. We're currently preparing more tasks and dataset to evaluate LLMs. Once all tasks are finished, we will consider increase the size of the test set.
I am confused by the Scaffold and the Random option in the tables, is it mean that Scaffold is the certain examples (carefully picked) and Random is the random examples from the train set?