ChemFoundationModels/ChemLLMBench

A useful link for USPTO dataset

csyanghan opened this issue · 7 comments

At the same time, could you explain the meaning of Suzuki, or tell me where you collect them.

Thanks for your attention for this project! The suzuki is a specific type of chemical reaction and we have updated the data source in our README: https://github.com/ChemFoundationModels/ChemLLMBench#dataset

Thanks a lot, I find your paper report the BLEU-2 of MolT6-Large in Molecule Captioning is 0.482, however, it is reported as 0.594 in the original paper.
image
image

This is because the test set are different. In the original paper, they use the whole test set to evaluate. In our work, due the API requesting time and money consumption, we sample 100 examples from the test set to evaluate all models. The 0.482 here is the result for the 100 examples.

Thanks, I see. 100 samples are a bit small (as ~3300 samples in the test set), this may cause bias, and I also suggest you should indicate that in the paper.

Thanks for your suggestion. Actually we have indicated the test set in Table 2 and the "Experiment setup strategy" part in Section 2 (The Evaluation Process and Setting). Due to the high time consumption and fee of the API requesting, we have to reduce the test set to 100. We're currently preparing more tasks and dataset to evaluate LLMs. Once all tasks are finished, we will consider increase the size of the test set.

I am confused by the Scaffold and the Random option in the tables, is it mean that Scaffold is the certain examples (carefully picked) and Random is the random examples from the train set?