sci-assess/SciAssess
SciAssess is a comprehensive benchmark for evaluating Large Language Models' proficiency in scientific literature analysis across various fields, focusing on memorization, comprehension, and analysis.
Python
Stargazers
- 545487677 (DP Technology)
- AngxiaoYue (Renmin University of China)
- caic99 (@deepmodeling)
- Caixc97 (Nanjing University)
- cherushui (DP technology)
- ChitandaEru (manga)
- enjoysport2022 (beijing)
- FanmengWang (Nanjing, China)
- Ficere (Beijing)
- guolinke (@dptech-corp)
- Heisenburger2020 (Institute of Computing Technology-pFind)
- HongshuaiWang1
- i2vec
- jiaxianyan (University of Science and Technology of China)
- Linmj-Judy (South China University of Technology)
- lsh0520 (University of Science and Technology of China)
- Naplessss (DPTechnology)
- neverbiasu (🪐)
- Newlcg
- Osiris-Y
- PKUterran
- QizhiPei (Gaoling School of Artificial Intelligence, RUC)
- SchrodingersCattt (Guangzhou)
- TablewareBox (DP Technology; PKU-CCME)
- taichengguo (University of Notre Dame)
- TODESENGEL1116
- tom832
- Type59pro
- wanghan-iapcm
- WhisperingFlame
- wjk376
- yonggeli66
- ysyecust (BS@ECUST, PhD@ECUST)
- yufengwhy
- ZhouGengmo (Renmin University of China)
- ZiyaoLi (Peking University)