zhaochen0110's Stars
pengsida/learning_research
My research experience (personal notes)
mistralai/mistral-finetune
openai/simple-evals
marcotcr/checklist
Beyond Accuracy: Behavioral Testing of NLP models with CheckList
gkamradt/LLMTest_NeedleInAHaystack
Simple retrieval from LLMs at various context lengths to measure accuracy
maitrix-org/llm-reasoners
A library for advanced large language model reasoning
zjunlp/KnowledgeEditingPapers
Must-read Papers on Knowledge Editing for Large Language Models.
OpenLMLab/LEval
[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive evaluation benchmark for long-context language models
RenShuhuai-Andy/TimeChat
[CVPR 2024] TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding
swj0419/detect-pretrain-code
This repository provides an original implementation of "Detecting Pretraining Data from Large Language Models" by *Weijia Shi, *Anirudh Ajith, Mengzhou Xia, Yangsibo Huang, Daogao Liu, Terra Blevins, Danqi Chen, Luke Zettlemoyer.
RUCAIBox/POPE
The official GitHub page for "Evaluating Object Hallucination in Large Vision-Language Models"
Ethan-yt/guwen-models
GuwenModels: A collection of Classical Chinese natural language processing models and related resources gathered from the Internet.
hkust-nlp/llm-compression-intelligence
Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]
TIGER-AI-Lab/MAmmoTH2
Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]
Vance0124/Token-level-Direct-Preference-Optimization
Reference implementation for Token-level Direct Preference Optimization (TDPO)
pillowsofwind/Knowledge-Conflicts-Survey
[EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"
TIGER-AI-Lab/LongICLBench
Code and Data for "Long-context LLMs Struggle with Long In-context Learning"
apple/ml-knowledge-conflicts
Entity-Based Knowledge Conflicts in Question Answering. Code repo for the EMNLP 2021 paper: https://aclanthology.org/2021.emnlp-main.565/
saprmarks/geometry-of-truth
Luckfort/CD
[COLING'25] Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?
Spico197/random-luck
Automatically select the best random seed based on the ancient Chinese I Ching. Good luck and best wishes!
Spico197/MoE-SFT
🍼 Official implementation of "Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts"
zhaochen0110/conflictbank
Code and data for "ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM" (NeurIPS 2024 Track Datasets and Benchmarks)
zhaochen0110/Cotempqa
Code and data for "Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning?" (ACL 2024)
yizhongw/llm-temporal-alignment
Methods and evaluation for aligning language models temporally
AlexWan0/rag-convincingness
EternityYW/TRAM-Benchmark
TRAM: Benchmarking Temporal Reasoning for Large Language Models (Findings of ACL 2024)
zhaochen0110/Timo
Code and data for "Timo: Towards Better Temporal Reasoning for Language Models" (COLM 2024)
ddhruvkr/CONTRADOC
Spico197/feishu-alert-bots