mihara-bot's Stars
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
QwenLM/Qwen2.5
Qwen2.5 is the large language model series developed by the Qwen team at Alibaba Cloud.
THUDM/GLM-4
GLM-4 series: open-source multilingual multimodal chat LMs
togethercomputer/RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
eliahuhorwitz/Academic-project-page-template
A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/
huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
huggingface/nanotron
Minimalistic large language model 3D-parallelism training
mlfoundations/dclm
DataComp for Language Models
huggingface/lighteval
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
CarperAI/OpenELM
Evolution Through Large Models
hkust-nlp/deita
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR 2024]
mlfoundations/open_lm
A repository for research on medium-sized language models.
huggingface/cosmopedia
Code for building Cosmopedia, a large-scale synthetic dataset of textbooks, blog posts, and stories generated with language models.
microsoft/rho
Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.
eth-sri/language-model-arithmetic
Controlled Text Generation via Language Model Arithmetic
tomekkorbak/pretraining-with-human-feedback
Code accompanying the paper Pretraining Language Models with Human Preferences
SALT-NLP/demonstrated-feedback
Code for "Show, Don't Tell: Aligning Language Models with Demonstrated Feedback" (DITTO).
msclar/formatspread
Code accompanying "How I learned to start worrying about prompt formatting".
yifanzhang-pro/AutoMathText
Official implementation of the paper "Autonomous Data Selection with Language Models for Mathematical Texts" (featured in Hugging Face Daily Papers: https://huggingface.co/papers/2402.07625)
locuslab/scaling_laws_data_filtering
Code for "Scaling Laws for Data Filtering: Data Curation Cannot Be Compute Agnostic".
skzhang1/IDEAL
IDEAL: Influence-Driven Selective Annotations Empower In-Context Learners in Large Language Models
cxcscmu/MATES
Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]
adymaharana/d2pruning
D2 Pruning: Message Passing for Balancing Diversity and Difficulty in Data Pruning [ICLR 2024]
UCSB-NLP-Chang/llm_uncertainty
cohere-ai/human-feedback-paper
Code and data from the paper 'Human Feedback is not Gold Standard'
daeveraert/gradient-information-optimization
Implementation of Gradient Information Optimization (GIO) for effective and scalable training data selection
kothasuhas/understanding-forgetting
Understanding Catastrophic Forgetting in Language Models via Implicit Inference
zijian678/TDD
luffy06/ReFusion
[ICLR 2024] ReFusion: Improving Natural Language Understanding with Computation-Efficient Retrieval Representation Fusion
xlhex/acl2024_xicl