whybe-choi

Interested in NLP

KyungHee UniversityIncheon, Republic of Korea

whybe-choi's Stars

huggingface/smol-course
A course on aligning smol models.
Language:Jupyter Notebook3.2k929
academicpages/academicpages.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Language:JavaScript12.7k44.5k
wikibook/llm-finetuning
《한 권으로 끝내는 실전 LLM 파인튜닝》 예제 코드
Language:Jupyter Notebook4
DSBA-Lab/Contrastive-Accumulation
Language:Python71
philschmid/deep-learning-pytorch-huggingface
Language:Jupyter Notebook706166
beir-cellar/beir
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
Language:Python1.7k194
andrewyng/aisuite
Simple, unified interface to multiple Generative AI providers
Language:Python8.3k725
baeseongsu/KoSAIM2024-Clinical-LLM
[KoSAIM 2024 Summer School] Fine-tuning a clinical domain Large Language Model
Language:Jupyter Notebook81
ritaranx/BMRetriever
[EMNLP 2024] This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers".
Language:Python182
cfahlgren1/observers
A Lightweight Library for AI Observability
Language:Python22026
naver/splade
SPLADE: sparse neural search (SIGIR21, SIGIR22)
Language:Python79586
haon-chen/SPEED
Language:Python3
jessevig/bertviz
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Language:Python7k790
embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
Language:Jupyter Notebook2k284
PrithivirajDamodaran/FlashRank
Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & Collaborations.
Language:Python68651
AIR-Bench/AIR-Bench
AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark
Language:Python11010
jxmorris12/cde
code for training & evaluating Contextual Document Embedding models
Language:Python1336
wandb/llm-kr-eval
Language:Python185
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Language:Python20.4k2.6k
LeeSureman/E5-Retrieval-Reproduction
Use contrastive learning to train a large language model (LLM) as a retriever
Language:Python81
hkust-nlp/SynCSE
This is the official implementation of the paper: "Contrastive Learning of Sentence Embeddings from Scratch"
Language:Python375
staoxiao/RetroMAE
Codebase for RetroMAE and beyond.
Language:Python24319
facebookresearch/mexma
MEXMA: Token-level objectives improve sentence representations
Language:Python362
vec2text/vec2text
utilities for decoding deep representations (like sentence embeddings) back to text
Language:Python75385
mrdbourke/simple-local-rag
Build a RAG (Retrieval Augmented Generation) pipeline from scratch and have it all run locally.
Language:Jupyter Notebook542162
mlfoundations/task_vectors
Editing Models with Task Arithmetic
Language:Python43637
songys/huggingface_KoreanDataset
huggingface에 있는 한국어 데이터 세트
22
MinishLab/model2vec
The Fastest State-of-the-Art Static Embeddings in the World
Language:Python50821
worldbank/GISTEmbed
GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings
Language:Python371
HKUDS/LightRAG
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
Language:Python11.5k1.5k

whybe-choi

whybe-choi's Stars

huggingface/smol-course

academicpages/academicpages.github.io

wikibook/llm-finetuning

DSBA-Lab/Contrastive-Accumulation

philschmid/deep-learning-pytorch-huggingface

beir-cellar/beir

andrewyng/aisuite

baeseongsu/KoSAIM2024-Clinical-LLM

ritaranx/BMRetriever

cfahlgren1/observers

naver/splade

haon-chen/SPEED

jessevig/bertviz

embeddings-benchmark/mteb

PrithivirajDamodaran/FlashRank

AIR-Bench/AIR-Bench

jxmorris12/cde

wandb/llm-kr-eval

microsoft/unilm

LeeSureman/E5-Retrieval-Reproduction

hkust-nlp/SynCSE

staoxiao/RetroMAE

facebookresearch/mexma

vec2text/vec2text

mrdbourke/simple-local-rag

mlfoundations/task_vectors

songys/huggingface_KoreanDataset

MinishLab/model2vec

worldbank/GISTEmbed

HKUDS/LightRAG