yoonene's Stars
NVlabs/EAGLE
EAGLE: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
VITA-MLLM/VITA
✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM
Marker-Inc-Korea/AutoRAG
RAG AutoML Tool - Find optimal RAG pipeline for your own data.
NeumTry/NeumAI
Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.
explodinggradients/ragas
Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines
microsoft/LLMLingua
To speed up LLM inference and enhance the model's perception of key information, LLMLingua compresses the prompt and KV-Cache, achieving up to 20x compression with minimal performance loss.
NomaDamas/awesome-korean-llm
Awesome list of Korean Large Language Models.
Bing-su/train_ml_clip
jaketae/koclip
KoCLIP: Korean port of OpenAI CLIP, in Flax
instructkr/LogicKor
Multi-domain reasoning benchmark for Korean language models
stitionai/devika
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.
huggingface/text-generation-inference
Large Language Model Text Generation Inference
michaelfeil/infinity
Infinity is a high-throughput, low-latency REST API for serving text embeddings, reranking models, and CLIP
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
microsoft/ReACC
Source code for the paper "ReACC: A Retrieval-Augmented Code Completion Framework"
RasaHQ/rasa
💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
kongds/scaling_sentemb
Scaling Sentence Embeddings with Large Language Models
khanrc/honeybee
Official implementation of project Honeybee (CVPR 2024)
skywalker023/sodaverse
🥤🧑🏻‍🚀 Code and dataset for our EMNLP 2023 paper - "SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization"
oobabooga/text-generation-webui
A Gradio web UI for Large Language Models.
kh-kim/arxiv-translator
lovit/soynlp
A Python library for Korean natural language processing. Provides word extraction, tokenization, part-of-speech tagging, and preprocessing.
dair-ai/ML-Papers-of-the-Week
🔥Highlighting the top ML papers every week.
Kikyo-16/Sound_event_detection
This code targets weakly-labeled semi-supervised sound event detection. It implements the methods we proposed for this task: specialized decision surface (SDS) and disentangled feature (DF) for weakly-supervised learning, and guided learning (GL) for semi-supervised learning. We would be glad to see it used for research purposes or DCASE participation.
EleutherAI/dps
Data processing system for polyglot
hyunwoongko/nlp-datasets
Curation note of NLP datasets
kakaobrain/trident
A performance library for machine learning applications.
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
espnet/espnet
End-to-End Speech Processing Toolkit