swoook's Stars
run-llama/llama_index
LlamaIndex is a data framework for your LLM applications
google-research/google-research
Google Research
chenfei-wu/TaskMatrix
sebastianruder/NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
huggingface/text-generation-inference
Large Language Model Text Generation Inference
FMInference/FlexiGen
Running large language models on a single GPU for throughput-oriented scenarios.
getmoto/moto
A library that allows you to easily mock out tests based on AWS infrastructure.
MineDojo/Voyager
An Open-Ended Embodied Agent with Large Language Models
salesforce/CodeGen
CodeGen is a family of open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.
lancedb/lance
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming..
salesforce/CodeT5
Home of CodeT5: Open Code LLMs for Code Understanding and Generation
WassimTenachi/PhySO
Physical Symbolic Optimization
Beomi/KoAlpaca
KoAlpaca: 한국어 명령어를 이해하는 오픈소스 언어모델 (KoAlpaca: An open-source language model to understand Korean instructions)
IntelLabs/fastRAG
Efficient Retrieval Augmentation and Generation Framework
OpenLMLab/MOSS-RLHF
Secrets of RLHF in Large Language Models Part I: PPO
NVlabs/prismer
The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".
eagle705/pytorch-bert-crf-ner
KoBERT와 CRF로 만든 한국어 개체명인식기 (BERT+CRF based Named Entity Recognition model for Korean)
WooilJeong/PublicDataReader
공공 데이터 조회를 위한 오픈소스 파이썬 라이브러리
huggingface/large_language_model_training_playbook
An open collection of implementation tips, tricks and resources for training large language models
pablomarin/GPT-Azure-Search-Engine
Azure Cognitive Search + Azure OpenAI Accelerator
josw123/dart-fss
한국 금융감독원에서 운영하는 다트(Dart) 시스템 크롤링을 위한 라이브러리
tabtoyou/KoLLaVA
KoLLaVA: Korean Large Language-and-Vision Assistant (feat.LLaVA)
monologg/KoBigBird
🦅 Pretrained BigBird Model for Korean (up to 4096 tokens)
kimwoonggon/publicservant_AI
김웅곤 - 텐서플로우와 케라스로 구현한 NLP 기초 (2020년 버전)
cosmoquester/2021-dialogue-summary-competition
[2021 훈민정음 한국어 음성•자연어 인공지능 경진대회] 대화요약 부문 알라꿍달라꿍 팀의 대화요약 학습 및 추론 코드를 공유하기 위한 레포입니다.
monologg/KoBERT-NER
NER Task with KoBERT (with Naver NLP Challenge dataset)
SKplanet/Dialog-KoELECTRA
ELECTRA기반 한국어 대화체 언어모델
songys/entity
날짜, 장소, 사람, 기관, 시간
toriving/naver-nlp-challenge-2018
Named Entity Recognition Model for Naver NLP Challenge 2018 : BiLSTM-CRF model based Korean named entity tagger
warnikchow/omniKSA
Speech Act and its Analysis for the (spoken) Korean Language: An Omnibus Description