Nguyendat-bit's Stars
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
albumentations-team/albumentations
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Raphire/Win11Debloat
A simple, easy-to-use PowerShell script that removes pre-installed apps from Windows, disables telemetry, removes Bing from Windows search, and makes various other changes to declutter and improve your Windows experience. The script works on both Windows 10 and Windows 11.
RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
QwenLM/Qwen2.5
Qwen2.5 is the large language model series developed by the Qwen team, Alibaba Cloud.
Unstructured-IO/unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
WooooDyy/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
langfuse/langfuse
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
dsdanielpark/Bard-API
An unofficial Python package that returns Google Bard responses using a cookie value.
HIT-SCIR/ltp
Language Technology Platform
opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
nmslib/nmslib
Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-metric spaces.
huggingface/safetensors
Simple, safe way to store and distribute tensors
DefTruth/Awesome-LLM-Inference
📖 A curated list of Awesome LLM Inference papers with code, such as FlashAttention, PagedAttention, Parallelism, etc. 🎉🎉
kpu/kenlm
KenLM: Faster and Smaller Language Model Queries
allenai/longformer
Longformer: The Long-Document Transformer
microsoft/DeBERTa
The implementation of DeBERTa
zjukg/KG-LLM-Papers
[Paper List] Papers integrating knowledge graphs (KGs) and large language models (LLMs)
chakki-works/seqeval
A Python framework for sequence labeling evaluation (named-entity recognition, POS tagging, etc.)
Xnhyacinth/Awesome-LLM-Long-Context-Modeling
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
HillZhang1999/llm-hallucination-survey
Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"
luyug/GradCache
Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint
fferrin/free-medium
An extension to read Medium posts for free
project-miracl/miracl
A large-scale multilingual dataset for information retrieval, with thorough human annotations across 18 diverse languages.
dwzhu-pku/LongEmbed
LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)
shizhediao/R-Tuning
[NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't Know'"
worldbank/GISTEmbed
GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings
anti-aii/RagE
RagE (RAG Engine) - A tool supporting the construction and training of components of the Retrieval-Augmented-Generation (RAG) model. It also facilitates the rapid development of Q&A systems and chatbots following the RAG model.
Nguyendat-bit/qa_information_utt
My university graduation thesis on building an automated information question-answering system for the University of Transport Technology (UTT)