ColdFusion2001's Stars
KindXiaoming/pykan
Kolmogorov-Arnold Networks
mshumer/gpt-prompt-engineer
NirDiamant/RAG_Techniques
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and contextually rich responses.
ridgerchu/matmulfreellm
Implementation for MatMul-free LM.
mintisan/awesome-kan
A comprehensive collection of KAN (Kolmogorov-Arnold Network) resources, including libraries, projects, tutorials, papers, and more, for researchers and developers in the Kolmogorov-Arnold Network field.
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
argilla-io/distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
RUC-NLPIR/FlashRAG
⚡FlashRAG: A Python Toolkit for Efficient RAG Research
stanfordnlp/pyreft
ReFT: Representation Finetuning for Language Models
google-ai-edge/model-explorer
A modern model graph visualizer and debugger
OpenLMLab/LOMO
LOMO: LOw-Memory Optimization
FullStackRetrieval-com/RetrievalTutorials
kuleshov-group/llmtools
Finetuning Large Language Models on One Consumer GPU in Under 4 Bits
AdityaNG/kan-gpt
The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold Networks (KANs) for language modeling
NVIDIA/NeMo-Aligner
Scalable toolkit for efficient model alignment
sangmichaelxie/doremi
PyTorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets
deep-diver/llamaduo
This project showcases an LLMOps pipeline that fine-tunes a small LLM to prepare for an outage of the service LLM.
xjtushujun/meta-weight-net
NeurIPS'19: Meta-Weight-Net: Learning an Explicit Mapping For Sample Weighting (PyTorch implementation for noisy labels).
michaelhodel/arc-dsl
Domain Specific Language for the Abstraction and Reasoning Corpus
yuchenlin/ZeroEval
A simple unified framework for evaluating LLMs
CASIA-LM/MoDS
bminixhofer/zett
Code for Zero-Shot Tokenizer Transfer
jxbz/modula
Scalable neural net training via automatic normalization in the modular norm.
CG80499/KAN-GPT-2
Training small GPT-2 style models using Kolmogorov-Arnold networks.
logix-project/logix
AI Logging for Interpretability and Explainability 🔬
KhoomeiK/complexity-scaling
gzip Predicts Data-dependent Scaling Laws
GeneZC/MiniMoE
Code for ACL 2023 paper titled "Lifting the Curse of Capacity Gap in Distilling Language Models"
Lemon-cmd/energy-transformer-torch
Official Implementation of Energy Transformer in PyTorch for Mask Image Reconstruction
krypticmouse/matryoshka-representation-learning
PyTorch implementation for MRL
alexandertheus/Intra-Fusion
Towards Meta-Pruning via Optimal Transport, ICLR 2024 (Spotlight)