Joshua-Ren's Stars
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods on single- and multi-node GPU setups. Supports default & custom datasets for applications such as summarization and Q&A, and a number of candidate inference solutions (HF TGI, vLLM) for local or cloud deployment. Includes demo apps showcasing Meta Llama on WhatsApp & Messenger.
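A minimal sketch of the parameter-efficient fine-tuning these recipes revolve around, written against the Hugging Face `transformers` and `peft` libraries rather than the repo's own launcher; the checkpoint name and LoRA hyperparameters are illustrative assumptions.

```python
# Minimal LoRA fine-tuning setup (illustrative; not the llama-recipes launcher).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

model_name = "meta-llama/Llama-2-7b-hf"  # assumption: any causal LM checkpoint works here
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.bfloat16)

# Wrap the base model with low-rank adapters; only the adapter weights are trained.
lora_config = LoraConfig(
    r=8,
    lora_alpha=16,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # typical attention projections in Llama-style models
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # reports the small fraction of weights being updated
```

The adapter-wrapped model can then be handed to a standard training loop or FSDP setup on the chosen dataset.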
voxel51/fiftyone
Refine high-quality datasets and visual AI models
thuml/Time-Series-Library
A Library for Advanced Deep Time Series Models.
WooooDyy/LLM-Agent-Paper-List
Paper list accompanying the 86-page survey "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
princeton-nlp/tree-of-thought-llm
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
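The method is essentially a search over partial "thoughts" proposed and scored by an LLM; a minimal sketch of the breadth-first variant, with the LLM calls stubbed out as hypothetical placeholders (`propose_thoughts`, `score_thought`, not functions from the repo):

```python
# Sketch of the breadth-first search loop at the core of Tree of Thoughts.
from typing import Callable, List

def tot_bfs(problem: str,
            propose_thoughts: Callable[[str, str], List[str]],
            score_thought: Callable[[str, str], float],
            depth: int = 3,
            beam_width: int = 5) -> str:
    frontier = [""]  # partial solutions ("thoughts so far")
    for _ in range(depth):
        candidates = []
        for state in frontier:
            # An LLM proposes several next-step thoughts for each partial solution.
            for thought in propose_thoughts(problem, state):
                candidates.append(state + thought)
        # An LLM (or heuristic) scores candidates; keep only the most promising ones.
        candidates.sort(key=lambda s: score_thought(problem, s), reverse=True)
        frontier = candidates[:beam_width]
    return frontier[0]
```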
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
EleutherAI/pythia
The hub for EleutherAI's work on interpretability and learning dynamics
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
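The DPO objective itself is compact enough to sketch directly from the published formula; the tensor names and `beta` value below are illustrative, not the repo's exact code:

```python
# Sketch of the DPO loss: push the policy to prefer the chosen response over the
# rejected one by more than the frozen reference model does.
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps, policy_rejected_logps,
             ref_chosen_logps, ref_rejected_logps, beta=0.1):
    """Each argument is a tensor of per-example sequence log-probabilities,
    log pi(y | x) summed over tokens, under the policy or the reference."""
    # Implicit reward of each response: how much more the policy likes it than the reference does.
    chosen_rewards = beta * (policy_chosen_logps - ref_chosen_logps)
    rejected_rewards = beta * (policy_rejected_logps - ref_rejected_logps)
    # Maximize the log-sigmoid of the margin between chosen and rejected rewards.
    return -F.logsigmoid(chosen_rewards - rejected_rewards).mean()
```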
cure-lab/LTSF-Linear
[AAAI-23 Oral] Official implementation of the paper "Are Transformers Effective for Time Series Forecasting?"
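The paper's linear baselines (e.g. DLinear) decompose the input into trend and remainder and forecast each with a single linear layer over time; a minimal sketch under assumed shapes and kernel size, not the repo's exact code:

```python
# DLinear-style baseline: moving-average decomposition + one linear layer per component.
import torch
import torch.nn as nn

class DLinearSketch(nn.Module):
    def __init__(self, seq_len: int, pred_len: int, kernel_size: int = 25):
        super().__init__()
        self.moving_avg = nn.AvgPool1d(kernel_size, stride=1,
                                       padding=kernel_size // 2,
                                       count_include_pad=False)
        self.trend_linear = nn.Linear(seq_len, pred_len)
        self.seasonal_linear = nn.Linear(seq_len, pred_len)

    def forward(self, x):                      # x: (batch, seq_len, channels)
        x = x.transpose(1, 2)                  # (batch, channels, seq_len)
        trend = self.moving_avg(x)             # smoothed trend component
        seasonal = x - trend                   # remainder component
        out = self.trend_linear(trend) + self.seasonal_linear(seasonal)
        return out.transpose(1, 2)             # (batch, pred_len, channels)
```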
thuml/Autoformer
Code release for "Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting" (NeurIPS 2021), https://arxiv.org/abs/2106.13008
yuqinie98/PatchTST
An official implementation of PatchTST: "A Time Series is Worth 64 Words: Long-term Forecasting with Transformers." (ICLR 2023) https://arxiv.org/abs/2211.14730
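The "64 words" of the title come from slicing each channel into patches that are embedded like tokens; a minimal sketch of that patching step, with patch length, stride, and model width as assumptions:

```python
# Patching step behind PatchTST: cut each channel's series into overlapping
# patches and project each patch to a token embedding.
import torch
import torch.nn as nn

seq_len, patch_len, stride, d_model = 336, 16, 8, 128
x = torch.randn(32, 7, seq_len)          # (batch, channels, time)

patches = x.unfold(-1, patch_len, stride)  # (batch, channels, n_patches, patch_len)
embed = nn.Linear(patch_len, d_model)
tokens = embed(patches)                    # (batch, channels, n_patches, d_model)
print(tokens.shape)                        # each channel becomes a short "sentence" of patch tokens
```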
uclaml/SPIN
The official implementation of Self-Play Fine-Tuning (SPIN)
ContextualAI/HALOs
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
MAZiqing/FEDformer
mims-harvard/TFC-pretraining
Self-supervised contrastive learning for time series via time-frequency consistency
princeton-nlp/LESS
[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning
djsutherland/arxiv-collector
A little Python script to collect LaTeX sources for upload to the arXiv.
LLaMafia/llamafia.github
scienceetonnante/grokking
KindXiaoming/Omnigrok
Omnigrok: Grokking Beyond Algorithmic Data
shuyhere/about-super-alignment
Feeling confused about super alignment? Here is a reading list
linlu-qiu/lm-inductive-reasoning
tommccoy1/embers-of-autoregression
carlguo866/circle-survival
Code repository for "Survival of the Fittest Representation: A Case Study with Modular Addition."
Shawn-Guo-CN/SFT_function_learning
ServiceNow/THANOS
This ANOmaly is Synthetic - A Timeseries Recipe Data Generator
Joshua-Ren/Learning_dynamics_LLM
Joshua-Ren/iICL