Pinned Repositories
Adan
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
agentflow
Complex LLM Workflows from Simple JSON.
AgentTuning
AgentTuning: Enabling Generalized Agent Abilities for LLMs
AGIEval
alignment-handbook
Robust recipes to align language models with human and AI preferences
Aquila2
The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.
FlagAI
FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale models.
llama.cpp
Port of Facebook's LLaMA model in C/C++
llmkit
Megatron-LM
Ongoing research training transformer models at scale
ftgreat's Repositories
ftgreat/awesome-ssm-ml
ftgreat/ComfyUI
The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.
ftgreat/deita
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]
ftgreat/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
ftgreat/Firefly
Firefly (流萤): a Chinese conversational large language model
ftgreat/FlagScale
FlagScale is a Large Language Model (LLM) toolkit based on open-sourced projects.
ftgreat/how-to-optim-algorithm-in-cuda
How to optimize some algorithms in CUDA.
ftgreat/Latte
Latte: Latent Diffusion Transformer for Video Generation.
ftgreat/LESS
Preprint: LESS: Selecting Influential Data for Targeted Instruction Tuning
ftgreat/LLaMA-Factory
Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)
ftgreat/LLaMA-Pro
Progressive LLaMA with Block Expansion.
ftgreat/llm-foundry
LLM training code for MosaicML foundation models
ftgreat/llm-random
ftgreat/LLMTest_NeedleInAHaystack
Doing simple retrieval from LLMs at various context lengths to measure accuracy
ftgreat/megalodon
Reference implementation of Megalodon 7B model
ftgreat/MiniCPM
MiniCPM-2.4B: An end-side LLM that outperforms Llama2-13B.
ftgreat/modelzoo
ftgreat/OLMo
Modeling, training, eval, and inference code for OLMo
ftgreat/Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model), but we have only limited resources. We sincerely hope the whole open-source community can contribute to this project.
ftgreat/open_clip
An open source implementation of CLIP.
ftgreat/QAnything
Question and Answer based on Anything.
ftgreat/quiet-star
Code for Quiet-STaR
ftgreat/QuRating
Select LM Training Data Based on Qualitative Aspects of Text
ftgreat/qwen-vllm
Tongyi Qianwen (Qwen) vLLM inference deployment demo
ftgreat/Qwen1.5
Qwen1.5 is the improved version of Qwen, the large language model series developed by Qwen team, Alibaba Cloud.
ftgreat/SiT
Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"
ftgreat/stable-diffusion
A latent text-to-image diffusion model
ftgreat/stable-weight-decay-regularization
[NeurIPS 2023] The PyTorch Implementation of Scheduled (Stable) Weight Decay.
ftgreat/transformer-debugger
ftgreat/Video-LLaVA
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection