ftgreat

Pinned Repositories

Adan
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
Language: Python
agentflow
Complex LLM Workflows from Simple JSON.
Language: Python
AgentTuning
AgentTuning: Enabling Generalized Agent Abilities for LLMs
Language: Python
AGIEval
Language: Python
alignment-handbook
Robust recipes to align language models with human and AI preferences
Language: Python
Aquila2
The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.
Language: Python
FlagAI
FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale models.
Language: Python
llama.cpp
Port of Facebook's LLaMA model in C/C++
Language: C
llmkit
Language: Python
Megatron-LM
Ongoing research training transformer models at scale
Language: Python

ftgreat's Repositories

ftgreat/awesome-ssm-ml
ftgreat/ComfyUI
The most powerful and modular Stable Diffusion GUI, API and backend with a graph/nodes interface.
ftgreat/deita
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]
Language: Python
ftgreat/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
ftgreat/Firefly
Firefly (流萤): a Chinese conversational large language model
Language: Python
ftgreat/FlagScale
FlagScale is a Large Language Model (LLM) toolkit based on open-sourced projects.
Language: Python
ftgreat/how-to-optim-algorithm-in-cuda
How to optimize some algorithms in CUDA.
ftgreat/Latte
Latte: Latent Diffusion Transformer for Video Generation.
ftgreat/LESS
Preprint: Less: Selecting Influential Data for Targeted Instruction Tuning
ftgreat/LLaMA-Factory
Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)
Language: Python
ftgreat/LLaMA-Pro
Progressive LLaMA with Block Expansion.
ftgreat/llm-foundry
LLM training code for MosaicML foundation models
Language: Python
ftgreat/llm-random
ftgreat/LLMTest_NeedleInAHaystack
Doing simple retrieval from LLM models at various context lengths to measure accuracy
ftgreat/megalodon
Reference implementation of Megalodon 7B model
ftgreat/MiniCPM
MiniCPM-2.4B: An end-side LLM that outperforms Llama2-13B.
ftgreat/modelzoo
Language: Python
ftgreat/OLMo
Modeling, training, eval, and inference code for OLMo
ftgreat/Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model), but we have only limited resources. We sincerely hope the whole open-source community can contribute to this project.
Language: Jupyter Notebook
ftgreat/open_clip
An open source implementation of CLIP.
Language: Jupyter Notebook
ftgreat/QAnything
Question and Answer based on Anything.
ftgreat/quiet-star
Code for Quiet-STaR
ftgreat/QuRating
Select LM Training Data Based on Qualitative Aspects of Text
Language: Python
ftgreat/qwen-vllm
Tongyi Qianwen (Qwen) vLLM inference deployment demo
ftgreat/Qwen1.5
Qwen1.5 is the improved version of Qwen, the large language model series developed by Qwen team, Alibaba Cloud.
ftgreat/SiT
Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"
ftgreat/stable-diffusion
A latent text-to-image diffusion model
Language: Jupyter Notebook
ftgreat/stable-weight-decay-regularization
[NeurIPS 2023] The PyTorch Implementation of Scheduled (Stable) Weight Decay.
Language: Python
ftgreat/transformer-debugger
ftgreat/Video-LLaVA
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection