hxixixh's Stars
f/awesome-chatgpt-prompts
A curated collection of prompts for using ChatGPT more effectively.
CompVis/stable-diffusion
A latent text-to-image diffusion model
meta-llama/llama
Inference code for Llama models
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
run-llama/llama_index
LlamaIndex is a data framework for your LLM applications
google/styleguide
Style guides for Google-originated open-source projects
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supports a number of candidate inference solutions, such as HF TGI and vLLM, for local or cloud deployment. Includes demo apps that showcase Meta Llama for WhatsApp & Messenger.
state-spaces/mamba
Mamba SSM architecture
openai/consistency_models
Official repo for consistency models.
hyunwoongko/transformer
Transformer: PyTorch Implementation of "Attention Is All You Need"
NVlabs/edm
Elucidating the Design Space of Diffusion-Based Generative Models (EDM)
gnobitab/InstaFlow
⚡ InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)
princeton-nlp/MeZO
[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333
facebookresearch/LaViLa
Code release for "Learning Video Representations from Large Language Models"
Cranial-XIX/llm-pddl
zhaoyue-zephyrus/AVION
[arXiv:2309.16669] Code release for "Training a Large Video Model on a Single Machine in a Day"
Cranial-XIX/CAGrad
Official PyTorch Implementation for Conflict-Averse Gradient Descent (CAGrad)
zihuixue/DynMM
Code for the paper 'Dynamic Multimodal Fusion'
sangyun884/rfpp
The codebase of our paper "Improving the Training of Rectified Flows", NeurIPS 2024
ZO-Bench/ZO-LLM
[ICML 2024] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark".
Cranial-XIX/longhorn
Official PyTorch Implementation of the Longhorn Deep State Space Model
google-deepmind/asyncdiloco
hxixixh/AdaFlow
Official PyTorch implementation of AdaFlow
zihuixue/MFH
[ICLR 23 oral] The Modality Focusing Hypothesis: Towards Understanding Crossmodal Knowledge Distillation
facebookresearch/VidOSC
Code and data release for the paper "Learning Object State Changes in Videos: An Open-World Perspective" (CVPR 2024)
facebookresearch/EgoT2
Code release for the paper "Egocentric Video Task Translation" (CVPR 2023 Highlight)
mbreuss/consistency_models_toy_task
Unofficial minimal implementation of consistency models (CM), proposed by Song et al. (2023), on a 1D toy task in PyTorch
Cranial-XIX/metric-residual-network
Official PyTorch Implementation for Metric Residual Networks for Sample Efficient Goal-Conditioned Reinforcement Learning
zihuixue/AlignEgoExo
Code and data release for the paper "Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Alignment" (NeurIPS 2023)
Cranial-XIX/longhorn_cuda
CUDA kernels for the Longhorn Architecture