Pinned Repositories
baconian-project
Model-based Reinforcement Learning Framework
Cola
[NeurIPS2023] Official implementation of the paper "Large Language Models are Visual Reasoning Coordinators"
CSVAL
[MIDL 2023] Official Imeplementation of "Making Your First Choice: To Address Cold Start Problem in Vision Active Learning"
llama3
the main Llama 3 GitHub site - will be moved under Meta-Llama
med_eval
NTU_FYP_latex_template
NTU FYP template by Latex
random_hacks
Random hacks that I need to keep happy
ShiroDarumas
video-diffusion
Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
cliangyu's Repositories
cliangyu/Cola
[NeurIPS2023] Official implementation of the paper "Large Language Models are Visual Reasoning Coordinators"
cliangyu/CSVAL
[MIDL 2023] Official Imeplementation of "Making Your First Choice: To Address Cold Start Problem in Vision Active Learning"
cliangyu/random_hacks
Random hacks that I need to keep happy
cliangyu/gpt4v_api
cliangyu/llama3
the main Llama 3 GitHub site - will be moved under Meta-Llama
cliangyu/med_eval
cliangyu/aibrowser
cliangyu/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
cliangyu/cliangyu
cliangyu/CSVALv2
cliangyu/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
cliangyu/Emu
Emu: An Open Multimodal Generalist
cliangyu/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
cliangyu/init-pools-dal
cliangyu/litellm
Call all LLM APIs using the OpenAI format. Use Azure, OpenAI, Cohere, Anthropic, Ollama, VLLM, Sagemaker, HuggingFace, Replicate (100+ LLMs)
cliangyu/llama-recipes
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.
cliangyu/LLaVA
Visual Instruction Tuning: Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.
cliangyu/LLMSpeculativeSampling
Fast inference from large lauguage models via speculative decoding
cliangyu/megablocks
cliangyu/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
cliangyu/mmina
cliangyu/mot
cliangyu/OFA
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
cliangyu/open-instruct
cliangyu/open_flamingo
An open-source framework for training large multimodal models.
cliangyu/optimum
🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools
cliangyu/toolformer
cliangyu/visitor-badge
A badge generator service to count visitors of your markdown file.
cliangyu/visual-chatgpt
VisualChatGPT
cliangyu/yang-song.github.io
Personal website