kugwzk

think twice, code once

kugwzk's Stars

AIoT-MLSys-Lab/Efficient-LLMs-Survey
[TMLR 2024] Efficient Large Language Models: A Survey
77264
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python17.4k1.9k
locuslab/fast_adversarial
[ICLR 2020] A repository for extremely fast adversarial training using FGSM
Language:Python41793
hemingkx/Spec-Bench
Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)
Language:Python938
NUS-HPC-AI-Lab/Neural-Network-Parameter-Diffusion
We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters
Language:Python77338
LargeWorldModel/LWM
Language:Python6.9k536
openai/dalle3-eval-samples
Text-to-image samples collected for the evaluation of DALL-E 3 in the whitepaper.
498
facebookresearch/jepa
PyTorch code and models for V-JEPA self-supervised learning from video.
Language:Python2.5k232
mmathew23/improved_edm
Implementation of "Analyzing and Improving the Training Dynamics of Diffusion Models"
Language:Python791
deepseek-ai/DeepSeek-Math
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Language:Python65435
allenai/OLMo-Eval
Evaluation suite for LLMs
Language:Python27029
allenai/OLMo
Modeling, training, eval, and inference code for OLMo
Language:Python4.1k385
SqueezeAILab/KVQuant
KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
Language:Python21817
InternLM/InternLM-XComposer
InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.
Language:Python1.8k121
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Language:Jupyter Notebook13.9k1.3k
XuandongZhao/weak-to-strong
Weak-to-Strong Jailbreaking on Large Language Models
Language:Python496
declare-lab/instruct-eval
This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.
Language:Python48038
Aleph-Alpha/NeurIPS-WANT-submission-efficient-parallelization-layouts
Language:Python21
ddupont808/GPT-4V-Act
AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI
Language:JavaScript90084
kevin-ssy/CLIP_as_RNN
681
OpenGVLab/MM-Interleaved
MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer
Language:Python16210
apple/ml-aim
This repository provides the code and model checkpoints of the research paper: Scalable Pre-training of Large Autoregressive Image Models
Language:Python65341
google-deepmind/alphageometry
Language:Python3.8k408
sgl-project/sglang
SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.
Language:Python2.6k166
microsoft/Cream
This is a collection of our NAS and Vision Transformer work.
Language:Python1.6k219
kyegomez/MultiModalMamba
A novel implementation of fusing ViT with Mamba into a fast, agile, and high performance Multi-Modal Model. Powered by Zeta, the simplest AI framework ever.
Language:Python41221
mistralai/mistral-inference
Official inference library for Mistral models
Language:Jupyter Notebook9k789
huggingface/open-muse
Open reproduction of MUSE for fast text2image generation.
Language:Python29422
huggingface/amused
Language:Python714
FuxiaoLiu/LRV-Instruction
[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
Language:Python22514

kugwzk

kugwzk's Stars

AIoT-MLSys-Lab/Efficient-LLMs-Survey

haotian-liu/LLaVA

locuslab/fast_adversarial

hemingkx/Spec-Bench

NUS-HPC-AI-Lab/Neural-Network-Parameter-Diffusion

LargeWorldModel/LWM

openai/dalle3-eval-samples

facebookresearch/jepa

mmathew23/improved_edm

deepseek-ai/DeepSeek-Math

allenai/OLMo-Eval

allenai/OLMo

SqueezeAILab/KVQuant

InternLM/InternLM-XComposer

IDEA-Research/Grounded-Segment-Anything

XuandongZhao/weak-to-strong

declare-lab/instruct-eval

Aleph-Alpha/NeurIPS-WANT-submission-efficient-parallelization-layouts

ddupont808/GPT-4V-Act

kevin-ssy/CLIP_as_RNN

OpenGVLab/MM-Interleaved

apple/ml-aim

google-deepmind/alphageometry

sgl-project/sglang

microsoft/Cream

kyegomez/MultiModalMamba

mistralai/mistral-inference

huggingface/open-muse

huggingface/amused

FuxiaoLiu/LRV-Instruction