pbw-Berwin's Stars
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools so that you can focus on what matters.
facebookresearch/segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
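A minimal point-prompt inference sketch using the repo's SamPredictor interface (the checkpoint path and input image below are placeholders, not part of the repo):

```python
import numpy as np
from segment_anything import SamPredictor, sam_model_registry

# Load a ViT-H SAM checkpoint downloaded from the repo's links (path assumed).
sam = sam_model_registry["vit_h"](checkpoint="sam_vit_h.pth")
predictor = SamPredictor(sam)

image = np.zeros((480, 640, 3), dtype=np.uint8)  # stand-in for an RGB image (HWC, uint8)
predictor.set_image(image)

# One foreground point prompt at pixel (x=320, y=240); label 1 marks foreground.
masks, scores, logits = predictor.predict(
    point_coords=np.array([[320, 240]]),
    point_labels=np.array([1]),
    multimask_output=True,  # return several candidate masks with quality scores
)
```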
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
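A minimal training sketch around deepspeed.initialize (normally run via the deepspeed launcher; the model and config values here are illustrative, not a recommended setup):

```python
import torch
import deepspeed

model = torch.nn.Linear(1024, 1024)  # stand-in model
ds_config = {
    "train_micro_batch_size_per_gpu": 8,
    "optimizer": {"type": "Adam", "params": {"lr": 1e-4}},
    "zero_optimization": {"stage": 2},  # ZeRO stage 2, one example choice
}

# DeepSpeed wraps the model and optimizer into a single engine.
engine, optimizer, _, _ = deepspeed.initialize(
    model=model, model_parameters=model.parameters(), config=ds_config
)

x = torch.randn(8, 1024).to(engine.device)
loss = engine(x).pow(2).mean()
engine.backward(loss)  # the engine handles loss scaling and ZeRO bookkeeping
engine.step()
```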
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
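A minimal offline-generation sketch with vLLM's Python API (the model id is just a small example; any HF-compatible checkpoint works):

```python
from vllm import LLM, SamplingParams

llm = LLM(model="facebook/opt-125m")  # example model id
params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=64)

outputs = llm.generate(["The capital of France is"], params)
for out in outputs:
    print(out.outputs[0].text)
```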
tloen/alpaca-lora
Instruct-tune LLaMA on consumer hardware
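Not the repo's finetune.py, but a sketch of the HuggingFace PEFT setup it builds on (the base-model id is an assumption; the target modules mirror the repo's defaults):

```python
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

model = AutoModelForCausalLM.from_pretrained("huggyllama/llama-7b")  # assumed LLaMA checkpoint id
lora = LoraConfig(
    r=8,
    lora_alpha=16,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # attention projections, as in the repo's defaults
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora)
model.print_trainable_parameters()  # only the low-rank adapters train; the base model stays frozen
```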
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
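A minimal call sketch for flash_attn_func; the library expects (batch, seqlen, nheads, headdim) tensors in fp16/bf16 on a CUDA device:

```python
import torch
from flash_attn import flash_attn_func

q = torch.randn(2, 1024, 8, 64, dtype=torch.float16, device="cuda")
k = torch.randn(2, 1024, 8, 64, dtype=torch.float16, device="cuda")
v = torch.randn(2, 1024, 8, 64, dtype=torch.float16, device="cuda")

# Exact (not approximate) attention computed by a fused, IO-aware kernel.
out = flash_attn_func(q, k, v, causal=True)
```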
mistralai/mistral-inference
Official inference library for Mistral models
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
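Loading a pretrained DINOv2 backbone via torch.hub, as the README describes (the random tensor stands in for a normalized image batch):

```python
import torch

model = torch.hub.load("facebookresearch/dinov2", "dinov2_vits14")
model.eval()

x = torch.randn(1, 3, 224, 224)  # 224 is divisible by the 14-pixel patch size
with torch.no_grad():
    feats = model(x)  # global image embedding (CLS-token features)
print(feats.shape)    # torch.Size([1, 384]) for ViT-S/14
```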
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
OpenBMB/MiniCPM
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
pytorch-labs/gpt-fast
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
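Not gpt-fast's code, but a tiny greedy-decoding loop showing the kind of plain-PyTorch generation the repo streamlines (it adds KV caching, torch.compile, and int8/int4 quantization on top of loops like this):

```python
import torch

@torch.no_grad()
def greedy_decode(model, tokens, max_new_tokens):
    # tokens: (batch, seq) int64 prompt; model is assumed to map token ids
    # to next-token logits of shape (batch, seq, vocab).
    for _ in range(max_new_tokens):
        logits = model(tokens)
        next_tok = logits[:, -1].argmax(dim=-1, keepdim=True)
        tokens = torch.cat([tokens, next_tok], dim=1)
    return tokens
```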
aimhubio/aim
Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.
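A minimal metric-tracking sketch with Aim's Run API (the experiment name and values are made up):

```python
from aim import Run

run = Run(experiment="demo")
run["hparams"] = {"lr": 1e-3, "batch_size": 32}  # log hyperparameters on the run

for step in range(100):
    loss = 1.0 / (step + 1)  # stand-in for a real training loss
    run.track(loss, name="loss", step=step, context={"subset": "train"})
```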
OpenGVLab/InternGPT
InternGPT (iGPT) is an open-source demo platform where you can easily showcase your AI models. It now supports DragGAN, ChatGPT, ImageBind, GPT-4-style multimodal chat, SAM, interactive image editing, and more. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM).
databricks/megablocks
A light-weight library for mixture-of-experts (MoE) training.
horseee/Awesome-Efficient-LLM
A curated list for Efficient Large Language Models
ibm-granite/granite-code-models
Granite Code Models: A Family of Open Foundation Models for Code Intelligence
microsoft/tutel
Tutel MoE: An Optimized Mixture-of-Experts Implementation
sail-sg/lorahub
[COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition
princeton-nlp/LLM-Shearing
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
declare-lab/instruct-eval
This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.
pointnetwork/point-alpaca
mathllm/MathCoder
Family of LLMs for mathematical reasoning.
IBM/ModuleFormer
ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward experts. We released a collection of ModuleFormer-based Language Models (MoLM) ranging in scale from 4 billion to 8 billion parameters.
HanGuo97/flute
Fast Matrix Multiplications for Lookup Table-Quantized LLMs
shawntan/scattermoe
Triton-based implementation of Sparse Mixture of Experts.
stanford-futuredata/stk
X-PLUG/mPLUG
mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections. (EMNLP 2022)
YicongHong/Fine-Grained-R2R
Code and data of the Fine-Grained R2R Dataset proposed in the EMNLP 2021 paper Sub-Instruction Aware Vision-and-Language Navigation
astra-vision/FAMix
[CVPR 2024] Official repository of "A Simple Recipe for Language-guided Domain Generalized Segmentation"
Lyken17/FlashATM