iambabao's Stars
iptv-org/iptv
Collection of publicly available IPTV channels from all over the world
meta-llama/llama
Inference code for Llama models
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
amusi/CVPR2024-Papers-with-Code
CVPR 2024 论文和开源项目合集
openai/DALL-E
PyTorch package for the discrete VAE used for DALL·E.
mlfoundations/open_clip
An open source implementation of CLIP.
RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
openlm-research/open_llama
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
WooooDyy/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
microsoft/DeepSpeedExamples
Example models using DeepSpeed
open-mmlab/mmcv
OpenMMLab Computer Vision Foundation
NVIDIA/FasterTransformer
Transformer related optimization, including BERT, GPT
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
salesforce/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
huggingface/deep-rl-class
This repo contains the syllabus of the Hugging Face Deep Reinforcement Learning Course.
NVlabs/VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
neulab/prompt2model
prompt2model - Generate Deployable Models from Natural Language Instructions
salesforce/ALBEF
Code for ALBEF: a new vision-language pre-training method
WojciechMula/pyahocorasick
Python module (C extension and plain python) implementing Aho-Corasick algorithm
google/pygtrie
Python library implementing a trie data structure.
zdou0830/METER
METER: A Multimodal End-to-end TransformER Framework
Zoeyyao27/CoT-Igniting-Agent
This repository contains the paper list for the paper: Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents
kobiso/Computer-Vision-Leaderboard
Comparison of famous convolutional neural network models
phellonchen/awesome-Vision-and-Language-Pre-training
Recent Advances in Vision and Language Pre-training (VLP)
deehzee/unionfind
A union-find disjoint sets data structure implemented in Python with the "Weighted Quick Union with Path Compression" algorithm.