echo840's Stars
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
KindXiaoming/pykan
Kolmogorov-Arnold Networks
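The core idea behind a KAN layer can be sketched in a few lines: where an MLP layer multiplies inputs by scalar weights and then applies a fixed activation, a KAN puts a learnable univariate function on every edge and sums the results. A minimal pure-Python sketch follows; note that pykan itself uses B-spline bases with a SiLU residual in PyTorch, so the RBF basis, grid, and coefficients here are illustrative assumptions, not the library's API.

```python
import math

def phi(x, coeffs, centers, width=1.0):
    """One learnable univariate edge function, expanded in a fixed RBF basis.
    coeffs are the trainable parameters; centers/width define the basis.
    (pykan uses B-splines plus a SiLU residual; RBFs stand in for simplicity.)"""
    return sum(c * math.exp(-((x - m) / width) ** 2)
               for c, m in zip(coeffs, centers))

def kan_layer(inputs, edge_coeffs, centers):
    """A KAN layer: output_j = sum_i phi_{j,i}(x_i).
    Every edge applies its own learnable non-linear function, then sums --
    no weight matrix, no shared fixed activation."""
    return [
        sum(phi(x, edge_coeffs[j][i], centers) for i, x in enumerate(inputs))
        for j in range(len(edge_coeffs))
    ]

centers = [-1.0, 0.0, 1.0]                       # fixed RBF grid shared by all edges
edge_coeffs = [                                   # 2 inputs -> 1 output
    [[0.5, 1.0, 0.5], [1.0, 0.0, -1.0]],          # one coefficient vector per edge
]
out = kan_layer([0.0, 0.0], edge_coeffs, centers)
```

Training would fit `edge_coeffs` by gradient descent; the point of the sketch is only the edge-wise function structure.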
jessevig/bertviz
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
hila-chefer/Transformer-Explainability
[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.
lxtGH/OMG-Seg
OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]
onnx/onnxmltools
ONNXMLTools enables conversion of models to ONNX
vivo-ai-lab/BlueLM
BlueLM(蓝心大模型): Open large language models developed by vivo AI Lab
hila-chefer/Transformer-MM-Explainability
[ICCV 2021 - Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-based network. Includes examples for DETR and VQA.
AntonioTepsich/Convolutional-KANs
This project extends the innovative Kolmogorov-Arnold Network (KAN) architecture to convolutional layers, replacing the convolution's classic linear transformation with learnable non-linear activations at each pixel.
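The mechanism in that description can be sketched directly: at each sliding-window position, each kernel slot applies its own learnable univariate function to the pixel it covers, instead of multiplying by a scalar weight. A toy 1-D version is below; the repo works on 2-D images with learnable splines, so the hand-picked "edge functions" here are illustrative stand-ins, not its actual parameterization.

```python
import math

def kan_conv1d(signal, kernel_fns):
    """KAN-style 1-D convolution: each kernel slot k applies its own
    univariate function phi_k to the input value under it, and the
    results are summed per window (vs. sum(w_k * x_k) in a classic conv).
    kernel_fns: one callable per kernel position."""
    k = len(kernel_fns)
    return [
        sum(fn(signal[i + j]) for j, fn in enumerate(kernel_fns))
        for i in range(len(signal) - k + 1)
    ]

# Toy "learned" edge functions; a classic conv would use fn = lambda x: w * x.
fns = [lambda x: x ** 2, lambda x: math.tanh(x), lambda x: 0.5 * x]
out = kan_conv1d([1.0, 0.0, 2.0, -1.0], fns)
```

The only change from a standard convolution is swapping the per-slot multiply for a per-slot function evaluation; stride, padding, and channels would carry over unchanged.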
NVlabs/RADIO
Official repository for "AM-RADIO: Reduce All Domains Into One"
WXinlong/DenseCL
Dense Contrastive Learning (DenseCL) for self-supervised representation learning, CVPR 2021 Oral.
luogen1996/LaVIN
[NeurIPS 2023] Official implementations of "Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models"
shenyunhang/APE
[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception
AIDC-AI/Ovis
A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.
microsoft/rho
Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.
OpenGVLab/OmniCorpus
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
1ssb/torchkan
An easy-to-use PyTorch implementation of the Kolmogorov-Arnold Network and a few novel variations
zh460045050/V2L-Tokenizer
HKUST-LongGroup/Awesome-Open-Vocabulary-Detection-and-Segmentation
Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future
cheng-haha/KANs
🕹️ Toy examples of Kolmogorov-Arnold Networks (get started quickly)
IntelLabs/multimodal_cognitive_ai
Research work on multimodal cognitive AI
amazon-science/QA-ViT
foundation-multimodal-models/CAL
[NeurIPS'24] Official PyTorch Implementation of Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment
naver-ai/cream
Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models, EMNLP 2023
Ikomia-dev/onnx-donut
Export the Donut model to ONNX and run it with ONNX Runtime
Lackel/AGLA
Code for paper "AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention"
ShuoZhang2003/DT-VQA
xinke-wang/EST-VQA
[CVPR2020] EST-VQA Dataset
leeguandong/EcommerceOCRBench
An OCR benchmark for multimodal large language models on e-commerce text recognition, modeled on OCRBench but with a larger evaluation set.