kamuyix's Stars
howard-hou/RWKV-TS
RWKV-TS: Beyond Traditional Recurrent Neural Network for Time Series Tasks
nttmdlab-nlp/VisualMRC
VisualMRC: Machine Reading Comprehension on Document Images (AAAI2021)
MBZUAI-LLM/web2code
Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs
AILab-CVC/SEED-Bench
(CVPR2024)A benchmark for evaluating Multimodal LLMs using multiple-choice questions.
openai/sparse_attention
Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"
yoshall/AirFormer
PyTorch implementation of AirFormer, AAAI-23
liguodongiot/llm-action
本项目旨在分享大模型相关技术原理以及实战经验。
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
hkust-nlp/deita
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]
binary-husky/gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
TRI-ML/prismatic-vlms
A flexible and efficient codebase for training visually-conditioned language models (VLMs)
howard-hou/VisualRWKV
VisualRWKV is the visual-enhanced version of the RWKV language model, enabling RWKV to handle various visual tasks.
BAAI-DCAI/DataOptim
A collection of visual instruction tuning datasets.
CASIA-LM/MoDS
UKPLab/sentence-transformers
State-of-the-Art Text Embeddings
clovaai/donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
Victorwz/MLM_Filter
Official implementation of our paper "Finetuned Multimodal Language Models are High-Quality Image-Text Data Filters".
shrinivdeshmukh/avroconvert
Convert avro files to parquet, csv and json format
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
AykutSarac/jsoncrack.com
✨ Innovative and open-source visualization application that transforms various data formats, such as JSON, YAML, XML, CSV and more, into interactive graphs.
xai-org/grok-1
Grok open release
BlinkDL/RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
OFA-Sys/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
bryanbocao/awesome-open-papernotes
Yet another Ph.D. adventure.
Ironbrotherstyle/UnVIO
The source code of IJCAI2020 paper "Unsupervised Monocular Visual-inertial Odometry Network".
mingyuyng/Visual-Selective-VIO
Code for "Efficient Deep Visual and Inertial Odometry with Adaptive Visual Modality Selection", ECCV 2022
hanbt/learn_dl
Deep learning algorithms source code for beginners
hunkim/DeepLearningStars
Top Deep Learning Projects based on their Stars!
JaidedAI/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
practical-tutorials/project-based-learning
Curated list of project-based tutorials