ArsenLuca's Stars
zed-industries/zed
Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.
chaoswork/llm_illustrated
看图学大模型
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
open-mmlab/Multimodal-GPT
Multimodal-GPT
YouHuang67/focsam
TencentQQGYLab/ELLA
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
Yuliang-Liu/Monkey
【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
G-U-N/Phased-Consistency-Model
Boosting the performance of consistency models with PCM!
Charles-Xie/awesome-described-object-detection
A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring Expression Comprehension. Updated frequently and pull requests welcomed.
ynqa/sig
Interactive grep (for streaming)
conda-forge/miniforge
A conda-forge distribution.
ahupp/python-magic
A python wrapper for libmagic
astanin/python-tabulate
Pretty-print tabular data in Python, a library and a command-line utility. Repository migrated from bitbucket.org/astanin/python-tabulate.
mozillazg/python-pinyin
汉字转拼音(pypinyin)
horseee/DeepCache
[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free
bqplot/bqplot
Plotting library for IPython/Jupyter notebooks
HaozheLiu-ST/T-GATE
T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!
joye61/pic-smaller
Pic Smaller – Compress JPEG, PNG, WEBP, AVIF, SVG and GIF images intelligently
xavysp/TEED
TEED: Tiny and Efficient Edge Detector
TheMistoAI/ComfyUI-Anyline
Anyline: A Fast, Accurate, and Detailed Line Detection Preprocessor
evil-huawei/evil-huawei
Evil Huawei - 华为作过的恶
mbzuai-oryx/PALO
Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, Hindi, Bengali and Urdu.
lijiannuist/Efficient-Multimodal-LLMs-Survey
Efficient Multimodal Large Language Models: A Survey
aim-uofa/DiverGen
[CVPR 2024] DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data
QwenLM/Qwen1.5
Qwen1.5 is the improved version of Qwen, the large language model series developed by Qwen team, Alibaba Cloud.
baaivision/CapsFusion
[CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale
reasoning-survey/Awesome-Reasoning-Foundation-Models
✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models
LeapLabTHU/EfficientTrain
1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundation visual backbones.
ma-xu/Rewrite-the-Stars
[CVPR 2024] Rewrite the Stars
ShineChen1024/MiaoBi
Chinese Stable Diffusion, zh SD,中文文生图,中文SD,中文Stable Diffusion