Jerrisk's Stars
lupantech/chameleon-llm
Codes for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".
baaivision/Emu
Emu Series: Generative Multimodal Models from BAAI
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
PKU-YuanGroup/Video-LLaVA
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
OpenGVLab/LLaMA-Adapter
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
VITA-MLLM/VITA
✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
allenai/OLMo-Eval
Evaluation suite for LLMs
meta-llama/llama-cookbook
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama model family and using them on various provider services
google-deepmind/gemma
Open weights LLM from Google DeepMind.
Hannibal046/Awesome-LLM
Awesome-LLM: a curated list of Large Language Model
LLaVA-VL/LLaVA-NeXT
autonomousvision/unimatch
[TPAMI'23] Unifying Flow, Stereo and Depth Estimation
microsoft/ptvsd
Python debugger package for use with Visual Studio and Visual Studio Code.
PyO3/maturin
Build and publish crates with pyo3, cffi and uniffi bindings as well as rust binaries as python packages
PyO3/pyo3
Rust bindings for the Python interpreter
pemistahl/lingua
The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
nalepae/pandarallel
A simple and efficient tool to parallelize Pandas operations on all available CPUs
oppo-us-research/OpenIlluminationCapture
FreddeFrallan/Multilingual-CLIP
OpenAI CLIP text encoders for multiple languages!
Vchitect/VEnhancer
Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation
zju3dv/IntrinsicAnything
waczjoan/MiraGe
Official implementation of "MiraGe: Editable 2D Images using Gaussian Splatting"
facebookresearch/audioseal
Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector
mit-han-lab/nunchaku
SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
ohayonguy/PMRF
Official implementation of Posterior-Mean Rectified Flow: Towards Minimum MSE Photo-Realistic Image Restoration
neuralmagic/deepsparse
Sparsity-aware deep learning inference runtime for CPUs
neuralmagic/sparseml
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
rom1504/cc2dataset
Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...
buaacyw/MeshAnythingV2
From anything to mesh like human artists. Official impl. of "MeshAnything V2: Artist-Created Mesh Generation With Adjacent Mesh Tokenization"