Dcas89's Stars
cambrian-mllm/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Asabeneh/30-Days-Of-Python
30 days of Python programming challenge is a step-by-step guide to learn the Python programming language in 30 days. This challenge may take more than 100 days; follow your own pace. These videos may help too: https://www.youtube.com/channel/UC7PNRuno1rzYPb1xLa4yktw
choosehappy/PatchSorter
A tool for rapidly labeling objects using deep learning feature embedding
cloneofsimo/minRF
Minimal implementation of scalable rectified flow transformers, based on SD3's approach
karpathy/LLM101n
LLM101n: Let's build a Storyteller
srush/Triton-Puzzles
Puzzles for learning Triton
lucidrains/multimodal-dit-pytorch
Implementation of a multimodal diffusion transformer in Pytorch
Jiangbo-Shi/ViLa-MIL
ViLa-MIL: Dual-scale Vision-Language Multiple Instance Learning for Whole Slide Image Classification (CVPR 2024)
PCrnjak/CM6_COBOT_ROBOT
Files and intro page for first version of CM6 COBOT robotic arm
lucidrains/linear-attention-transformer
Transformer based on a variant of attention that has linear complexity with respect to sequence length
openai/sparse_attention
Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"
open-compass/VLMEvalKit
Open-source evaluation toolkit for large vision-language models (LVLMs), supporting GPT-4v, Gemini, QwenVLPlus, 50+ HF models, 20+ benchmarks
plasma-umass/scalene
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
google-deepmind/nanodo
togethercomputer/Dragonfly
FareedKhan-dev/Building-llama3-from-scratch
LLaMA 3 is one of the most promising open-source models after Mistral; we will recreate its architecture in a simpler manner.
lllyasviel/Omost
Your image is almost there!
sslotin/amh-code
Complete implementations from "Algorithms for Modern Hardware"
algorithmica-org/algorithmica
A computer science textbook
jiangsongtao/TinyMed
severian42/Vodalus-Expert-LLM-Forge
Dataset Crafting and Efficient Fine-Tuning Using Only Free Open-Source Tools
jbdel/vilmedic
ViLMedic (Vision-and-Language medical research) is a modular framework for vision and language multimodal research in the medical field
evo-design/evo
Biological foundation modeling from molecular to genome scale
arogozhnikov/einops
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
HazyResearch/ThunderKittens
Tile primitives for speedy kernels
mustafaaljadery/gemma-2B-10M
Gemma 2B with 10M context length using Infini-attention.
whitead/paper-qa
LLM Chain for answering questions from documents with citations
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
huggingface/lerobot
🤗 LeRobot: End-to-end Learning for Real-World Robotics in Pytorch
EvolvingLMMs-Lab/lmms-eval
Accelerating the development of large multimodal models (LMMs) with lmms-eval