Dcas89's Stars
openai/sparse_attention
Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"
open-compass/VLMEvalKit
Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 40+ benchmarks
plasma-umass/scalene
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
google-deepmind/nanodo
togethercomputer/Dragonfly
FareedKhan-dev/Building-llama3-from-scratch
LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.
lllyasviel/Omost
Your image is almost there!
sslotin/amh-code
Complete implementations from "Algorithms for Modern Hardware"
algorithmica-org/algorithmica
A computer science textbook
jiangsongtao/Med-MoE
severian42/Vodalus-Expert-LLM-Forge
Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation editor Gradio UI.
jbdel/vilmedic
ViLMedic (Vision-and-Language medical research) is a modular framework for vision and language multimodal research in the medical field
evo-design/evo
Biological foundation modeling from molecular to genome scale
arogozhnikov/einops
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
HazyResearch/ThunderKittens
Tile primitives for speedy kernels
mustafaaljadery/gemma-2B-10M
Gemma 2B with 10M context length using Infini-attention.
Future-House/paper-qa
High accuracy RAG for answering questions from scientific documents with citations
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
EvolvingLMMs-Lab/lmms-eval
Accelerating the development of large multimodal models (LMMs) with lmms-eval
HVision-NKU/StoryDiffusion
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
raymin0223/fast_robust_early_exit
Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding (EMNLP 2023 Long)
e2b-dev/code-interpreter
Python & JS/TS SDK for running AI-generated code/code interpreting in your AI app
pytorch/executorch
On-device AI across mobile, embedded and edge for PyTorch
rougier/numpy-100
100 numpy exercises (with solutions)
astramind-ai/Mixture-of-depths
Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"
UCDvision/NOLA
Code for NOLA, an implementation of "nola: Compressing LoRA using Linear Combination of Random Basis"
KindXiaoming/pykan
Kolmogorov Arnold Networks
lucidrains/infini-transformer-pytorch
Implementation of Infini-Transformer in Pytorch
AdityaNG/kan-gpt
The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold Networks (KANs) for language modeling