Dcas89

Dcas89's Stars

openai/sparse_attention
Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"
Language:Python1.5k192
open-compass/VLMEvalKit
Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 40+ benchmarks
Language:Python1.1k157
plasma-umass/scalene
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
Language:Python11.6k388
google-deepmind/nanodo
Language:Python1849
togethercomputer/Dragonfly
Language:Python6411
FareedKhan-dev/Building-llama3-from-scratch
LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.
Language:Jupyter Notebook8427
lllyasviel/Omost
Your image is almost there!
Language:Python7.2k418
sslotin/amh-code
Complete implementations from "Algorithms for Modern Hardware"
Language:Jupyter Notebook66641
algorithmica-org/algorithmica
A computer science textbook
Language:Jupyter Notebook3.3k320
jiangsongtao/Med-MoE
Language:Python545
severian42/Vodalus-Expert-LLM-Forge
Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation editor Gradio UI.
Language:Jupyter Notebook15519
jbdel/vilmedic
ViLMedic (Vision-and-Language medical research) is a modular framework for vision and language multimodal research in the medical field
Language:Python15420
evo-design/evo
Biological foundation modeling from molecular to genome scale
Language:Jupyter Notebook932112
arogozhnikov/einops
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
Language:Python8.4k348
HazyResearch/ThunderKittens
Tile primitives for speedy kernels
Language:Cuda1.5k58
mustafaaljadery/gemma-2B-10M
Gemma 2B with 10M context length using Infini-attention.
Language:Python94258
Future-House/paper-qa
High accuracy RAG for answering questions from scientific documents with citations
Language:Python5.9k560
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
Language:C++5.4k922
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Language:Python6.5k583
EvolvingLMMs-Lab/lmms-eval
Accelerating the development of large multimodal models (LMMs) with lmms-eval
Language:Python1.4k115
HVision-NKU/StoryDiffusion
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
Language:Jupyter Notebook5.8k581
raymin0223/fast_robust_early_exit
Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding (EMNLP 2023 Long)
Language:Python518
e2b-dev/code-interpreter
Python & JS/TS SDK for running AI-generated code/code interpreting in your AI app
Language:Python1.1k80
pytorch/executorch
On-device AI across mobile, embedded and edge for PyTorch
Language:C++1.9k301
rougier/numpy-100
100 numpy exercises (with solutions)
Language:Python12k5.7k
astramind-ai/Mixture-of-depths
Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"
Language:Python1237
UCDvision/NOLA
Code for NOLA, an implementation of "nola: Compressing LoRA using Linear Combination of Random Basis"
Language:Python472
KindXiaoming/pykan
Kolmogorov Arnold Networks
Language:Jupyter Notebook14.7k1.3k
lucidrains/infini-transformer-pytorch
Implementation of Infini-Transformer in Pytorch
Language:Python1001
AdityaNG/kan-gpt
The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold Networks (KANs) for language modeling
Language:Python69554