longlive1234's Stars
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
krahets/hello-algo
《Hello 算法》 ("Hello Algo"): an animated, illustrated, one-click-runnable data structures and algorithms tutorial, with code in Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, and Dart. Simplified and Traditional Chinese editions are updated in sync; an English version is in progress.
ollama/ollama
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ implementations/tutorials of deep learning papers with side-by-side notes 📝, including transformers (original, XL, Switch, Feedback, ViT, ...), optimizers (Adam, AdaBelief, Sophia, ...), GANs (CycleGAN, StyleGAN2, ...), 🎮 reinforcement learning (PPO, DQN), CapsNet, distillation, and more 🧠
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
karpathy/LLM101n
LLM101n: Let's build a Storyteller
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
ddbourgin/numpy-ml
Machine learning, in numpy
graykode/nlp-tutorial
Natural Language Processing Tutorial for Deep Learning Researchers
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
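A hypothetical NumPy sketch (not the repository's CUDA kernels) of the core "online softmax" trick behind memory-efficient exact attention: scores are processed in key tiles while only a running max, running denominator, and running output are kept, so the full T×T score matrix is never materialized. All names and shapes here are illustrative.

```python
import numpy as np

def naive_attention(q, k, v):
    """Reference implementation: materializes the full score matrix."""
    s = q @ k.T / np.sqrt(q.shape[-1])
    p = np.exp(s - s.max(axis=-1, keepdims=True))
    p /= p.sum(axis=-1, keepdims=True)
    return p @ v

def tiled_attention(q, k, v, tile=4):
    """Exact attention computed tile-by-tile over the keys."""
    d = q.shape[-1]
    m = np.full(q.shape[0], -np.inf)   # running row max
    l = np.zeros(q.shape[0])           # running softmax denominator
    o = np.zeros_like(q)               # running (unnormalized) output
    for start in range(0, k.shape[0], tile):
        kt, vt = k[start:start + tile], v[start:start + tile]
        s = q @ kt.T / np.sqrt(d)      # scores for this tile only
        m_new = np.maximum(m, s.max(axis=-1))
        scale = np.exp(m - m_new)      # rescale previous accumulators
        p = np.exp(s - m_new[:, None])
        l = l * scale + p.sum(axis=-1)
        o = o * scale[:, None] + p @ vt
        m = m_new
    return o / l[:, None]

rng = np.random.default_rng(0)
q, k, v = (rng.standard_normal((8, 16)) for _ in range(3))
assert np.allclose(naive_attention(q, k, v), tiled_attention(q, k, v))
```

The rescaling by `exp(m - m_new)` is what keeps the tiled result bitwise-close to the naive softmax despite never seeing all scores at once.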
microsoft/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
state-spaces/mamba
Mamba SSM architecture
BlinkDL/RWKV-LM
RWKV is an RNN with transformer-level LLM performance that can be trained directly like a GPT (parallelizable). It combines the best of RNNs and transformers: great performance, fast inference, low VRAM use, fast training, "infinite" ctx_len, and free sentence embeddings.
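A hypothetical, heavily simplified sketch (not the official RWKV code; the real model adds token shifting, a per-token bonus, and numerical stabilization) of the kind of decayed key-value recurrence behind this dual RNN/parallel view: each token can be produced from a constant-size state, yet the same quantity can be written as an explicit weighted sum over all past positions, which is the parallelizable, GPT-style training form. The decay `w`, keys `k`, and values `v` are toy stand-ins.

```python
import numpy as np

def wkv_recurrent(k, v, w):
    """RNN view: one token at a time with O(1) state (num, den)."""
    num = np.zeros_like(v[0])
    den = 0.0
    out = []
    for t in range(len(k)):
        num = np.exp(-w) * num + np.exp(k[t]) * v[t]
        den = np.exp(-w) * den + np.exp(k[t])
        out.append(num / den)
    return np.stack(out)

def wkv_parallel(k, v, w):
    """Parallel view: the same output as an attention-like weighted sum."""
    T = len(k)
    out = np.empty_like(v)
    for t in range(T):
        i = np.arange(t + 1)
        wts = np.exp(-(t - i) * w + k[:t + 1])  # decayed, key-gated weights
        out[t] = (wts[:, None] * v[:t + 1]).sum(axis=0) / wts.sum()
    return out

rng = np.random.default_rng(1)
k = rng.standard_normal(6)
v = rng.standard_normal((6, 4))
assert np.allclose(wkv_recurrent(k, v, 0.5), wkv_parallel(k, v, 0.5))
```

Because both forms compute the same output, training can use the parallel form over whole sequences while inference streams tokens with constant memory.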
Blealtan/efficient-kan
An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).
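A hypothetical minimal NumPy sketch (not this repository's implementation) of the core KAN idea: instead of a fixed scalar weight per edge, each input-to-output edge applies a learnable univariate function. Here each edge function is a small polynomial with learnable coefficients; the original KAN paper uses B-splines. The random coefficients are stand-ins for learned parameters.

```python
import numpy as np

def kan_layer(x, coeffs, degree=3):
    """x: (batch, d_in); coeffs: (d_in, d_out, degree + 1).

    Output o[b, j] = sum_i phi_ij(x[b, i]), where phi_ij is a
    polynomial whose coefficients are the learnable parameters.
    """
    # Basis: powers x^0 .. x^degree, shape (batch, d_in, degree + 1).
    basis = np.stack([x ** p for p in range(degree + 1)], axis=-1)
    # Evaluate each edge's univariate function, then sum over inputs.
    return np.einsum('bip,ijp->bj', basis, coeffs)

rng = np.random.default_rng(2)
x = rng.standard_normal((5, 3))          # batch of 5, d_in = 3
coeffs = rng.standard_normal((3, 2, 4))  # d_in = 3, d_out = 2, degree 3
y = kan_layer(x, coeffs)
assert y.shape == (5, 2)
```

Efficiency-oriented implementations like this one exploit exactly this structure: the basis is evaluated once per input and combined with all edge coefficients in a single batched contraction.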
Nixtla/neuralforecast
Scalable and user-friendly neural 🧠 forecasting algorithms.
ddz16/TSFpaper
This repository contains a reading list of papers on Time Series Forecasting/Prediction (TSF) and Spatio-Temporal Forecasting/Prediction (STF). These papers are mainly categorized according to the type of model.
KimMeen/Time-LLM
[ICLR 2024] Official implementation of "🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"
NX-AI/xlstm
Official repository of xLSTM.
kwuking/TimeMixer
[ICLR 2024] Official implementation of "TimeMixer: Decomposable Multiscale Mixing for Time Series Forecasting"
test-time-training/ttt-lm-pytorch
Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
NVlabs/MambaVision
Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone
AntonioTepsich/Convolutional-KANs
This project extends Kolmogorov-Arnold Networks (KAN) to convolutional layers, replacing the convolution's classic linear transformation with learnable non-linear activations at each pixel.
GistNoesis/FourierKAN
A KAN layer variant that uses a Fourier basis in place of splines.
EurekaLabsAI/micrograd
The Autograd Engine
IvanDrokin/torch-conv-kan
This project is dedicated to the implementation and research of Kolmogorov-Arnold convolutional networks. The repository includes implementations of 1D, 2D, and 3D convolutions with different kernels, ResNet-like and DenseNet-like models, training code based on accelerate/PyTorch, as well as scripts for experiments with CIFAR-10 and Tiny ImageNet.
test-time-training/ttt-lm-jax
Official JAX implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
XiudingCai/Awesome-Mamba-Collection
A curated collection of papers, tutorials, videos, and other valuable resources related to Mamba.
Zyphra/tree_attention
Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters
XiudingCai/MambaTS-pytorch
Official code for MambaTS: Improved Selective State Space Models for Long-term Time Series Forecasting