ryyzn9
Large Language Models(LLMs)|🤗|Artificial Intelligence(AI) researcher|Natural Language Processing( NLP )| Computer Vision|Data scientist|ML0ps| python |Pytorch|
ryyzn9's Stars
karpathy/LLM101n
LLM101n: Let's build a Storyteller
microsoft/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
openlm-research/open_llama
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
pytorch-labs/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
ridgerchu/matmulfreellm
Implementation for MatMul-free LM.
pytorch/torchtitan
A native PyTorch Library for large model training
google-research/big_vision
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
SilasMarvin/lsp-ai
LSP-AI is an open-source language server that serves as a backend for AI-powered functionality, designed to assist and empower software engineers, not replace them.
dabochen/spreadsheet-is-all-you-need
A nanoGPT pipeline packed in a spreadsheet
gkamradt/LLMTest_NeedleInAHaystack
Doing simple retrieval from LLM models at various context lengths to measure accuracy
deepseek-ai/DeepSeek-LLM
DeepSeek LLM: Let there be answers
alxndrTL/mamba.py
A simple and efficient Mamba implementation in pure PyTorch and MLX.
microsoft/Samba
Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"
lucidrains/linear-attention-transformer
Transformer based on a variant of attention that is linear complexity in respect to sequence length
mlfoundations/open_lm
A repository for research on medium sized language models.
minyoungg/platonic-rep
kyegomez/zeta
Build high-performance AI models with modular building blocks
leobeeson/llm_benchmarks
A collection of benchmarks and datasets for evaluating LLM.
lucidrains/linformer
Implementation of Linformer for Pytorch
AmeenAli/HiddenMambaAttn
Official PyTorch Implementation of "The Hidden Attention of Mamba Models"
wuch15/Fastformer
A pytorch &keras implementation and demo of Fastformer.
LeapLabTHU/MLLA
Official repository of MLLA (NeurIPS 2024)
lucidrains/fast-transformer-pytorch
Implementation of Fast Transformer in Pytorch
yl4579/HiFTNet
HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform
PeaBrane/mamba-tiny
Simple, minimal implementation of the Mamba SSM in one pytorch file. More efficient than using for loops, but probably less efficient than using associative scans
alxndrTL/othello_mamba
Evaluating the Mamba architecture on the Othello game
knotgrass/attention
several types of attention modules written in PyTorch
fkodom/python-repo-template
Template repo for Python projects, especially those focusing on machine learning and/or deep learning.
karthikncode/ParlAI
A framework for training and evaluating AI models on a variety of openly available dialog datasets.