richardsun-voyager

Interested in machine (deep) learning and natural language processing.

A*Singapore

richardsun-voyager's Stars

karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Language: Python 37.1k 371 3175.9k
tloen/alpaca-lora
Instruct-tune LLaMA on consumer hardware
Language: Jupyter Notebook 18.6k 154 4692.2k
NVIDIA/DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
Language: Jupyter Notebook 13.5k 295 8423.2k
BlinkDL/RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
Language: Python 12.6k 134 216858
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Language: Python 10.5k 93 7751k
huggingface/trl
Train transformer language models with reinforcement learning.
Language: Python 9.9k 76 1.2k1.3k
roboticcam/machine-learning-notes
My continuously updated Machine Learning, Probabilistic Models and Deep Learning notes and demos (2000+ slides), with video links.
Language: Jupyter Notebook 8.4k 388 311.7k
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
Language: Python 6.9k 38 1.1k1.8k
OpenNMT/OpenNMT-py
Open Source Neural Machine Translation and (Large) Language Models in PyTorch
Language: Python 6.8k 178 1.4k2.3k
Lightning-AI/lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
Language: Python 6k 68 270517
OpenGVLab/LLaMA-Adapter
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
Language: Python 5.7k 78 142373
bentrevett/pytorch-sentiment-analysis
Tutorials on getting started with PyTorch and TorchText for sentiment analysis.
Language: Jupyter Notebook 4.4k 82 1091.2k
KoljaB/RealtimeTTS
Converts text to speech in real time
Language: Python 2k 20 109197
KinWaiCheuk/nnAudio
Audio processing using PyTorch 1D convolutional networks
Language: Python 1k 18 6389
OpenLMLab/LOMO
LOMO: LOw-Memory Optimization
Language: Python 977 12 7068
karpathy/pytorch-normalizing-flows
Normalizing flows in PyTorch. Currently intended for education, not production.
Language: Jupyter Notebook 845 27 998
alibaba/Megatron-LLaMA
Best practice for training LLaMA models in Megatron-LM
Language: Python 626 6 6350
lupantech/ScienceQA
Data and code for NeurIPS 2022 Paper "Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering".
Language: Python 603 9 2063
timbmg/Sentence-VAE
PyTorch re-implementation of "Generating Sentences from a Continuous Space" by Bowman et al. (2015), https://arxiv.org/abs/1511.06349
Language: Python 586 10 27153
ikostrikov/pytorch-flows
PyTorch implementations of algorithms for density estimation
Language: Python 575 18 875
XuezheMax/NeuroNLP2
Deep neural models for core NLP tasks (PyTorch version)
Language: Python 440 21 4489
allanj/pytorch_neural_crf
PyTorch implementation of LSTM/BERT-CRF for named entity recognition
Language: Python 359 9 3662
ypeleg/llama
User-friendly LLaMA: Train or Run the model using PyTorch. Nothing else.
Language: Python 329 2 1360
Kyubyong/bert_ner
NER with BERT
Language: Python 281 10 956
XuezheMax/flowseq
Generative flow-based sequence-to-sequence toolkit written in Python.
Language: Python 243 8 932
serp-ai/LLaMA-8bit-LoRA
Repository for Chat LLaMA - training a LoRA for the LLaMA (1 or 2) models on HuggingFace with 8-bit or 4-bit quantization. Research only.
Language: Python 146 4 615
ProjectD-AI/LLaMA-Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Language: Python 68 0 06
allanj/phd-thesis
PhD thesis
Language: TeX 9 2 00
nlplab-best-team/diagnostic-reasoning
Explore the ability of large language models (LLMs) to perform history taking through diagnostic reasoning, and how to enhance such ability.
Language: Jupyter Notebook 40
lingo-mit/context-ablations
🤗 Transformers: State-of-the-art Natural Language Processing for PyTorch, TensorFlow, and JAX.
Language: Shell 30
