elphinkuo's Stars
xtekky/gpt4free
The official gpt4free repository | various collection of powerful language models
ggerganov/whisper.cpp
Port of OpenAI's Whisper model in C/C++
google-research/google-research
Google Research
facebookresearch/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
karpathy/llama2.c
Inference Llama 2 in one file of pure C
ml-explore/mlx
MLX: An array framework for Apple silicon
openai/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
garrettj403/SciencePlots
Matplotlib styles for scientific plotting
zihangdai/xlnet
XLNet: Generalized Autoregressive Pretraining for Language Understanding
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
leoxiaobin/deep-high-resolution-net.pytorch
The project is an official implementation of our CVPR2019 paper "Deep High-Resolution Representation Learning for Human Pose Estimation"
google-research/albert
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
cbfinn/maml
Code for "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks"
LiyuanLucasLiu/RAdam
On the Variance of the Adaptive Learning Rate and Beyond
salesforce/awd-lstm-lm
LSTM and QRNN Language Model Toolkit for PyTorch
nyu-mll/jiant
jiant is an nlp toolkit
anthropics/hh-rlhf
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
XueFuzhao/OpenMoE
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
openai/lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
MarkPKCollier/NeuralTuringMachine
Tensorflow implementation of a Neural Turing Machine
crazydonkey200/neural-symbolic-machines
Neural Symbolic Machines is a framework to integrate neural networks and symbolic representations using reinforcement learning, with applications in program synthesis and semantic parsing.
avisingh599/reward-learning-rl
[RSS 2019] End-to-End Robotic Reinforcement Learning without Reward Engineering
Cornell-RelaxML/QuIP
Code for paper: "QuIP: 2-Bit Quantization of Large Language Models With Guarantees"
HazyResearch/fly
google-deepmind/interval-bound-propagation
This repository contains a simple implementation of Interval Bound Propagation (IBP) using TensorFlow: https://arxiv.org/abs/1810.12715
yeshaokai/Robustness-Aware-Pruning-ADMM
Code release for "Adversarial Robustness vs Model Compression, or Both?"
amitz25/PCCoder
Implementation of the paper "Automatic Program Synthesis of Long Programs with a Learned Garbage Collector"
sebastianheinz/super-mario-reinforcement-learning
Double Q-learning reinforcement learning agent on NES Super Mario Bros
lottery-ticket/code
elphinkuo/fast_matrix_multiplication
Different matrix multiplication implementation and benchmarking on CPUs