tmhm

Driving for AI.

Shenzhen, Guangdong

tmhm's Stars

excalidraw/excalidraw
Virtual whiteboard for sketching hand-drawn like diagrams
Language:TypeScript82.8k 406 3.6k7.8k
twitter/the-algorithm
Source code for Twitter's Recommendation Algorithm
Language:Scala62.2k 341 95612.2k
mli/paper-reading
深度学习经典、新论文逐段精读
26.7k 724 02.4k
ScrapeGraphAI/Scrapegraph-ai
Python scraper based on AI
Language:Python15k 101 2921.2k
ShishirPatil/gorilla
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
Language:Python11.3k 100 229975
RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
Language:Python10.2k 154 64805
harvardnlp/annotated-transformer
An annotated implementation of the Transformer paper.
Language:Jupyter Notebook5.6k 66 901.2k
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
Language:Python4.6k 109 134397
OpenBMB/AgentVerse
🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation
Language:JavaScript4.1k 63 79396
simoninithomas/Deep_reinforcement_learning_Course
Implementations from the free course Deep Reinforcement Learning with Tensorflow and PyTorch
Language:Jupyter Notebook3.8k 134 741.2k
opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
3.3k 61 3206
wzhe06/Reco-papers
Classic papers and resources on recommendation
Language:Python3.3k 194 3804
ekzhu/datasketch
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
Language:Python2.5k 48 165293
databricks/dbrx
Code examples and resources for DBRX, a large language model developed by Databricks
Language:Python2.5k 40 23236
rail-berkeley/rlkit
Collection of reinforcement learning algorithms
Language:Python2.5k 61 131552
kzl/decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
Language:Python2.4k 29 64447
microsoft/CodeBERT
CodeBERT
Language:Python2.2k 38 297452
eosphoros-ai/Awesome-Text2SQL
Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL、Text2API、Text2Vis and more.
1.7k 18 5132
Mimino666/langdetect
Port of Google's language-detection library to Python.
Language:Python1.7k 26 77197
hkust-nlp/ceval
Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]
Language:Python1.6k 15 8177
openai/evolution-strategies-starter
Code for the paper "Evolution Strategies as a Scalable Alternative to Reinforcement Learning"
Language:Python1.6k 243 22278
codefuse-ai/Awesome-Code-LLM
[TMLR] A curated list of language modeling researches for code and related datasets.
1.5k 38 8103
guyulongcs/Awesome-Deep-Learning-Papers-for-Search-Recommendation-Advertising
Awesome Deep Learning papers for industrial Search, Recommendation and Advertising. They focus on Embedding, Matching, Ranking (CTR and CVR prediction), Post Ranking, Multi-task Learning, Graph Neural Networks, Transfer Learning, Reinforcement Learning, Self-supervised Learning and so on.
Language:Python1.4k 53 1224
openai/lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
Language:Python1.2k 24 15163
jia-zhuang/pytorch-multi-gpu-training
整理 pytorch 单机多 GPU 训练方法与原理
Language:Python760 5 884
tensorboy/PIDOptimizer
Code for this CVPR 2018 paper: "A PID Controller Approach for Stochastic Optimization of Deep Networks", Wangpeng An, Haoqian Wang, Qingyun Sun, Jun Xu, Qionghai Dai, Lei Zhang.
Language:Python184 15 1151
bojone/t5_in_bert4keras
整理一下在keras中使用T5模型的要点
Language:Python171 7 1328
HKUSTDial/NL2SQL_Handbook
This is a continuously updated handbook for readers to easily track the latest NL2SQL techniques in the literature and provide practical guidance for researchers and practitioners.
Language:Python153 8 16
Mercury7353/PyBench
Language:Python33 3 21
helpmefindaname/transformer-smaller-training-vocab
Temporary remove unused tokens during training to save ram and speed.
Language:Python20 3 32

tmhm

tmhm's Stars

excalidraw/excalidraw

twitter/the-algorithm

mli/paper-reading

ScrapeGraphAI/Scrapegraph-ai

ShishirPatil/gorilla

RUCAIBox/LLMSurvey

harvardnlp/annotated-transformer

huggingface/alignment-handbook

OpenBMB/AgentVerse

simoninithomas/Deep_reinforcement_learning_Course

opendilab/awesome-RLHF

wzhe06/Reco-papers

ekzhu/datasketch

databricks/dbrx

rail-berkeley/rlkit

kzl/decision-transformer

microsoft/CodeBERT

eosphoros-ai/Awesome-Text2SQL

Mimino666/langdetect

hkust-nlp/ceval

openai/evolution-strategies-starter

codefuse-ai/Awesome-Code-LLM

guyulongcs/Awesome-Deep-Learning-Papers-for-Search-Recommendation-Advertising

openai/lm-human-preferences

jia-zhuang/pytorch-multi-gpu-training

tensorboy/PIDOptimizer

bojone/t5_in_bert4keras

HKUSTDial/NL2SQL_Handbook

Mercury7353/PyBench

helpmefindaname/transformer-smaller-training-vocab