tmhm's Stars
excalidraw/excalidraw
Virtual whiteboard for sketching hand-drawn like diagrams
twitter/the-algorithm
Source code for Twitter's Recommendation Algorithm
mli/paper-reading
Paragraph-by-paragraph close reading of classic and new deep learning papers
ScrapeGraphAI/Scrapegraph-ai
Python scraper based on AI
ShishirPatil/gorilla
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
harvardnlp/annotated-transformer
An annotated implementation of the Transformer paper.
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
OpenBMB/AgentVerse
🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation
simoninithomas/Deep_reinforcement_learning_Course
Implementations from the free course Deep Reinforcement Learning with Tensorflow and PyTorch
opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
wzhe06/Reco-papers
Classic papers and resources on recommendation
ekzhu/datasketch
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
databricks/dbrx
Code examples and resources for DBRX, a large language model developed by Databricks
rail-berkeley/rlkit
Collection of reinforcement learning algorithms
kzl/decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
microsoft/CodeBERT
CodeBERT
eosphoros-ai/Awesome-Text2SQL
Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL, Text2API, Text2Vis and more.
Mimino666/langdetect
Port of Google's language-detection library to Python.
hkust-nlp/ceval
Official GitHub repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]
openai/evolution-strategies-starter
Code for the paper "Evolution Strategies as a Scalable Alternative to Reinforcement Learning"
codefuse-ai/Awesome-Code-LLM
[TMLR] A curated list of language modeling research for code and related datasets.
guyulongcs/Awesome-Deep-Learning-Papers-for-Search-Recommendation-Advertising
Awesome Deep Learning papers for industrial Search, Recommendation and Advertising. They focus on Embedding, Matching, Ranking (CTR and CVR prediction), Post Ranking, Multi-task Learning, Graph Neural Networks, Transfer Learning, Reinforcement Learning, Self-supervised Learning and so on.
openai/lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
jia-zhuang/pytorch-multi-gpu-training
A summary of single-machine multi-GPU training methods and principles in PyTorch
tensorboy/PIDOptimizer
Code for this CVPR 2018 paper: "A PID Controller Approach for Stochastic Optimization of Deep Networks", Wangpeng An, Haoqian Wang, Qingyun Sun, Jun Xu, Qionghai Dai, Lei Zhang.
bojone/t5_in_bert4keras
A summary of the key points for using T5 models in Keras
HKUSTDial/NL2SQL_Handbook
This is a continuously updated handbook for readers to easily track the latest NL2SQL techniques in the literature and provide practical guidance for researchers and practitioners.
Mercury7353/PyBench
helpmefindaname/transformer-smaller-training-vocab
Temporarily remove unused tokens during training to save RAM and speed up training.