yonghao211's Stars
chunhuizhang/llms_tuning
stay tuned.
zhaibowen/Retriever
Retriever-0.1B
SmartFlowAI/LLM101n-CN
LLM101n: Let's build a Storyteller (Chinese edition)
huggingface/trl
Train transformer language models with reinforcement learning.
SkyworkAI/Skywork
Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation methods, etc.
pittisl/Generative-AI-Tutorial
A subjective learning guide for generative AI research
RUC-GSAI/YuLan-Chat
YuLan: An Open-Source Large Language Model
owenliang/bpe-tokenizer
LLM Tokenizer with BPE algorithm
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
princeton-nlp/SimPO
SimPO: Simple Preference Optimization with a Reference-Free Reward
chunhuizhang/personal_chatgpt
personal chatgpt
InternLM/InternLM
Official release of InternLM2.5 base and chat models. 1M context support
sugarforever/wtf-langchain
microsoft/autogen
A programming framework for agentic AI 🤖
ZongXR/DCIC2023-Fraud-Risk-Identification
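This competition task aims to use effective fintech and big-data systems to analyze new patterns of gambling- and fraud-related fund transactions and to continuously improve risk-monitoring models. Using the gambling and fraud blacklist and whitelist provided by the competition, together with the related transaction-record dataset for training, the goal is to build an algorithmic model that identifies gambling- and fraud-related accounts and comprehensively screens for existing risk. Ranked 11/1594 on leaderboard A and 13/1594 on leaderboard B.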
chunhuizhang/bert_t5_gpt
TommyZihao/zihao_course
Open courses by Tongji Zihao (同济子豪兄)
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more