Pinned Repositories
Ad-papers
Papers on Computational Advertising
AlgoNotes
公众号【浅梦学习笔记】文章汇总:包含 排序&CXR预估,召回匹配,用户画像&特征工程,推荐搜索综合 计算广告,大数据,图算法,NLP&CV,求职面试 等内容
alignment-handbook
Robust recipes to align language models with human and AI preferences
alphaFM
Multi-thread implementation of Factorization Machines with FTRL for binary-class classification problem.
annotated_deep_learning_paper_implementations
🧑🏫 59 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
awesome-deep-learning-single-cell-papers
Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
Awesome-Interview
Collection of awesome interview references.
Awesome_Diffusions
AllenShow's Repositories
AllenShow/alignment-handbook
Robust recipes to align language models with human and AI preferences
AllenShow/apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
AllenShow/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
AllenShow/Chinese-LLaMA-Alpaca-2
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)
AllenShow/deepseekv2-profile
AllenShow/DeepSpeedExamples
Example models using DeepSpeed
AllenShow/Firefly
Firefly: 大模型训练工具,支持训练Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
AllenShow/Firefly-LLaMA2-Chinese
Firefly中文LLaMA-2大模型,支持增量预训练Baichuan2、Llama2、Llama、Falcon、Qwen、Baichuan、InternLM、Bloom等大模型
AllenShow/fusion_bench
FusionBench: A Comprehensive Benchmark of Deep Model Fusion
AllenShow/LLaMA-Factory
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
AllenShow/llama.cpp
LLM inference in C/C++
AllenShow/LLaMA2-Accessory
An Open-source Toolkit for LLM Development
AllenShow/llm-action
本项目旨在分享大模型相关技术原理以及实战经验。
AllenShow/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
AllenShow/llm_recipes
A set of scripts and notebooks on LLM finetunning and dataset creation
AllenShow/mergekit
Tools for merging pretrained large language models.
AllenShow/mergoo
A library for easily merging multiple LLM experts, and efficiently train the merged LLM.
AllenShow/Multimodal-Toolkit
Multimodal model for text and tabular data with HuggingFace transformers as building block for text data
AllenShow/notebooks
AllenShow/OLMoE
OLMoE: Open Mixture-of-Experts Language Models
AllenShow/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
AllenShow/OpenMoE
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
AllenShow/personal_chatgpt
personal chatgpt
AllenShow/SATURN
AllenShow/st-moe-pytorch
Implementation of ST-Moe, the latest incarnation of MoE after years of research at Brain, in Pytorch
AllenShow/SupContrast
PyTorch implementation of "Supervised Contrastive Learning" (and SimCLR incidentally)
AllenShow/tianshou
An elegant PyTorch deep reinforcement learning library.
AllenShow/trl
Train transformer language models with reinforcement learning.
AllenShow/UCE
UCE is a zero-shot foundation model for single-cell gene expression data
AllenShow/unsloth
Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory