jamindy

jamindy's Stars

ollama/ollama
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
Language:Go95.7k 571 4.7k7.6k
ggerganov/llama.cpp
LLM inference in C/C++
Language:C++67.1k 552 3.9k9.6k
binary-husky/gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口，特别优化论文阅读/润色/写作体验，模块化设计，支持自定义快捷按钮&函数插件，支持Python和C++等项目剖析&自译解功能，PDF/LaTex论文翻译&总结功能，支持并行问询多种LLM模型，支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
Language:Python65.3k 279 1.6k8k
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
Language:TypeScript50.3k 358 4.6k7.2k
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Language:Python33.3k 203 5.1k4.1k
mli/paper-reading
深度学习经典、新论文逐段精读
26.9k 728 02.4k
songquanpeng/one-api
OpenAI 接口管理 & 分发系统，支持 Azure、Anthropic Claude、Google PaLM 2 & Gemini、智谱 ChatGLM、百度文心一言、讯飞星火认知、阿里通义千问、360 智脑以及腾讯混元，可用于二次分发管理 key，仅单可执行文件，已打包好 Docker 镜像，一键部署，开箱即用. OpenAI key management & redistribution system, using a single API for all LLMs, and features an English UI.
Language:JavaScript18.8k 105 1.5k4.2k
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
Language:Jupyter Notebook13.6k 95 171.1k
ggerganov/ggml
Tensor library for machine learning
Language:C++11.1k 128 4141k
chenzomi12/AISystem
AISystem 主要是指AI系统，包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
Language:Jupyter Notebook11k 149 371.6k
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Language:Python10.4k 162 7652.3k
liguodongiot/llm-action
本项目旨在分享大模型相关技术原理以及实战经验（大模型工程化、大模型应用落地）
Language:HTML10.2k 82 211k
ymcui/Chinese-LLaMA-Alpaca-2
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)
Language:Python7.1k 79 389579
Fafa-DL/Lhy_Machine_Learning
李宏毅2021/2022/2023春季机器学习课程课件及作业
Language:Jupyter Notebook6.2k 50 131.6k
facebookresearch/fairscale
PyTorch extensions for high performance and large scale training.
Language:Python3.2k 48 359280
Jittor/jittor
Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators.
Language:Python3.1k 64 356311
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)
Language:Python2.4k 21 274232
microsoft/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Language:Python1.9k 24 182341
kyegomez/BitNet
Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch
Language:Python1.7k 40 37150
XueFuzhao/OpenMoE
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
Language:Python1.4k 14 871
intelligent-machine-learning/dlrover
DLRover: An Automatic Distributed Deep Learning System
Language:Python1.3k 51 242162
alibaba/Pai-Megatron-Patch
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
Language:Python703 10 146100
HIT-SCIR/Chinese-Mixtral-8x7B
中文Mixtral-8x7B（Chinese-Mixtral-8x7B）
Language:Python641 15 3032
feifeibear/long-context-attention
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference
Language:Python345 4 1821
Strivin0311/long-llms-learning
A repository sharing the literatures about long-context large language models, including the methodologies and the evaluation benchmarks
Language:Jupyter Notebook252 8 214
FlagOpen/FlagScale
FlagScale is a large model toolkit based on open-sourced projects.
Language:Python157 7 1142
sxontheway/Keep-Learning
The record of what I‘ve been through.
Language:Python95 1 014
hengjiUSTC/learn-llm
Language:Jupyter Notebook80 2 316
MARD1NO/CUDA-PPT
79 2 012
Strivin0311/llms-learning
A repository sharing the literatures about large language models
Language:Jupyter Notebook19 3 01