XU-YIJIE's Stars
wdndev/llm_interview_note
Notes on the knowledge and interview questions relevant to large language model (LLM) algorithm (application) engineers
modelscope/ms-swift
Use PEFT or Full-parameter to finetune 400+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)
Xposed-Modules-Repo/com.fkzhang.wechatxposed
WeXposed (微X module)
triton-inference-server/tensorrtllm_backend
The Triton TensorRT-LLM Backend
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
2ertwo/LLaMa3-Numpy-trainable
A trainable LLaMa3 reimplemented in NumPy
dair-ai/Prompt-Engineering-Guide
🐙 Guides, papers, lectures, notebooks and resources for prompt engineering
EndlessCheng/codeforces-go
Competitive programming template library by 灵茶山艾府 💭💡🎈
microsoft/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
baidubce/app-builder
appbuilder-sdk: the Qianfan AppBuilder-SDK helps developers build AI-native applications flexibly and quickly
tencentmusic/cube-studio
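cube studio: an open-source, cloud-native, one-stop machine learning / deep learning / large-model AI platform. Supports SSO login, multi-tenancy, big-data platform integration, online notebook development, drag-and-drop pipeline orchestration, multi-node multi-GPU distributed training, hyperparameter search, inference serving with vGPU, edge computing, serverless, an annotation platform, automated annotation, dataset management, large-model fine-tuning, vllm large-model inference, llmops, private knowledge bases, and an AI model app store; supports one-click model development/inference/fine-tuning, Chinese domestic cpu/gpu/npu chips, RDMA, and distributed pytorch/tf/mxnet/deepspeed/paddle/colossalai/horovod/spark/ray/volcano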
HumanSignal/Adala
Adala: Autonomous DAta (Labeling) Agent framework
key-networks/ztncui
ZeroTier network controller UI
multimodal-art-projection/MAP-NEO
Jonnyan404/zerotier-planet
Self-host a zerotier-planet in one minute
microsoft/DeepSpeedExamples
Example models using DeepSpeed
open-compass/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.
yangjianxin1/Firefly
Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models
openai/transformer-debugger
YJiangcm/PromCSE
Code for "Improved Universal Sentence Embeddings with Prompt-based Contrastive Learning and Energy-based Learning (EMNLP 2022)"
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
OKC13/General-Documents-Layout-parser
General layout analysis | Chinese document parsing | Document Layout Analysis | layout parser
dottxt-ai/outlines
Structured Text Generation
tanchongmin/strictjson
A Strict JSON Framework for LLM Outputs
varunshenoy/super-json-mode
Low latency JSON generation using LLMs ⚡️
Muennighoff/sgpt
SGPT: GPT Sentence Embeddings for Semantic Search
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
km1994/LLMs_interview_notes
This repository mainly records interview questions for large language model (LLM) algorithm engineers
triton-inference-server/server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.