XU-YIJIE

XU-YIJIE's Stars

wdndev/llm_interview_note
Notes covering knowledge and interview questions for large language model (LLM) algorithm (application) engineers
Language: HTML · 4k stars · 457 forks
modelscope/ms-swift
Use PEFT or Full-parameter to finetune 400+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)
Language: Python · 4.5k stars · 395 forks
Xposed-Modules-Repo/com.fkzhang.wechatxposed
WeXposed (微X module)
1.4k stars · 43 forks
triton-inference-server/tensorrtllm_backend
The Triton TensorRT-LLM Backend
Language: Python · 715 stars · 108 forks
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Language: Python · 14.5k stars · 1.2k forks
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
Language: Jupyter Notebook · 13.8k stars · 1.1k forks
2ertwo/LLaMa3-Numpy-trainable
A trainable LLaMa3 reimplemented in NumPy
Language:Python334
dair-ai/Prompt-Engineering-Guide
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
Language: MDX · 50.8k stars · 4.9k forks
EndlessCheng/codeforces-go
Competitive programming template library by 灵茶山艾府 💭💡🎈
Language: Go · 5.4k stars · 585 forks
microsoft/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Language: Python · 1.9k stars · 345 forks
baidubce/app-builder
appbuilder-sdk: the Qianfan AppBuilder-SDK helps developers build AI-native applications flexibly and quickly
Language: Python · 464 stars · 117 forks
tencentmusic/cube-studio
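cube studio is an open-source, cloud-native, one-stop AI platform for machine learning, deep learning, and large models. It supports SSO login, multi-tenancy, big-data platform integration, online notebook development, drag-and-drop pipeline orchestration, multi-node multi-GPU distributed training, hyperparameter search, inference serving with vGPU, edge computing, serverless, an annotation platform, automated annotation, dataset management, large-model fine-tuning, vllm large-model inference, llmops, private knowledge bases, and an AI model app store. It also supports one-click model development/inference/fine-tuning, Chinese domestic cpu/gpu/npu chips, RDMA, and distributed pytorch/tf/mxnet/deepspeed/paddle/colossalai/horovod/spark/ray/volcano
Language: Jupyter Notebook · 3.7k stars · 657 forks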
HumanSignal/Adala
Adala: Autonomous DAta (Labeling) Agent framework
Language:Python98177
key-networks/ztncui
ZeroTier network controller UI
Language: JavaScript · 1.6k stars · 243 forks
multimodal-art-projection/MAP-NEO
Language:Python88382
Jonnyan404/zerotier-planet
Self-host a zerotier-planet in one minute
Language: Shell · 1.4k stars · 278 forks
microsoft/DeepSpeedExamples
Example models using DeepSpeed
Language: Python · 6.1k stars · 1k forks
open-compass/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.
Language: Python · 4.2k stars · 450 forks
yangjianxin1/Firefly
Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models
Language: Python · 5.9k stars · 530 forks
openai/transformer-debugger
Language: Python · 4k stars · 239 forks
YJiangcm/PromCSE
Code for "Improved Universal Sentence Embeddings with Prompt-based Contrastive Learning and Energy-based Learning (EMNLP 2022)"
Language:Python13616
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
Language: Python · 7.8k stars · 571 forks
OKC13/General-Documents-Layout-parser
General layout analysis | Chinese document parsing | Document Layout Analysis | layout parser
Language:Python458
dottxt-ai/outlines
Structured Text Generation
Language: Python · 9.9k stars · 511 forks
tanchongmin/strictjson
A Strict JSON Framework for LLM Outputs
Language:Jupyter Notebook31232
varunshenoy/super-json-mode
Low latency JSON generation using LLMs ⚡️
Language:Jupyter Notebook38613
Muennighoff/sgpt
SGPT: GPT Sentence Embeddings for Semantic Search
Language:Jupyter Notebook85454
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
Language: Python · 6.4k stars · 638 forks
km1994/LLMs_interview_notes
This repository mainly collects interview questions for large language model (LLM) algorithm engineers
1.5k stars · 108 forks
triton-inference-server/server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Language: Python · 8.4k stars · 1.5k forks