qzl164's Stars
IEIT-Yuan/Yuan2.0-M32
Mixture-of-Experts (MoE) Language Model
shibing624/text2vec
text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。
NVIDIA/FasterTransformer
Transformer related optimization, including BERT, GPT
01-ai/Yi-1.5
Yi-1.5 is an upgraded version of Yi, delivering stronger performance in coding, math, reasoning, and instruction-following capability.
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
stanleylsx/llms_tool
一个基于HuggingFace开发的大语言模型训练、测试工具。支持各模型的webui、终端预测,低参数量及全参数模型训练(预训练、SFT、RM、PPO、DPO)和融合、量化。
unclecode/crawl4ai
🔥🕷️ Crawl4AI: Crawl Smarter, Faster, Freely. For AI.
triton-lang/triton
Development repository for the Triton language and compiler
triton-inference-server/server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
unslothai/unsloth
Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory
elder-plinius/L1B3RT4S
TOTALLY HARMLESS LIBERATION PROMPTS FOR GOOD LIL AI'S
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
open-mmlab/mmpretrain
OpenMMLab Pre-training Toolbox and Benchmark
ultralytics/yolov5
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
Developer-Y/cs-video-courses
List of Computer Science courses with video lectures.
microsoft/computervision-recipes
Best Practices, code samples, and documentation for Computer Vision.
WZMIAOMIAO/deep-learning-for-image-processing
deep learning for image processing including classification and object-detection etc.
tinyvision/DAMO-YOLO
DAMO-YOLO: a fast and accurate object detection method with some new techs, including NAS backbones, efficient RepGFPN, ZeroHead, AlignedOTA, and distillation enhancement.
Lordog/dive-into-llms
《动手学大模型Dive into LLMs》系列编程实践教程
pytorch/torchtitan
A native PyTorch Library for large model training
deepseek-ai/DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
2471023025/RALM_Survey
This is a repository of RALM surveys containing a summary of state-of-the-art RAG and other technologies
opendatalab/WanJuan2.0-WanJuan-CC
WanJuan-CC是以CommonCrawl为基础,经过数据抽取,规则清洗,去重,安全过滤,质量清洗等步骤得到的高质量数据。
315386775/DeepLearing-Interview-Awesome-2024
AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓,同时包含工作和科研过程中的新想法、新问题、新资源与新项目
bbycroft/llm-viz
3D Visualization of an GPT-style LLM
apple/corenet
CoreNet: A library for training deep neural networks
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
HIT-SCIR/Chinese-Mixtral-8x7B
中文Mixtral-8x7B(Chinese-Mixtral-8x7B)