kangjiahui's Stars
ggerganov/llama.cpp
LLM inference in C/C++
labmlai/annotated_deep_learning_paper_implementations
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
onnx/onnx
Open standard for machine learning interoperability
ai-shifu/ChatALL
Concurrently chat with ChatGPT, Bing Chat, Bard, Alpaca, Vicuna, Claude, ChatGLM, MOSS, 讯飞星火 (iFLYTEK Spark), 文心一言 (ERNIE Bot), and more to discover the best answers
QwenLM/Qwen
The official repository of Qwen (通义千问), the chat and pretrained large language model developed by Alibaba Cloud.
divamgupta/diffusionbee-stable-diffusion-ui
Diffusion Bee is the easiest way to run Stable Diffusion locally on your M1 Mac. Comes with a one-click installer. No dependencies or technical knowledge needed.
microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
QwenLM/Qwen2.5
Qwen2.5 is the large language model series developed by the Qwen team at Alibaba Cloud.
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Reamd7/notion-zh_CN
Chinese localization for Notion
makenotion/notion-sdk-js
Official Notion JavaScript Client
Lordog/dive-into-llms
A hands-on programming tutorial series for "Dive into LLMs" (《动手学大模型》)
mymusise/ChatGLM-Tuning
A fine-tuning solution based on ChatGLM-6B + LoRA
DefTruth/lite.ai.toolkit
🛠 A lite C++ toolkit of 100+ Awesome AI models, supporting ORT, MNN, NCNN, TNN, and TensorRT. 🎉🎉
open-mmlab/mmdeploy
OpenMMLab Model Deployment Framework
eProsima/Fast-DDS
The most complete DDS - Proven: Plenty of success cases. Looking for commercial support? Contact info@eprosima.com
quic/aimet
AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
openai/finetune-transformer-lm
Code and model for the paper "Improving Language Understanding by Generative Pre-Training"
kwuking/TimeMixer
[ICLR 2024] Official implementation of "TimeMixer: Decomposable Multiscale Mixing for Time Series Forecasting"
zhiqwang/yolort
yolort is a runtime stack for YOLOv5 on specialized accelerators such as TensorRT, LibTorch, ONNX Runtime, TVM, and NCNN.
PacktPublishing/Modern-CMake-for-Cpp
Modern CMake for C++, published by Packt
xmba15/onnx_runtime_cpp
A small C++ library for quickly deploying models using ONNX Runtime
fengxinjie/Transformer-OCR
ltkong218/FastFlowNet
FastFlowNet: A Lightweight Network for Fast Optical Flow Estimation (ICRA 2021)
intel/ros2_openvino_toolkit
kalfazed/tensorrt_starter
This repository gives a guideline for learning CUDA and TensorRT from the beginning.
quic/efficient-transformers
This library empowers users to seamlessly port pretrained models and checkpoints from the Hugging Face (HF) Hub (built with the HF transformers library) into inference-ready formats that run efficiently on Qualcomm Cloud AI 100 accelerators.
wangxb96/Awesome-EdgeAI
Resources of our survey paper "A Comprehensive Survey on AI Integration at the Edge: Techniques, Applications, and Challenges"
scaidermern/top-processes
Displays the top processes according to current CPU or memory usage
kalfazed/multi-thread-programming
A repository for practicing multi-threaded programming in C++