jikechao's Stars
ggerganov/llama.cpp
LLM inference in C/C++
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
mlc-ai/web-llm
High-performance In-browser LLM Inference Engine
NVIDIA/TensorRT
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
alibaba/MNN
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/README.md)
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
dusty-nv/jetson-inference
Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.
openvinotoolkit/openvino
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
nedbat/coveragepy
The code coverage tool for Python
open-mmlab/mmdeploy
OpenMMLab Model Deployment Framework
linyiLYi/pose-monitor
“让爷康康”是一款手机 AI 应用程序,可以监测不良坐姿并进行语音提示
joernio/joern
Open-source code analysis platform for C/C++/Java/Binary/Javascript/Python/Kotlin based on code property graphs. Discord https://discord.gg/vv4MH284Hc
microsoft/DeepSpeed-MII
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
egraphs-good/egg
egg is a flexible, high-performance e-graph library
onnx/onnxmltools
ONNXMLTools enables conversion of models to ONNX
openvinotoolkit/nncf
Neural Network Compression Framework for enhanced OpenVINO™ inference
onnx/onnx-mlir
Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure
hyperai/tvm-cn
TVM Documentation in Chinese Simplified / TVM 中文文档
saltudelft/ml4se
A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering
strongcourage/awesome-directed-fuzzing
A curated list of awesome directed fuzzing research papers
MegEngine/MegCC
MegCC是一个运行时超轻量,高效,移植简单的深度学习模型编译器
microsoft/onnxconverter-common
Common utilities for ONNX converters
LeonYang95/PLELog
Implementation of PLELog in ICSE 2021 accepted paper:Semi-supervised Log-based Anomaly Detection via Probabilistic Label Estimation.
fabsx00/python-joern
A python interface to joern (deprecated).
DLFrameworkBug/DLFrameworkBugsData
a data collection of related work: Toward Understanding Deep Learning Framework Bugs
wzh99/GenCoG
GenCoG: A DSL-Based Approach to Generating Computation Graphs for TVM Testing (ISSTA‘23)
haoyang9804/HirGen
A Computational Graph Generator for AI Compiler Fuzzing
mlc-ai/docs
The documents for TVM Unity
SeekingDream/ISSTA23_DyCL
Deelvin/tvm-tools