inference-api
There are 85 repositories under the inference-api topic.
roboflow/inference
Turn any computer or edge device into a command center for your computer vision projects.
basetenlabs/truss
The simplest way to serve AI/ML models in production
quic/ai-hub-models
The Qualcomm® AI Hub Models are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.
quic/ai-hub-apps
The Qualcomm® AI Hub Apps are a collection of sample applications showcasing state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.
Michael-OvO/Yolov7-Flask
A Beautiful Flask Web API for Yolov7 (and custom) models
mustafamerttunali/deep-learning-training-gui
Train and run predictions with pre-trained deep learning models through a GUI (web app). No more juggling endless parameters, no more manual data preprocessing.
pszemraj/textsum
CLI & Python API to easily summarize text-based files with transformers
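textsum builds on Hugging Face transformers; a minimal sketch of the kind of summarization pipeline it wraps (the model name and generation parameters are illustrative assumptions, not textsum defaults):

```python
# A rough sketch of the transformers summarization pipeline that tools like
# textsum build on. Model name and length bounds are illustrative choices;
# textsum also handles chunking of long files, which this sketch omits.
from transformers import pipeline

summarizer = pipeline("summarization", model="facebook/bart-large-cnn")

with open("report.txt", "r", encoding="utf-8") as f:
    text = f.read()

result = summarizer(text, max_length=150, min_length=30, do_sample=False)
print(result[0]["summary_text"])
```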
BMW-InnovationLab/BMW-Classification-Training-GUI
This repository allows you to get started with training a State-of-the-art Deep Learning model with little to no configuration needed! You provide your labeled dataset and you can start the training right away. You can even test your model with our built-in Inference REST API. Training classification models with GluonCV has never been so easy.
inference-gateway/inference-gateway
An open-source, cloud-native, high-performance gateway unifying multiple LLM providers, from local solutions like Ollama to major cloud providers such as OpenAI, Groq, Cohere, Anthropic, Cloudflare and DeepSeek.
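Gateways like this typically present a single OpenAI-compatible surface in front of many providers; a hypothetical sketch of such a call, where the address, route, and model identifier are assumptions for illustration rather than details taken from the project:

```python
# Hypothetical sketch: call an OpenAI-compatible chat completions route on a
# local gateway. URL, port, route, and model id are assumptions.
import requests

resp = requests.post(
    "http://localhost:8080/v1/chat/completions",  # assumed gateway address
    json={
        "model": "ollama/llama3",  # assumed provider/model naming scheme
        "messages": [{"role": "user", "content": "Summarize the inference-api topic."}],
    },
    timeout=60,
)
resp.raise_for_status()
print(resp.json()["choices"][0]["message"]["content"])
```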
intelligencedev/eternal
Eternal is an experimental platform for machine learning models and workflows.
Kardbord/hfapigo
Unofficial (Golang) Go bindings for the Hugging Face Inference API
hupe1980/go-huggingface
🤗 Hugging Face Inference Client written in Go
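Both Go clients wrap the same hosted Hugging Face Inference API, which is a plain HTTP endpoint; as a language-neutral illustration of what they abstract over (shown in Python, with a placeholder token):

```python
# The hosted Hugging Face Inference API that these Go clients wrap is a plain
# HTTP endpoint; the token below is a placeholder.
import requests

API_URL = "https://api-inference.huggingface.co/models/distilbert-base-uncased-finetuned-sst-2-english"
headers = {"Authorization": "Bearer hf_your_token_here"}

resp = requests.post(API_URL, headers=headers, json={"inputs": "I love this library!"})
resp.raise_for_status()
print(resp.json())  # e.g. [[{"label": "POSITIVE", "score": ...}, ...]]
```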
BMW-InnovationLab/BMW-Classification-Inference-GPU-CPU
This is a repository for an image classification inference API using the GluonCV framework. The inference REST API works on CPU/GPU and is supported on Windows and Linux operating systems. Models trained using our GluonCV classification training repository can be deployed in this API, and several models can be loaded and used at the same time.
Prismadic/magnet
the small distributed language model toolkit; fine-tune state-of-the-art LLMs anywhere, rapidly
TimMikeladze/huggingface
Typescript wrapper for the Hugging Face Inference API.
TommyLemon/CVAuto
👁 Zero-code, zero-annotation CV AI automated testing tool 🚀 Eliminates large amounts of manual bounding-box drawing and labeling; quickly run zero-code automated tests of computer vision AI image-recognition algorithms such as pedestrian detection, animal and plant classification, face recognition, OCR license-plate recognition, rotation correction, dance pose estimation, and matting/segmentation, with one-click download of test reports and export of training and test datasets.
stephanj/Llama3JavaChatCompletionService
Llama3.java inference engine with an OpenAI Chat Completion REST API.
decisionfacts/semantic-ai
An open-source framework for Retrieval-Augmented Generation (RAG) that uses semantic search to retrieve the expected results and generate a human-readable conversational response with the help of an LLM (Large Language Model).
yas-sim/openvino-ep-enabled-onnxruntime
Describes how to enable the OpenVINO Execution Provider for ONNX Runtime.
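For reference, selecting the provider from Python looks roughly like this, assuming the OpenVINO-enabled ONNX Runtime build is installed and "model.onnx" is your own model (the input shape is a placeholder):

```python
# Rough sketch: run an ONNX model with the OpenVINO Execution Provider,
# falling back to CPU. Assumes an OpenVINO-enabled onnxruntime build is
# installed; "model.onnx" and the input shape are placeholders.
import numpy as np
import onnxruntime as ort

session = ort.InferenceSession(
    "model.onnx",
    providers=["OpenVINOExecutionProvider", "CPUExecutionProvider"],
)

input_name = session.get_inputs()[0].name
dummy = np.random.rand(1, 3, 224, 224).astype(np.float32)  # illustrative shape
outputs = session.run(None, {input_name: dummy})
print(outputs[0].shape)
```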
BorjaOteroFerreira/IALab-Suite
Tool for testing different large language models without code.
RageAgainstThePixel/com.rest.huggingface
A non-official Hugging Face REST client for Unity (UPM).
SaeedNajafi/infer-pytorch-pyspark
Coupling PySpark with PyTorch Models
kyryl-opens-ml/ml-in-production-practice
Practice for Machine Learning in Production course
jparkerweb/bedrock-proxy-endpoint
🔀 Bedrock Proxy Endpoint ⇢ Spin up your own custom OpenAI API server endpoint for easy AWS Bedrock inference (using the standard baseUrl and apiKey params)
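Because the proxy speaks the standard OpenAI wire format, pointing an ordinary OpenAI client at it is all that's needed; a sketch with the Python openai SDK, where the base URL, key, and model id are placeholders:

```python
# Sketch: point the standard OpenAI client at a self-hosted proxy endpoint.
# base_url, api_key, and the model id are placeholders for illustration.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:3000/v1",  # your proxy endpoint
    api_key="your-proxy-key",
)

completion = client.chat.completions.create(
    model="anthropic.claude-3-sonnet-20240229-v1:0",  # an example Bedrock model id
    messages=[{"role": "user", "content": "Hello from the proxy!"}],
)
print(completion.choices[0].message.content)
```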
shivamMg/stable-diffusion-on-azureml
REST APIs for Stable Diffusion, with inference support on AzureML.
antoninoLorenzo/Ollama-on-Colab-with-ngrok
Notebook to run Ollama on Google Colab
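Once the ngrok tunnel is up, the Colab-hosted Ollama instance can be reached through its normal REST API; a sketch where the tunnel URL and model name are placeholders:

```python
# Sketch: call a Colab-hosted Ollama instance through its ngrok tunnel.
# The tunnel URL and model name are placeholders.
import requests

OLLAMA_URL = "https://your-tunnel-id.ngrok-free.app"

resp = requests.post(
    f"{OLLAMA_URL}/api/generate",
    json={"model": "llama3", "prompt": "Why run Ollama on Colab?", "stream": False},
    timeout=120,
)
resp.raise_for_status()
print(resp.json()["response"])
```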
BMW-InnovationLab/BMW-TensorFlow-Training-GUI
This repository allows you to get started with GUI-based training of a state-of-the-art deep learning model with little to no configuration needed! No-code training with TensorFlow has never been so easy.
gmkung/Cheemera
A Node.js backend that exposes a TypeScript implementation of the deCheem inference engine for LLMs/ChatGPT.
LM4eu/goinfer
Local LLM proxy, DevOps friendly
pandruszkow/whisper-inference-server
A networked inference server for Whisper speech recognition
PromptOn/prompton
Chat prompt template evaluation and inference monitoring
ingyuseong/rabbitmq-inference
A message queue based server architecture to asynchronously handle resource-intensive tasks (e.g., ML inference)
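The core idea is that the API tier only enqueues work while separate workers run the model; a minimal sketch of the producer side using pika (the broker address, queue name, and payload shape are illustrative, not taken from the project):

```python
# Minimal sketch of the producer side of a message-queue-based inference
# architecture: the API process enqueues a job instead of running the model.
# Broker address, queue name, and payload shape are illustrative.
import json
import pika

connection = pika.BlockingConnection(pika.ConnectionParameters(host="localhost"))
channel = connection.channel()
channel.queue_declare(queue="inference_jobs", durable=True)

channel.basic_publish(
    exchange="",
    routing_key="inference_jobs",
    body=json.dumps({"request_id": "abc123", "image_url": "https://example.com/cat.jpg"}),
    properties=pika.BasicProperties(delivery_mode=2),  # persist the message
)
connection.close()
```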
pchandrasekaran1595/Computer-Vision-API
Computer Vision API built using FastAPI and pretrained models converted to ONNX format.
yas-sim/OpenVINO_Asynchronous_API_Performance_Demo
This project demonstrates the high performance of the OpenVINO asynchronous inference API.
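The asynchronous API keeps several inference requests in flight instead of blocking on each one; a rough sketch using OpenVINO's AsyncInferQueue, where the model path, device, and input shape are placeholders:

```python
# Rough sketch of OpenVINO's asynchronous inference API: multiple requests
# run concurrently instead of one blocking infer() at a time.
# Model path, device, and input shape are placeholders.
import numpy as np
from openvino.runtime import Core, AsyncInferQueue

core = Core()
compiled = core.compile_model("model.xml", "CPU")

def on_done(request, frame_id):
    # Invoked when a request finishes.
    print(f"frame {frame_id}: output shape {request.get_output_tensor().data.shape}")

queue = AsyncInferQueue(compiled, jobs=4)  # keep 4 requests in flight
queue.set_callback(on_done)

for frame_id in range(16):
    dummy = np.random.rand(1, 3, 224, 224).astype(np.float32)
    queue.start_async({0: dummy}, userdata=frame_id)

queue.wait_all()
```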
YAV-AI/NodeJS-Stable-Diffusion-XL-Base-1.0-Hugging-Face-Inference-API
A simple Node.js example that generates an image using Stable Diffusion XL via the Hugging Face Inference API.
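The repo's example is Node.js; for comparison, the same Hugging Face Inference API call in Python, where the token is a placeholder and the endpoint returns raw image bytes:

```python
# The same Hugging Face Inference API call shown in Python; the endpoint
# returns raw image bytes. The token is a placeholder.
import requests

API_URL = "https://api-inference.huggingface.co/models/stabilityai/stable-diffusion-xl-base-1.0"
headers = {"Authorization": "Bearer hf_your_token_here"}

resp = requests.post(API_URL, headers=headers, json={"inputs": "a lighthouse at dusk, oil painting"})
resp.raise_for_status()

with open("output.png", "wb") as f:
    f.write(resp.content)
```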