inference-api
There are 64 repositories under inference-api topic.
roboflow/inference
Turn any computer or edge device into a command center for your computer vision projects.
BMW-InnovationLab/BMW-TensorFlow-Training-GUI
This repository allows you to get started with a gui based training a State-of-the-art Deep Learning model with little to no configuration needed! NoCode training with TensorFlow has never been so easy.
basetenlabs/truss
The simplest way to serve AI/ML models in production
quic/ai-hub-models
The Qualcomm® AI Hub Models are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.
Michael-OvO/Yolov7-Flask
A Beautiful Flask Web API for Yolov7 (and custom) models
mustafamerttunali/deep-learning-training-gui
Train and predict your model on pre-trained deep learning models through the GUI (web app). No more many parameters, no more data preprocessing.
quic/ai-hub-apps
The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.
dnth/x.infer
Framework agnostic computer vision inference. Run 1000+ models by changing only one line of code. Supports models from transformers, timm, ultralytics, vllm, ollama and your custom model.
pszemraj/textsum
CLI & Python API to easily summarize text-based files with transformers
BMW-InnovationLab/BMW-Classification-Training-GUI
This repository allows you to get started with training a State-of-the-art Deep Learning model with little to no configuration needed! You provide your labeled dataset and you can start the training right away. You can even test your model with our built-in Inference REST API. Training classification models with GluonCV has never been so easy.
intelligencedev/eternal
Eternal is an experimental platform for machine learning models and workflows.
Kardbord/hfapigo
Unofficial (Golang) Go bindings for the Hugging Face Inference API
BMW-InnovationLab/BMW-Classification-Inference-GPU-CPU
This is a repository for an image classification inference API using the Gluoncv framework. The inference REST API works on CPU/GPU. It's supported on Windows and Linux Operating systems. Models trained using our Gluoncv Classification training repository can be deployed in this API. Several models can be loaded and used at the same time.
lofcz/LlmTornado
One .NET library to consume OpenAI, Anthropic, Cohere, Google, Azure, Groq, and self-hosed APIs.
hupe1980/go-huggingface
🤗 Hugging Face Inference Client written in Go
Prismadic/magnet
the small distributed language model toolkit; fine-tune state-of-the-art LLMs anywhere, rapidly
TimMikeladze/huggingface
Typescript wrapper for the Hugging Face Inference API.
stephanj/Llama3JavaChatCompletionService
Llama3.java Inference engine with OpenAI Chat Completion REST API/
decisionfacts/semantic-ai
An open source framework for Retrieval-Augmented System (RAG) uses semantic search helps to retrieve the expected results and generate human readable conversational response with the help of LLM (Large Language Model).
yas-sim/openvino-ep-enabled-onnxruntime
Describing How to Enable OpenVINO Execution Provider for ONNX Runtime
BorjaOteroFerreira/IALab-Suite
Tool for test diferents large language models without code.
RageAgainstThePixel/com.rest.huggingface
A Non-Official HuggingFace Rest Client for Unity (UPM)
SaeedNajafi/infer-pytorch-pyspark
Coupling PySpark with PyTorch Models
kyryl-opens-ml/ml-in-production-practice
Practice for Machine Learning in Production course
shivamMg/stable-diffusion-on-azureml
REST APIs for StableDiffusion. Inferencing support on AzureML
defenseunicorns/leapfrogai-api
LeapfrogAI API
gmkung/Cheemera
A Node.js backend that exposes a Typescript implementation of the deCheem inference engine.
pandruszkow/whisper-inference-server
A networked inference server for Whisper so you don't have to keep waiting for the audio model to reload for the x-hunderdth time.
PromptOn/prompton
Chat prompt template evaluation and inference monitoring
pchandrasekaran1595/Computer-Vision-API
Computer VIsion API built using FastAPI and pretrained models converted to ONNX format
YAV-AI/NodeJS-Stable-Diffusion-XL-Base-1.0-Hugging-Face-Inference-API
A simple node.js example that generates an image using StableDiffusion via Hugging Face Inference API.
ingyuseong/rabbitmq-inference
A message queue based server architecture to asynchronously handle resource-intensive tasks (e.g., ML inference)
geniusrise/text
Text components powering LLMs & SLMs for geniusrise framework
inference-gateway/inference-gateway
An open-source, high-performance gateway unifying multiple LLM providers, from local solutions like Ollama to major cloud providers such as OpenAI, Groq Cloud, Cohere, Anthropic and Cloudflare.
jparkerweb/bedrock-proxy-endpoint
🔀 Bedrock Proxy Endpoint ⇢ Spin up your own custom OpenAI API server endpoint for easy AWS Bedrock inference (using standard baseUrl, and apiKey params)
pchandrasekaran1595/Computer-Vision-API-V2
Computer Vision API V2 - FastAPI & ONNX Models