Pinned Repositories
AIOS
AIOS: LLM Agent Operating System
angular2-ama-cn
Angular 2: ask me anything
angular_train
AngularJS practice project
comfy-server
A ComfyUI server that makes using the ComfyUI API as easy as sending a message
llama.cpp
LLM inference in C/C++
lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
ollama
Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.
sglang
SGLang is yet another fast serving framework for large language models and vision language models.
text-generation-inference
Large Language Model Text Generation Inference
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
ilumiere's Repositories
ilumiere/comfy-server
A ComfyUI server that makes using the ComfyUI API as easy as sending a message
ilumiere/AIOS
AIOS: LLM Agent Operating System
ilumiere/llama.cpp
LLM inference in C/C++
ilumiere/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
ilumiere/ollama
Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.
ilumiere/sglang
SGLang is yet another fast serving framework for large language models and vision language models.
ilumiere/text-generation-inference
Large Language Model Text Generation Inference
ilumiere/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
ilumiere/code-server
VS Code in the browser
ilumiere/codeinterpreter-api
👾 Open source implementation of the ChatGPT Code Interpreter
ilumiere/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
ilumiere/comfyui-mixlab-nodes
Workflow-to-APP, ScreenShare & FloatingVideo, GPT & 3D, SpeechRecognition & TTS
ilumiere/ConsistentID
Customized ID Consistent for human
ilumiere/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
ilumiere/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
ilumiere/I-S00N
ilumiere/inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
ilumiere/LibreChat
Enhanced ChatGPT Clone: Features OpenAI, Assistants API, Azure, Groq, GPT-4 Vision, Mistral, Bing, Anthropic, OpenRouter, Google Gemini, AI model switching, message search, langchain, DALL-E-3, ChatGPT Plugins, OpenAI Functions, Secure Multi-User System, Presets, completely open-source for self-hosting. More features in development
ilumiere/Linly-Talker
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬
ilumiere/lux
👾 Fast and simple video download library and CLI tool written in Go
ilumiere/NativeDancer
Make your character dance in a native style
ilumiere/NativeSpeaker
Make your speaker talk in a native style with their own voice!
ilumiere/navicat_reset_mac
ilumiere/openai-translator
Browser extension and cross-platform desktop application for select-to-translate translation based on the ChatGPT API.
ilumiere/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
ilumiere/search_with_lepton
Building a quick conversation-based search demo with Lepton AI.
ilumiere/SWE-agent
SWE-agent: Agent Computer Interfaces Enable Software Engineering Language Models
ilumiere/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
ilumiere/tiny-gpu
A minimal GPU design in Verilog to learn how GPUs work from the ground up
ilumiere/xoscar
Python actor framework for heterogeneous computing.