Pinned Repositories
MGM
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
openai-summary
Summarize URLs or files (including YouTube videos via transcripts) using an OpenAI-compatible API.
openedai-images
An OpenAI API-compatible images server to generate or manipulate images.
openedai-moderations
An OpenAI API-compatible moderations server for checking whether text is potentially harmful.
openedai-speech
An OpenAI API-compatible text-to-speech server using Coqui AI's xtts_v2 and/or Piper TTS as the backend.
openedai-vision
An OpenAI API-compatible API for chatting with image input and asking questions about the images (aka multimodal).
openedai-whisper
An OpenAI API-compatible speech-to-text server for audio transcription and translation, aka Whisper.
text-generation-webui
A Gradio web UI for Large Language Models.
exllamav2
A fast inference library for running LLMs locally on modern consumer-class GPUs.
babyagi
matatonic's Repositories
matatonic/openedai-speech
An OpenAI API-compatible text-to-speech server using Coqui AI's xtts_v2 and/or Piper TTS as the backend.
matatonic/openedai-vision
An OpenAI API-compatible API for chatting with image input and asking questions about the images (aka multimodal).
matatonic/openai-summary
Summarize URLs or files (including YouTube videos via transcripts) using an OpenAI-compatible API.
matatonic/openedai-whisper
An OpenAI API-compatible speech-to-text server for audio transcription and translation, aka Whisper.
matatonic/openedai-images
An OpenAI API-compatible images server to generate or manipulate images.
matatonic/openedai-moderations
An OpenAI API-compatible moderations server for checking whether text is potentially harmful.