bmwas's Stars
excalidraw/excalidraw
Virtual whiteboard for sketching hand-drawn like diagrams
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
exo-explore/exo
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
wandb/openui
OpenUI let's you describe UI using your imagination, then see it rendered live.
apify/crawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
feder-cr/linkedIn_auto_jobs_applier_with_AI
LinkedIn_AIHawk is a tool that automates the jobs application process on LinkedIn. Utilizing artificial intelligence, it enables users to apply for multiple job offers in an automated and personalized way.
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
facebookresearch/nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents
fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
a16z-infra/ai-town
A MIT-licensed, deployable starter kit for building and customizing your own version of AI town - a virtual town where AI characters live, chat and socialize.
EricLBuehler/mistral.rs
Blazingly fast LLM inference.
pipecat-ai/pipecat
Open Source framework for voice and multimodal conversational AI
BasedHardware/omi
AI wearables. Summaries, action items, transcription and 300+ apps
TMElyralab/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
vocodedev/vocode-core
🤖 Build voice-based LLM agents. Modular + open source.
tencent-ailab/V-Express
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
StreetLamb/tribe
Low code tool to rapidly build and coordinate multi-agent teams
adithya-s-k/marker-api
Easily deployable 🚀 API to convert PDF to markdown quickly with high accuracy.
abgulati/LARS
An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses.
kevingduck/ChatGPT-phone
Demo of twilio
facebookresearch/ssl-data-curation
PyTorch code for hierarchical k-means -- a data curation method for self-supervised learning
RetellAI/retell-custom-llm-python-demo
Zak-Hussain/LLM4BeSci_GSERM2024
The course introduces the use of open-source large language models (LLMs) from the Hugging Face ecosystem for research in the behavioral and social sciences.
SillyTavern/SillyTavern-WebSearch-Selenium
Add-on for the Web Search extension that provides the web browsing capabilities without the need for Extras API.
rahmanidashti/SyntheticTestCollections
[Official Codes] Synthetic Test Collections for Retrieval Evaluation (SIGIR 2024)
dragonscraper/ProxyHarvest
ProxyHarvest is an efficient tool designed to collect, validate, and organize proxies from various online sources
sachin0034/VocodeCallAssistant
This repository contains an AI-powered calling bot built using Vocode, OpenAI, Deepgram, and Twilio. The bot is designed to make automated calls, interact with users, and provide intelligent responses. It leverages advanced speech recognition and natural language processing capabilities to deliver a seamless and efficient calling experience.