Pinned Repositories
agency-swarm
agency-swarm-lab
AI-Content-Ideas-Generator-Prototype
AI-Marketing-Army-Tester
ai_saas_app
Build a REAL Software-as-a-Service app with AI features and payments & credits system that you might even turn into a side income or business idea using Next.js 14, Clerk, MongoDB, Cloudinary AI, and Stripe.
alignment-handbook
Robust recipes for to align language models with human and AI preferences
AllTheWorldAPlay
All the world is a play, we are but actors in it.
alpaca-lora-finetune-language
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
nike-open-intepreter
美少女OPInterpreter 公開用リポジトリ
MoXMoussa's Repositories
MoXMoussa/AllTheWorldAPlay
All the world is a play, we are but actors in it.
MoXMoussa/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
MoXMoussa/avp_teleoperate
MoXMoussa/chatbot-ui
The open-source AI chat app for everyone.
MoXMoussa/ChatTTS
ChatTTS is a generative speech model for daily dialogue.
MoXMoussa/Cinemo
Cinemo: Consistent and Controllable Image Animation with Motion Diffusion Models
MoXMoussa/clarity-upscaler
Clarity AI | AI Image Upscaler & Enhancer - free and open-source Magnific Alternative
MoXMoussa/ComfyUI-MimicMotion
a comfyui custom node for MimicMotion
MoXMoussa/ConsistentID
Customized ID Consistent for human
MoXMoussa/Deep-Live-Cam
real time face swap and one-click video deepfake with only a single image (uncensored)
MoXMoussa/distributed-llama
Tensor parallelism is all you need. Run LLMs on weak devices or make powerful devices even more powerful by distributing the workload and dividing the RAM usage.
MoXMoussa/EVF-SAM
Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"
MoXMoussa/groqnotes
Groqnotes: Generate organized notes from audio using Groq, Whisper, and Llama3
MoXMoussa/IMS-Toucan
Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.
MoXMoussa/L1B3RT45
J41LBR34K PR0MPT5
MoXMoussa/nougat-ocr
MoXMoussa/OmniFusion
OmniFusion — a multimodal model to communicate using text and images
MoXMoussa/page-assist
Use your locally running AI models to assist you in your web browsing
MoXMoussa/phidata
Memory, knowledge and tools for LLMs
MoXMoussa/PuLID
Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
MoXMoussa/ragapp
The easiest way to use Agentic RAG in any enterprise
MoXMoussa/Scrapegraph-ai
Python scraper based on AI
MoXMoussa/segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
MoXMoussa/sn-gamestate
SoccerNet Game State Reconstruction: End-to-End Athlete Tracking and Identification on a Minimap (CVPR24 - CVSports workshop)
MoXMoussa/StableMoFusion
MoXMoussa/StoryDiffusion
Create Magic Story!
MoXMoussa/StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
MoXMoussa/streamv2v
Official Pytorch implementation of StreamV2V.
MoXMoussa/twenty
Building a modern alternative to Salesforce, powered by the community.
MoXMoussa/V-Express
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.