petermckj's Stars
gpt-engineer-org/gpt-engineer
Platform to experiment with the AI Software Engineer. Terminal based. NOTE: Very different from https://gptengineer.app
QuivrHQ/quivr
Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Any way you want.
CMU-Perceptual-Computing-Lab/openpose
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell.
RVC-Project/Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
HumanAIGC/AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
Sinaptik-AI/pandas-ai
Chat with your database (SQL, CSV, pandas, polars, MongoDB, NoSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
KwaiVGI/LivePortrait
Bring portraits to life!
lukas-blecher/LaTeX-OCR
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Vaibhavs10/insanely-fast-whisper
cubiq/ComfyUI_IPAdapter_plus
ali-vilab/AnyDoor
Official implementations for paper: Anydoor: zero-shot object-level image customization
FORTH-ModelBasedTracker/MocapNET
We present MocapNET, a real-time method that estimates the 3D human pose directly in the popular Bio Vision Hierarchy (BVH) format, given estimations of the 2D body joints originating from monocular color images. Our contributions include: (a) A novel and compact 2D pose NSRM representation. (b) A human body orientation classifier and an ensemble of orientation-tuned neural networks that regress the 3D human pose by also allowing for the decomposition of the body into an upper and lower kinematic hierarchy. This permits the recovery of the human pose even in the case of significant occlusions. (c) An efficient Inverse Kinematics solver that refines the neural-network-based solution, providing 3D human pose estimations that are consistent with the limb sizes of a target person (if known). All the above yield a 33% accuracy improvement on the Human 3.6 Million (H3.6M) dataset compared to the baseline method (MocapNET) while maintaining real-time performance.
KevinLTT/video2bvh
Extracts human motion from video and saves it as a BVH mocap file.
royorel/Lifespan_Age_Transformation_Synthesis
Lifespan Age Transformation Synthesis code
melMass/comfy_mtb
Animation oriented nodes pack for ComfyUI
chaojie/ComfyUI-DragNUWA
HW140701/VideoTo3dPoseAndBvh
Convert video to a BVH motion file
Dene33/video_to_bvh
Convert human motion from video to .bvh
lunarring/lunar_tools
Toolkit for interactive exhibitions
SHI-Labs/VCoder
[CVPR 2024] VCoder: Versatile Vision Encoders for Multimodal Large Language Models
CHATS-lab/persuasive_jailbreaker
Persuasive Jailbreaker: we can persuade LLMs to jailbreak them!
vishwasg217/fin-sight
FinSight - Financial Insights at Your Fingertip: FinSight is a cutting-edge AI assistant tailored for portfolio managers, investors, and finance enthusiasts. It streamlines the process of gaining crucial insights and summaries about a company in a user-friendly manner.
leoneversberg/llm-chatbot-rag
A local LLM chatbot with RAG for PDF input files
markhliu/DGAI
Learn Generative AI with PyTorch (Manning Publications, 2024)
MunchkinChen/FADING
driesdepoorter/The-Selfie-Coach
Get instant selfie coaching from Kylie J.
mphirke/video2bvh2.0
https://github.com/Dene33/video_to_bvh but with Python 3 and TensorFlow 2.0