petermckj

GreyLondon

petermckj's Stars

gpt-engineer-org/gpt-engineer
Platform to experiment with the AI Software Engineer. Terminal based. NOTE: Very different from https://gptengineer.app
Language: Python 52.7k 516 4856.9k
QuivrHQ/quivr
Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Any way you want.
Language: Python 37k 285 1.5k3.6k
CMU-Perceptual-Computing-Lab/openpose
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
Language: C++ 31.5k 924 2k7.9k
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell.
Language: Python 30.2k 217 2543k
RVC-Project/Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
Language: Python 25.5k 180 1.7k3.7k
HumanAIGC/AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
14.5k 673 94979
Sinaptik-AI/pandas-ai
Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
Language: Python 13.9k 113 7661.4k
KwaiVGI/LivePortrait
Bring portraits to life!
Language: Python 13.4k 123 3931.4k
lukas-blecher/LaTeX-OCR
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Language: Python 13.2k 73 2751.1k
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Language: Python 11k 98 8171.1k
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language: Python 8k 76 226600
Vaibhavs10/insanely-fast-whisper
Language: Jupyter Notebook 7.9k 68 199554
cubiq/ComfyUI_IPAdapter_plus
Language: Python 4.4k 39 682331
ali-vilab/AnyDoor
Official implementations for paper: Anydoor: zero-shot object-level image customization
Language: Python 4.1k 87 103367
FORTH-ModelBasedTracker/MocapNET
We present MocapNET, a real-time method that estimates the 3D human pose directly in the popular Bio Vision Hierarchy (BVH) format, given estimations of the 2D body joints originating from monocular color images. Our contributions include: (a) A novel and compact 2D pose NSRM representation. (b) A human body orientation classifier and an ensemble of orientation-tuned neural networks that regress the 3D human pose by also allowing for the decomposition of the body to an upper and lower kinematic hierarchy. This permits the recovery of the human pose even in the case of significant occlusions. (c) An efficient Inverse Kinematics solver that refines the neural-network-based solution providing 3D human pose estimations that are consistent with the limb sizes of a target person (if known). All the above yield a 33% accuracy improvement on the Human 3.6 Million (H3.6M) dataset compared to the baseline method (MocapNET) while maintaining real-time performance
Language: C++ 867 36 128137
KevinLTT/video2bvh
Extracts human motion from video and saves it as a BVH mocap file.
Language: Python 586 17 5592
royorel/Lifespan_Age_Transformation_Synthesis
Lifespan Age Transformation Synthesis code
Language: Python 583 16 30130
melMass/comfy_mtb
Animation oriented nodes pack for ComfyUI
Language: Python 484 12 16956
chaojie/ComfyUI-DragNUWA
Language: Python 397 3 2830
HW140701/VideoTo3dPoseAndBvh
Convert video to a BVH motion file
Language: Python 391 10 3463
Dene33/video_to_bvh
Convert human motion from video to .bvh
Language: Jupyter Notebook 378 24 43112
lunarring/lunar_tools
toolkit for interactive exhibitions
Language: Python 282 5 125
SHI-Labs/VCoder
[CVPR 2024] VCoder: Versatile Vision Encoders for Multimodal Large Language Models
Language: Python 269 9 815
CHATS-lab/persuasive_jailbreaker
Persuasive Jailbreaker: we can persuade LLMs to jailbreak them!
Language: HTML 265 4 419
vishwasg217/fin-sight
FinSight - Financial Insights at Your Fingertip: FinSight is a cutting-edge AI assistant tailored for portfolio managers, investors, and finance enthusiasts. It streamlines the process of gaining crucial insights and summaries about a company in a user-friendly manner.
Language: Jupyter Notebook 203 6 1176
leoneversberg/llm-chatbot-rag
A local LLM chatbot with RAG for PDF input files
Language: Jupyter Notebook 69 2 327
markhliu/DGAI
Learn Generative AI with PyTorch (Manning Publications, 2024)
Language: Jupyter Notebook 62 2 328
MunchkinChen/FADING
Language: Python 29 3 53
driesdepoorter/The-Selfie-Coach
Get instant selfie coaching from Kylie J.
Language: Python 12 1 01
mphirke/video2bvh2.0
https://github.com/Dene33/video_to_bvh but with Python 3 and TensorFlow 2.0
Language: Jupyter Notebook 7 2 43
