kyleeasterly's Stars
unclecode/crawl4ai
🔥🕷️ Crawl4AI: Open-source LLM Friendly Web Crawler & Scrapper
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
SWivid/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
kyutai-labs/moshi
getomni-ai/zerox
PDF to Markdown with vision models
xjdr-alt/entropix
Entropy Based Sampling and Parallel CoT Decoding
anysphere/priompt
Prompt design using JSX.
antimatter15/splat
WebGL 3D Gaussian Splat Viewer
homebrewltd/ichigo
Local realtime voice AI
Tencent/MimicMotion
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
NUS-HPC-AI-Lab/VideoSys
VideoSys: An easy and efficient system for video generation
eloialonso/diamond
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
dvlab-research/ControlNeXt
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
menyifang/MIMO
Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"
Drexubery/ViewCrafter
Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"
twilio/twilio-csharp
Twilio C#/.NET Helper Library for .NET6+.
Vchitect/Vchitect-2.0
Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models
Azure-Samples/aoai-realtime-audio-sdk
Azure OpenAI code resources for using gpt-4o-realtime capabilities.
LordLiang/DrawingSpinUp
(SIGGRAPH Asia 2024) This is the official PyTorch implementation of SIGGRAPH Asia 2024 paper: DrawingSpinUp: 3D Animation from Single Character Drawings
Vchitect/VEnhancer
Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation
replicate/cog-flux
Cog inference for flux models
Stable-X/StableDelight
StableDelight: Revealing Hidden Textures by Removing Specular Reflections
mihirp1998/VADER
Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various reward models such as HPS, PickScore, VideoMAE, VJEPA, YOLO, Aesthetics etc.
Cysharp/Claudia
Unofficial Anthropic Claude API client for .NET.
open-mmlab/Live2Diff
Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model.
neoxic/ESCape32
BLDC motor control firmware for 32-bit ESCs
tghamm/Anthropic.SDK
An unofficial C#/.NET SDK for accessing the Anthropic Claude API
Human-VDM/Human-VDM
Human-VDM: Learning Single-Image 3D Human Gaussian Splatting from Video Diffusion Models
daswer123/deepspeed-windows-wheels
A collection of compiled wheels for deepspeed built for python 3.10 and 3.11 with support for cuda 11.8 and 12.1 for Windows