kyleeasterly

kyleeasterly's Stars

unclecode/crawl4ai
🔥🕷️ Crawl4AI: Open-source LLM Friendly Web Crawler & Scrapper
Language:Python16.3k 97 2221.2k
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
12.7k 274 121812
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Language:Python9.3k 127 402870
SWivid/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Language:Python7.3k 71 321877
kyutai-labs/moshi
Language:Python6.8k 78 82531
getomni-ai/zerox
PDF to Markdown with vision models
Language:Python6.5k 26 54356
xjdr-alt/entropix
Entropy Based Sampling and Parallel CoT Decoding
Language:Python3k 58 30311
anysphere/priompt
Prompt design using JSX.
Language:TypeScript2k 23 7111
antimatter15/splat
WebGL 3D Gaussian Splat Viewer
Language:JavaScript2k 31 51206
homebrewltd/ichigo
Local realtime voice AI
Language:Python1.9k 19 6991
Tencent/MimicMotion
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Language:Python1.9k 29 89161
NUS-HPC-AI-Lab/VideoSys
VideoSys: An easy and efficient system for video generation
Language:Python1.8k 27 88123
eloialonso/diamond
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
Language:Python1.6k 19 28101
dvlab-research/ControlNeXt
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
Language:Python1.4k 26 7370
menyifang/MIMO
Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"
1.3k 110 2552
Drexubery/ViewCrafter
Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"
Language:Python924 26 4334
twilio/twilio-csharp
Twilio C#/.NET Helper Library for .NET6+.
Language:C#676 97 379301
Vchitect/Vchitect-2.0
Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models
Language:Python647 7 1317
Azure-Samples/aoai-realtime-audio-sdk
Azure OpenAI code resources for using gpt-4o-realtime capabilities.
Language:TypeScript644 24 37111
LordLiang/DrawingSpinUp
(SIGGRAPH Asia 2024) This is the official PyTorch implementation of SIGGRAPH Asia 2024 paper: DrawingSpinUp: 3D Animation from Single Character Drawings
Language:Python569 9 2453
Vchitect/VEnhancer
Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation
Language:Python461 19 2825
replicate/cog-flux
Cog inference for flux models
Language:Python283 15 631
Stable-X/StableDelight
StableDelight: Revealing Hidden Textures by Removing Specular Reflections
Language:Python215 6 36
mihirp1998/VADER
Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various reward models such as HPS, PickScore, VideoMAE, VJEPA, YOLO, Aesthetics etc.
Language:Python212 7 1614
Cysharp/Claudia
Unofficial Anthropic Claude API client for .NET.
Language:C#167 6 1413
open-mmlab/Live2Diff
Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model.
Language:Python167 7 710
neoxic/ESCape32
BLDC motor control firmware for 32-bit ESCs
Language:C152 15 1035
tghamm/Anthropic.SDK
An unofficial C#/.NET SDK for accessing the Anthropic Claude API
Language:C#75 4 2517
Human-VDM/Human-VDM
Human-VDM: Learning Single-Image 3D Human Gaussian Splatting from Video Diffusion Models
55 16 10
daswer123/deepspeed-windows-wheels
A collection of compiled wheels for deepspeed built for python 3.10 and 3.11 with support for cuda 11.8 and 12.1 for Windows
46 3 33