mk-minchul
Computer Vision Lab Ph.D Student Michigan State University
@Michigan State UniversityMichigan, USA
mk-minchul's Stars
rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
karpathy/LLM101n
LLM101n: Let's build a Storyteller
fastapi/full-stack-fastapi-template
Full stack, modern web application template. Using FastAPI, React, SQLModel, PostgreSQL, Docker, GitHub Actions, automatic HTTPS and more.
iperov/DeepFaceLive
Real-time face swap for PC streaming or video calls
danielmiessler/fabric
fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
magic-research/magic-animate
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
lavague-ai/LaVague
Large Action Model framework to develop AI Web Agents
philz1337x/clarity-upscaler
Clarity AI | AI Image Upscaler & Enhancer - free and open-source Magnific Alternative
getgrit/gritql
GritQL is a query language for searching, linting, and modifying code.
MarkFzp/act-plus-plus
Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN
google-research/big_vision
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
OnedocLabs/react-print-pdf
Build and generate PDF using React đź“„ UI kit for PDFs and print documents. Simple, reusable components and templates to create great invoices, docs, brochures. Use your favorite front-end framework React to build your next PDF.
adrianhajdin/social_media_app
Build a modern social app with a stunning UI with a native mobile feel, a special tech stack, an infinite scroll feature, and amazing performance using React JS, Appwrite, TypeScript, and more.
deedy5/duckduckgo_search
Search for words, documents, images, videos, news, maps and text translation using the DuckDuckGo.com search engine. Downloading files and images to a local hard drive.
GAP-LAB-CUHK-SZ/gaustudio
A Modular Framework for 3D Gaussian Splatting and Beyond
THUDM/ImageReward
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
srush/Triton-Puzzles
Puzzles for learning Triton
ashawkey/nerf2mesh
[ICCV2023] Delicate Textured Mesh Recovery from NeRF via Adaptive Surface Refinement
yoheinakajima/prettygraph
An experimental UI for text-to-knowledge-graph generation
cheind/pytorch-blender
:sweat_drops: Seamless, distributed, real-time integration of Blender into PyTorch data pipelines
ironjr/StreamMultiDiffusion
Official code for the paper "StreamMultiDiffusion: Real-Time Interactive Generation with Region-Based Semantic Control."
meidachen/STPLS3D
🔥 Synthetic and real-world 2d/3d dataset for semantic and instance segmentation (BMVC 2022 Oral)
wang-zidu/3DDFA-V3
The official implementation of 3DDFA_V3 in CVPR2024 (Highlight).
KovenYu/WonderWorld
Code release for https://kovenyu.com/WonderWorld/
emanuelevivoli/awesome-comics-understanding
The official repo of the Comics Survey: "A missing piece in Vision and Language: A Survey on Comics Understanding"
mk-minchul/CVLface
ZONG0004/MacroHFT
Traffic-Alpha/iLLM-TSC
This repository contains the code for the paper“iLLM-TSC: Integration reinforcement learning and large language model for traffic signal control policy improvement”
maswang32/hearinganythinganywhere
Hearing Anything Anywhere Code Release