Pinned Repositories
AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
CCPD2020
Churn-prediction
Datascience
DeepSeek-VL
DeepSeek-VL: Towards Real-World Vision-Language Understanding
Diabets-prediction
face-segmenteation.Pytorch
gaussian-splatting
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
Gaze_emotion
MaskR-CNN
Mrkomiljon's Repositories
Mrkomiljon/face-segmenteation.Pytorch
Mrkomiljon/AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
Mrkomiljon/DeepSeek-VL
DeepSeek-VL: Towards Real-World Vision-Language Understanding
Mrkomiljon/Diabets-prediction
Mrkomiljon/gaussian-splatting
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
Mrkomiljon/Gaze_emotion
Mrkomiljon/StoryDiffusion
Create Magic Story!
Mrkomiljon/SVD
Mrkomiljon/uzbek-sign-language
#uzbek-sign-language
Mrkomiljon/3d_model_in_panoramic_image
"Projecting a panoramic image onto the plane of a 3D model", 3d machine learning
Mrkomiljon/AI_with_PythonPytorch_Uz
AI(ML,DL) projects for tutorial
Mrkomiljon/awesome-ai-awesomeness
A curated list of awesome awesomeness about artificial intelligence
Mrkomiljon/awesome-notebooks
A powerful data & AI notebook templates catalog: prompts, plugins, models, workflow automation, analytics, code snippets - following the IMO framework to be searchable and reusable in any context.
Mrkomiljon/Awesome-Transformer-Attention
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
Mrkomiljon/developer-roadmap
Interactive roadmaps, guides and other educational content to help developers grow in their careers.
Mrkomiljon/DiNet
to get high res lip sync video
Mrkomiljon/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Mrkomiljon/Gradient_Pytorch_Tensorflow_CIFAR10
Mrkomiljon/Instance_Seg_on_videos
instance segmentation on videos
Mrkomiljon/Language_detection
Mrkomiljon/Lipsync_by_single_image
Mrkomiljon/LLaVA-Windows
[NeurIPS 2023 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards multimodal GPT-4 level capabilities.
Mrkomiljon/LucidDreamer
Official code for the paper "LucidDreamer: Domain-free Generation of 3D Gaussian Splatting Scenes".
Mrkomiljon/Mrkomiljon
Mrkomiljon/MultiModdal
Classifying Multimodal Data using Transformers
Mrkomiljon/OnlySpeakTTS
Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes speech generation much faster by default.
Mrkomiljon/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Mrkomiljon/python_for_microscopists
https://www.youtube.com/channel/UC34rW-HtPJulxr5wp2Xa04w?sub_confirmation=1
Mrkomiljon/TTS_cloning
Mrkomiljon/VLM_materials
Collection of AWESOME vision-language models for vision tasks