Oguzhanercan

Torch illuminates vision

https://bilgem.tubitak.gov.tr/Turkiye, Istanbul

Oguzhanercan's Stars

facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language:Python31.3k6.5k
yangxiaofeng/rectified_flow_prior
Official code for paper: Text-to-Image Rectified Flow as Plug-and-Play Priors [ICLR 2025]
Language:Python1184
LingxiaoYang2023/DSG2024
Official pytorch repository for “Guidance with Spherical Gaussian Constraint for Conditional Diffusion”
Language:Python604
pisacode/voice
Language:TypeScript1
roudimit/whisper-flamingo
[Interspeech 2024] Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation
Language:Jupyter Notebook14810
fengredrum/finetune-whisper-lora
Fine-Tune Whisper with Transformers and PEFT
Language:Python55
backspacetg/simul_whisper
Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection
Language:Python625
plutonium-239/memsave_torch
Lowering PyTorch's Memory Consumption for Selective Differentiation
Language:Python101
google/RB-Modulation
Official code for "RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control"
Language:Jupyter Notebook37428
chen-wl20/DreamCinema
DreamCinema: Cinematic Transfer with Free Camera and 3D Character
921
bghira/SimpleTuner
A general fine-tuning kit geared toward diffusion models.
Language:Python2.2k207
YuxinWenRick/diffusion_memorization
Official repo for Detecting, Explaining, and Mitigating Memorization in Diffusion Models (ICLR 2024)
Language:Python708
dvlab-research/ControlNeXt
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
Language:Python1.6k79
guoqincode/DiT-Visualization
Visualization of DiT self attention features
Language:Python19711
adelacvg/detail_tts
All generative model in one for better TTS model
Language:Python668
Young98CN/LoRA_Composer
LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models
Language:Python534
unity-research/IP-Adapter-Instruct
IP Adapter Instruct
Language:Python2044
wenet-e2e/speech-synthesis-paper
List of speech synthesis papers.
1k120
JackAILab/ConsistentID
Customized ID Consistent for human
Language:Python95274
black-forest-labs/flux
Official inference repo for FLUX.1 models
Language:Python21.2k1.5k
XiangZ-0/HiT-SR
[ECCV 2024 - Oral] HiT-SR: Hierarchical Transformer for Efficient Image Super-Resolution
Language:Python1243
LeonHLJ/FouriScale
Official implementation of FouriScale (ECCV2024)
Language:Python1525
ruohaoguo/ovavss
Official Implementation of "Open-Vocabulary Audio-Visual Semantic Segmentation" [ACM MM 2024 Oral].
Language:Python233
Algolzw/daclip-uir
[ICLR 2024] Controlling Vision-Language Models for Universal Image Restoration. 5th place in the NTIRE 2024 Restore Any Image Model in the Wild Challenge.
Language:Python74439
TheLartians/ModernCppStarter
🚀 Kick-start your C++! A template for modern C++ projects using CMake, CI, code coverage, clang-format, reproducible dependency management and much more.
Language:CMake4.7k405
phohenecker/switch-cuda
A simple bash script for switching between installed versions of CUDA.
Language:Shell625142
voxel51/fiftyone
Refine high-quality datasets and visual AI models
Language:Python9.3k614
tinygrad/tinygrad
You like pytorch? You like micrograd? You love tinygrad! ❤️
Language:Python28.5k3.3k
ManimCommunity/manim
A community-maintained Python framework for creating mathematical animations.
Language:Python31.1k2.2k
3b1b/manim
Animation engine for explanatory math videos
Language:Python76.6k6.6k

Oguzhanercan

Oguzhanercan's Stars

facebookresearch/fairseq

yangxiaofeng/rectified_flow_prior

LingxiaoYang2023/DSG2024

pisacode/voice

roudimit/whisper-flamingo

fengredrum/finetune-whisper-lora

backspacetg/simul_whisper

plutonium-239/memsave_torch

google/RB-Modulation

chen-wl20/DreamCinema

bghira/SimpleTuner

YuxinWenRick/diffusion_memorization

dvlab-research/ControlNeXt

guoqincode/DiT-Visualization

adelacvg/detail_tts

Young98CN/LoRA_Composer

unity-research/IP-Adapter-Instruct

wenet-e2e/speech-synthesis-paper

JackAILab/ConsistentID

black-forest-labs/flux

XiangZ-0/HiT-SR

LeonHLJ/FouriScale

ruohaoguo/ovavss

Algolzw/daclip-uir

TheLartians/ModernCppStarter

phohenecker/switch-cuda

voxel51/fiftyone

tinygrad/tinygrad

ManimCommunity/manim

3b1b/manim