dali92002

Researcher in Computer Vision

Computer Vision CenterBarcelona

dali92002's Stars

aiintelligentsystems/next-level-bert
Language:Python16
emanuelevivoli/awesome-comics-understanding
The official repo of the Comics Survey: "A missing piece in Vision and Language: A Survey on Comics Understanding"
894
wangkai930418/awesome-diffusion-categorized
collection of diffusion model papers categorized by their subareas
1.4k66
arogozhnikov/einops
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
Language:Python8.6k355
FoundationVision/VAR
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
Language:Jupyter Notebook6.5k430
andreybarsky/annotation
annotation system for labelling bounding boxes using openCV
Language:Python1
ayanban011/GraphKD
[ICDAR 2024] (Best Student Paper🏆) Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation
Language:Python12
hecoding/Hyper-Modulation
Official Implementation for "Transferring Unconditional to Conditional GANs with Hyper-Modulation" CVPRW 22 https://arxiv.org/abs/2112.02219
Language:Python131
microsoft/table-transformer
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.
Language:Python2.4k264
voxel51/fiftyone
Refine high-quality datasets and visual AI models
Language:Python9k583
jyf588/transformer-inertial-poser
Python implementation accompanying the Transformer Inertial Poser paper at SIGGRAPH Asia 2022
Language:Python7611
Xinyu-Yi/EgoLocate
A real-time system that simultaneously captures human pose, reconstructs the scene in sparse 3D points, and localizes the human in the scene with 6 IMUs and a body-worn phone camera
Language:C++9419
leitro/LabelAdaptiveMixup-SER
Language:Python6
rubenpt91/PFL-DocVQA-Competition
Language:Python193
eth-siplab/AvatarPoser
Official Code for ECCV 2022 paper "AvatarPoser: Articulated Full-Body Pose Tracking from Sparse Motion Sensing"
Language:Python29750
lllyasviel/ControlNet
Let us control diffusion models!
Language:Python31.1k2.8k
rossumai/docile
DocILE: Document Information Localization and Extraction Benchmark
Language:Python1199
lucidrains/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Language:Python8.6k1.1k
CompVis/stable-diffusion
A latent text-to-image diffusion model
Language:Jupyter Notebook68.9k10.2k
weixi-feng/Structured-Diffusion-Guidance
Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis
Language:Jupyter Notebook31120
CompVis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Language:Jupyter Notebook12.1k1.6k
sjvasquez/handwriting-synthesis
Handwriting Synthesis with RNNs ✏️
Language:Python4.4k600
andreagemelli/doc2graph
Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.
Language:Jupyter Notebook11720
furkanbiten/idl_data
OCR Annotations from Amazon Textract for Industry Documents Library
Language:Python1016
ivy-llc/ivy
Convert Machine Learning Code Between Frameworks
Language:Python14k5.7k
ayanban011/GACNN
Generative Adverserial Convolutional Neural Network
Language:Python9
ayanban011/jNMF
Discovering De-similarities of Modular Structure Between Tumor Cells and Normal Cells by Integrating Multiple Data Sources Through Joint Non-Negative Matrix Factorization
Language:R9
ayanban011/dct-dft-fft-craft
DCT-DFT-FFT Based Method for Text Detection in Underwater Images
Language:Jupyter Notebook3
ayanban011/HAGNN
Gene Selection of Microarray Data using Heatmap Analysis and Graph Neural Network
Language:Jupyter Notebook2
ayanban011/Machine-Learning
In the summer 2020, I have get a chance to learn machine learning from Andrew Ng, coursework organised by Stanford University. Here, I am going to upload all the assignment done by me during the coursework.
Language:Jupyter Notebook51

dali92002

dali92002's Stars

aiintelligentsystems/next-level-bert

emanuelevivoli/awesome-comics-understanding

wangkai930418/awesome-diffusion-categorized

arogozhnikov/einops

FoundationVision/VAR

andreybarsky/annotation

ayanban011/GraphKD

hecoding/Hyper-Modulation

microsoft/table-transformer

voxel51/fiftyone

jyf588/transformer-inertial-poser

Xinyu-Yi/EgoLocate

leitro/LabelAdaptiveMixup-SER

rubenpt91/PFL-DocVQA-Competition

eth-siplab/AvatarPoser

lllyasviel/ControlNet

rossumai/docile

lucidrains/denoising-diffusion-pytorch

CompVis/stable-diffusion

weixi-feng/Structured-Diffusion-Guidance

CompVis/latent-diffusion

sjvasquez/handwriting-synthesis

andreagemelli/doc2graph

furkanbiten/idl_data

ivy-llc/ivy

ayanban011/GACNN

ayanban011/jNMF

ayanban011/dct-dft-fft-craft

ayanban011/HAGNN

ayanban011/Machine-Learning