IrohXu

Co-founder of PediaMed AI. CS PhD student at UIUC. Focus on autism spectrum disorder, embodied AI, generative AI, and VLMs.

UIUC, Palo Alto

Pinned Repositories

diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Language: Python 27.2k 212 4.4k 5.6k
Awesome-Multimodal-LLM-Autonomous-Driving
[WACV 2024 Survey Paper] Multimodal Large Language Models for Autonomous Driving
250 9 111
Chromosome_Classification_Deep_Learning_Method
A project based on the IJCNN paper "Automatic Chromosome Classification using Deep Attention Based Sequence Learning of Chromosome Bands" that also explores some new methods.
Language: Jupyter Notebook 9 0 03
Gomoku-XYH19
An AI project about Gomoku
Language: Python 6 0 02
lanenet-lane-detection-pytorch
Unofficial implementation of the LaneNet model for real-time lane detection (PyTorch version)
Language: Python 155 2 3138
VCog-Bench
What is the Visual Cognition Gap between Humans and Multimodal LLMs?
Language: Python 6 1 10
waymo_to_semanticKITTI
Convert the Waymo Open Dataset 3D segmentation format to SemanticKITTI format.
Language: Python 17 1 01
MAPLM
[CVPR 2024] MAPLM: A Large-Scale Vision-Language Dataset for Map and Traffic Scene Understanding
Language: Python 119 1 83
PIE
PIE: Simulating Disease Progression via Progressive Image Editing
Language: Python 27 3 21
ViTASD
[ICASSP 2023] Official Implementation of ViTASD: Robust Vision Transformer Baselines for Autism Spectrum Disorder Facial Diagnosis
Language: Python 24 4 115

IrohXu's Repositories

IrohXu/Awesome-Multimodal-LLM-Autonomous-Driving
[WACV 2024 Survey Paper] Multimodal Large Language Models for Autonomous Driving
250 9 111
IrohXu/lanenet-lane-detection-pytorch
Unofficial implementation of the LaneNet model for real-time lane detection (PyTorch version)
Language: Python 155 2 3138
IrohXu/waymo_to_semanticKITTI
Convert the Waymo Open Dataset 3D segmentation format to SemanticKITTI format.
Language: Python 17 1 01
IrohXu/VCog-Bench
What is the Visual Cognition Gap between Humans and Multimodal LLMs?
Language: Python 6 1 10
IrohXu/Infant-Pose-pytorch
Apply OpenPose and Infant Key-point Dataset to Evaluate Infant Posture
Language: C++ 5 1 01
IrohXu/MAE-ViT-pytorch
MAE-ViT-pytorch; the structure is based on https://github.com/rwightman/pytorch-image-models
Language: Python 4 1 00
IrohXu/Crossing_Aggregation_Network
A U-Net-based network that absorbs ideas from deep layer aggregation (DLA), UNet++, ET-Net, etc.
Language: Python 2 1 0
IrohXu/TRN-pytorch-Temporal-Relational-Reasoning-in-Videos
Implementation for Temporal Relational Reasoning in Videos. This is a NYU course project for DS-GA 3001.004/.005 Introduction to Computer Vision (Spring 2021)
Language: Jupyter Notebook 2 1 00
IrohXu/IrohXu
1 1 0
IrohXu/irohxu.github.io
github.io for Iroh Cao
Language: CSS 1 1 0
IrohXu/AggPose
AggPose: Deep Aggregation Vision Transformer for Infant Pose Estimation
Language: Python 0 0
IrohXu/android-demo-app-for-TB-project
PyTorch android examples of usage in applications
Language: Java 0 0
IrohXu/BotBuilder-Samples
Welcome to the Bot Framework samples repository. Here you will find task-focused samples in C#, JavaScript and TypeScript to help you get started with the Bot Framework SDK!
Language: C# 0 0
IrohXu/chatgpt-web
Language: Vue 0 0
IrohXu/Cylinder3D_waymo
Use Cylinder3D in Waymo
Language: Python 0 0
IrohXu/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Language: Python 0 0
IrohXu/k-diffusion-med
Karras et al. (2022) diffusion models applied to medical data
Language: Python 0 0
IrohXu/Lipschitz-Transformer-PyTorch
PyTorch Implementation of Lipschitz Transformer
Language: Python 1 0
IrohXu/microsoft-teams-app-checklist
Checklist is a custom Teams message extension app that enables users to collaborate with their team by creating a shared checklist in a chat or channel. The Checklist app is supported across all platforms: Teams desktop, browser, iOS, and Android clients. It is ready for deployment as part of your existing Microsoft 365 subscription.
Language: TypeScript 0 0
IrohXu/mmdetection3d_waymo
OpenMMLab's next-generation platform for general 3D object detection.
Language: Python 0 0
IrohXu/MpoxVLM
[ML4H 2024] MpoxVLM: A Vision-Language Model for Diagnosing Skin Lesions from Mpox Virus Infection
IrohXu/MVG
MVG: Medical Video Generation for Disease Progression Simulation
IrohXu/pet
This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"
Language: Python 0 0
IrohXu/PyHealth
A Deep Learning Python Toolkit for Healthcare Applications.
IrohXu/pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
Language: Python 0 0
IrohXu/stable-diffusion-webui
Stable Diffusion web UI
Language: Python 0 0
IrohXu/TransUNet
This repository includes the official project of TransUNet, presented in our paper: TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation.
Language: Python 0 0
IrohXu/Voxel-MAE
Voxel-MAE: Masked Autoencoders for Pre-training Large-scale Point Clouds
Language: Python 0 0
IrohXu/vue-admin-template
a vue2.0 minimal admin template
Language: JavaScript 0 0
IrohXu/vue-element-admin
🎉 A magical vue admin https://panjiachen.github.io/vue-element-admin
Language: Vue 0 0