IrohXu
Co-founder of PediaMed AI. CS PhD student at UIUC. Focus on autism spectrum disorder, embodied AI, generative AI, and VLMs.
UIUC · Palo Alto
Pinned Repositories
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Awesome-Multimodal-LLM-Autonomous-Driving
[WACV 2024 Survey Paper] Multimodal Large Language Models for Autonomous Driving
Chromosome_Classification_Deep_Learning_Method
A project based on the IJCNN paper "Automatic Chromosome Classification using Deep Attention Based Sequence Learning of Chromosome Bands" that also tries out some new methods
Gomoku-XYH19
An AI project about Gomoku
lanenet-lane-detection-pytorch
Unofficial PyTorch implementation of the LaneNet model for real-time lane detection
VCog-Bench
What is the Visual Cognition Gap between Humans and Multimodal LLMs?
waymo_to_semanticKITTI
Convert the Waymo Open Dataset 3D segmentation format to SemanticKITTI format.
MAPLM
[CVPR 2024] MAPLM: A Large-Scale Vision-Language Dataset for Map and Traffic Scene Understanding
PIE
PIE: Simulating Disease Progression via Progressive Image Editing
ViTASD
[ICASSP 2023] Official Implementation of ViTASD: Robust Vision Transformer Baselines for Autism Spectrum Disorder Facial Diagnosis
IrohXu's Repositories
IrohXu/Awesome-Multimodal-LLM-Autonomous-Driving
[WACV 2024 Survey Paper] Multimodal Large Language Models for Autonomous Driving
IrohXu/lanenet-lane-detection-pytorch
Unofficial PyTorch implementation of the LaneNet model for real-time lane detection
IrohXu/waymo_to_semanticKITTI
Convert the Waymo Open Dataset 3D segmentation format to SemanticKITTI format.
IrohXu/VCog-Bench
What is the Visual Cognition Gap between Humans and Multimodal LLMs?
IrohXu/Infant-Pose-pytorch
Apply OpenPose and Infant Key-point Dataset to Evaluate Infant Posture
IrohXu/MAE-ViT-pytorch
MAE-ViT-pytorch; the structure is based on https://github.com/rwightman/pytorch-image-models
IrohXu/Crossing_Aggregation_Network
A U-Net-based network that absorbs ideas from Deep Layer Aggregation (DLA), UNet++, ET-Net, and others
IrohXu/TRN-pytorch-Temporal-Relational-Reasoning-in-Videos
Implementation of Temporal Relational Reasoning in Videos. This is an NYU course project for DS-GA 3001.004/.005 Introduction to Computer Vision (Spring 2021)
IrohXu/IrohXu
IrohXu/irohxu.github.io
github.io for Iroh Cao
IrohXu/AggPose
AggPose: Deep Aggregation Vision Transformer for Infant Pose Estimation
IrohXu/android-demo-app-for-TB-project
PyTorch Android examples of usage in applications
IrohXu/BotBuilder-Samples
Welcome to the Bot Framework samples repository. Here you will find task-focused samples in C#, JavaScript and TypeScript to help you get started with the Bot Framework SDK!
IrohXu/chatgpt-web
IrohXu/Cylinder3D_waymo
Use Cylinder3D in Waymo
IrohXu/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
IrohXu/k-diffusion-med
Karras et al. (2022) diffusion models applied to medical data
IrohXu/Lipschitz-Transformer-PyTorch
PyTorch Implementation of Lipschitz Transformer
IrohXu/microsoft-teams-app-checklist
Checklist is a custom Teams message extension app that enables users to collaborate with their team by creating a shared checklist in a chat or channel. The Checklist app is supported across all platforms: Teams desktop, browser, iOS, and Android clients. It is ready for deployment as part of your existing Microsoft 365 subscription.
IrohXu/mmdetection3d_waymo
OpenMMLab's next-generation platform for general 3D object detection.
IrohXu/MpoxVLM
[ML4H 2024] MpoxVLM: A Vision-Language Model for Diagnosing Skin Lesions from Mpox Virus Infection
IrohXu/MVG
MVG: Medical Video Generation for Disease Progression Simulation
IrohXu/pet
This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"
IrohXu/PyHealth
A Deep Learning Python Toolkit for Healthcare Applications.
IrohXu/pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
IrohXu/stable-diffusion-webui
Stable Diffusion web UI
IrohXu/TransUNet
This repository includes the official project of TransUNet, presented in our paper: TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation.
IrohXu/Voxel-MAE
Voxel-MAE: Masked Autoencoders for Pre-training Large-scale Point Clouds
IrohXu/vue-admin-template
A minimal Vue 2.0 admin template
IrohXu/vue-element-admin
🎉 A magical Vue admin https://panjiachen.github.io/vue-element-admin