IrohXu
Co-founder of PediaMed AI. CS PhD Student@UIUC. Focus on Autism spectrum disorder, Embodied AI, Generative AI, VLM.
UIUCPalo Alto
Pinned Repositories
Awesome-Multimodal-LLM-Autonomous-Driving
[WACV 2024 Survey Paper] Multimodal Large Language Models for Autonomous Driving
Chromosome_Classification_Deep_Learning_Method
It is a project based on IJCNN's paper Automatic Chromosome Classification using Deep Attention Based Sequence Learning of Chromosome Bands and process some new methods
Gomoku-XYH19
A AI project about Gomoku
Infant-Pose-pytorch
Apply OpenPose and Infant Key-point Dataset to Evaluate Infant Posture
lanenet-lane-detection-pytorch
Unofficial implemention of lanenet model for real time lane detection Pytorch Version
waymo_to_semanticKITTI
Convert waymo open dataset 3D segmentation format to SemanticKITTI format.
MAPLM
[CVPR 2024] MAPLM: A Large-Scale Vision-Language Dataset for Map and Traffic Scene Understanding
AggPose
[IJCAI 2022] Official PyTorch implementation of AggPose: Deep Aggregation Vision Transformer for Infant Pose Estimation
PIE
PIE: Simulating Disease Progression via Progressive Image Editing
ViTASD
[ICASSP 2023] Official Implementation of ViTASD: Robust Vision Transformer Baselines for Autism Spectrum Disorder Facial Diagnosis
IrohXu's Repositories
IrohXu/Awesome-Multimodal-LLM-Autonomous-Driving
[WACV 2024 Survey Paper] Multimodal Large Language Models for Autonomous Driving
IrohXu/lanenet-lane-detection-pytorch
Unofficial implemention of lanenet model for real time lane detection Pytorch Version
IrohXu/waymo_to_semanticKITTI
Convert waymo open dataset 3D segmentation format to SemanticKITTI format.
IrohXu/Infant-Pose-pytorch
Apply OpenPose and Infant Key-point Dataset to Evaluate Infant Posture
IrohXu/MAE-ViT-pytorch
MAE-ViT-pytorch, structure is based on https://github.com/rwightman/pytorch-image-models
IrohXu/VCog-Bench
What is the Visual Cognition Gap between Humans and Multimodal LLMs?
IrohXu/contour-aware-Unet
The realization of different classes of Unet framework including contour-aware-Unet, DCAN, Dual Unet, Attention Unet, Unet++
IrohXu/Crossing_Aggregation_Network
It is a U-Net based network which absorb ideas from deep aggregation layers(DLA), Unet++, ET-Net......
IrohXu/SEC-UNET_SEMANTIC_EMBEDDING_AND_CONTOUR_ASSIST_UNET_FOR_BACTERIA_SEGMENTATION-AND-DETECTION
The number of bacterial types is a critical monitoring indicator for indoor air quality standards. It is a challenging task to cultivate and count colonies of bacteria which is expertise required and time-consuming. In this work, we investigate several U-Net improvement approaches. We are motivated by the assumption that contour information and semantic embedding branch can enhance U-Net's segmentation capacity for blurred and overlapping objects. Therefore, we propose Semantic Embedding and Contour Assist U-Net (SEC-U-Net) for direct bacteria segmentation and a shallow CNN for bacteria classification. This algorithm designed the detection of bacteria as a two-stage segmentation and classification task. Experimental results demonstrate the proposed method outperforms the state-of-the-art improved U-Net approaches on our bacteria dataset. Proposed SEC-U-NET+CNN based training presented over 91% and 85% precision rate for E.coli and S.aureus, respectively.
IrohXu/TRN-pytorch-Temporal-Relational-Reasoning-in-Videos
Implementation for Temporal Relational Reasoning in Videos. This is a NYU course project for DS-GA 3001.004/.005 Introduction to Computer Vision (Spring 2021)
IrohXu/IrohXu
IrohXu/irohxu.github.io
github.io for Iroh Cao
IrohXu/AggPose
AggPose: Deep Aggregation Vision Transformer for Infant Pose Estimation
IrohXu/android-demo-app-for-TB-project
PyTorch android examples of usage in applications
IrohXu/BotBuilder-Samples
Welcome to the Bot Framework samples repository. Here you will find task-focused samples in C#, JavaScript and TypeScript to help you get started with the Bot Framework SDK!
IrohXu/chatgpt-web
IrohXu/Cylinder3D_waymo
Use Cylinder3D in Waymo
IrohXu/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
IrohXu/k-diffusion-med
Karras et al. (2022) diffusion models used for Med data
IrohXu/Lipschitz-Transformer-PyTorch
PyTorch Implementation of Lipschitz Transformer
IrohXu/microsoft-teams-app-checklist
Checklist is a custom Teams message extension app that enables users to Collaborate with their team by creating a shared checklist in a chat or channel. Checklist app is supported across all platforms – Teams desktop, browser, iOS, and Android clients. It is ready for deployment as part of your existing Microsoft 365 subscription.
IrohXu/mmdetection3d_waymo
OpenMMLab's next-generation platform for general 3D object detection.
IrohXu/pet
This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"
IrohXu/PyHealth
A Deep Learning Python Toolkit for Healthcare Applications.
IrohXu/pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
IrohXu/stable-diffusion-webui
Stable Diffusion web UI
IrohXu/TransUNet
This repository includes the official project of TransUNet, presented in our paper: TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation.
IrohXu/Voxel-MAE
Voxel-MAE: Masked Autoencoders for Pre-training Large-scale Point Clouds
IrohXu/vue-admin-template
a vue2.0 minimal admin template
IrohXu/vue-element-admin
:tada: A magical vue admin https://panjiachen.github.io/vue-element-admin