Pinned Repositories
albumentations
Fast image augmentation library and easy-to-use wrapper around other libraries
angel
A Flexible and Powerful Parameter Server for large-scale machine learning
ARM_NEON_2_x86_SSE
AutoAugment
Unofficial implementation of the ImageNet, CIFAR-10 and SVHN augmentation policies learned by AutoAugment, using Pillow
awesome-text-to-image-studies
A collection of awesome text-to-image generation studies.
Awesome-Trajectory-Motion-Prediction-Papers
Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
Awesome-VLM-AD-ITS
This repository collects research papers on large Vision Language Models in Autonomous Driving and Intelligent Transportation Systems. It will be continuously updated to track the latest developments.
benchmarking-chinese-text-recognition
This repository contains datasets and baselines for benchmarking Chinese text recognition.
ML-Tutorial-Experiment
Coding the Machine Learning Tutorial for Learning to Learn
KevenLee's Repositories
KevenLee/awesome-text-to-image-studies
A collection of awesome text-to-image generation studies.
KevenLee/Awesome-Trajectory-Motion-Prediction-Papers
KevenLee/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
KevenLee/Awesome-VLM-AD-ITS
This repository collects research papers on large Vision Language Models in Autonomous Driving and Intelligent Transportation Systems. It will be continuously updated to track the latest developments.
KevenLee/benchmarking-chinese-text-recognition
This repository contains datasets and baselines for benchmarking Chinese text recognition.
KevenLee/ChatSim
[CVPR2024 Highlight] Editable Scene Simulation for Autonomous Driving via LLM-Agent Collaboration
KevenLee/Chinese-LLaVA
An open-source, commercially usable multimodal model that supports bilingual Chinese-English vision-text dialogue.
KevenLee/DriveArena
DriveArena: A Closed-loop Generative Simulation Platform for Autonomous Driving
KevenLee/End-to-end-Autonomous-Driving
[IEEE T-PAMI 2024] All you need for End-to-end Autonomous Driving
KevenLee/FudanOCR
A toolbox of scene text super-resolution and recognition
KevenLee/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
KevenLee/kohya-trainer
Adapted from https://note.com/kohya_ss/n/nbf7ce8d80f29 for easier cloning
KevenLee/Long-CLIP
[ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"
KevenLee/LTX-Video
Official repository for LTX-Video
KevenLee/MagicDriveDiT
Official implementation of the paper “MagicDriveDiT: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control”
KevenLee/MakeLongVideo
Implementation of long video generation
KevenLee/munkres-cpp
Kuhn-Munkres (Hungarian) Algorithm in C++
KevenLee/MVPbev
[ACM MM24 Poster] Official implementation of paper "MVPbev: Multi-view Perspective Image Generation from BEV with Test-time Controllability and Generalizability"
KevenLee/navsim
[NeurIPS 2024] NAVSIM: Data-Driven Non-Reactive Autonomous Vehicle Simulation and Benchmarking
KevenLee/OpenLane-V2
[NeurIPS 2023 Track Datasets and Benchmarks] OpenLane-V2: The First Perception and Reasoning Benchmark for Road Driving
KevenLee/panacea
[CVPR2024] Official Repository of Paper "Panacea: Panoramic and Controllable Video Generation for Autonomous Driving"
KevenLee/PerlDiff
PerlDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Models
KevenLee/pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
KevenLee/QualityScaler
Image/video deep learning upscaler app for Windows - BSRGAN & RealSR_JPEG
KevenLee/rectified-flow
Flow Matching (Rectified Flow) hand-built from scratch
KevenLee/scene_text
KevenLee/TopoMLP
[ICLR2024] TopoMLP: A Simple yet Strong Pipeline for Driving Topology Reasoning
KevenLee/trocr-chinese
Transformers OCR for Chinese
KevenLee/WeatherDG
Official implementation for "WeatherDG: LLM-assisted Procedural Weather Generation for Domain-Generalized Semantic Segmentation"
KevenLee/Yolox_augment
Adds some features to YOLOX