ChiCheng123
I am a fifth-year bachlor-straight-to-PhD student and my research interest includes object detection, face detection and robotic grasp.
IECAS, CASIABeijing, China
ChiCheng123's Stars
JeffreyYH/Awesome-Generalist-Robots-via-Foundation-Models
Paper list in the survey paper: Toward General-Purpose Robots via Foundation Models: A Survey and Meta-Analysis
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
huangwl18/ReKep
ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Hedlen/awesome-segment-anything
Tracking and collecting papers/projects/others related to Segment Anything.
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
lichao-sun/Mora
Mora: More like Sora for Generalist Video Generation
BAAI-DCAI/Bunny
A family of lightweight multimodal models.
mini-sora/minisora
MiniSora: A community aims to explore the implementation path and future development direction of Sora.
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
PixArt-alpha/PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
mlfoundations/open_clip
An open source implementation of CLIP.
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
modelscope/lite-sora
An initiative to replicate Sora
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
JeffWang987/DriveDreamer
[ECCV 2024] DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving
WooooDyy/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
klintan/pytorch-lanenet
LaneNet implementation in PyTorch
IrohXu/lanenet-lane-detection-pytorch
Unofficial implemention of lanenet model for real time lane detection Pytorch Version
harryhan618/LaneNet
Pytorch implementation of "Towards end-to-end lane detection: an instance segmentation approach"
MaybeShewill-CV/lanenet-lane-detection
Unofficial implemention of lanenet model for real time lane detection
LLVM-AD/MAPLM
[CVPR 2024] MAPLM: A Large-Scale Vision-Language Dataset for Map and Traffic Scene Understanding
chaytonmin/Awesome-Occupancy-Prediction-Autonomous-Driving
Awesome papers about Multi-Camera Semantic Occupancy Prediction, such as TPVFormer, OccFormer, Occ3D, OpenOccupancy
ArtificialZeng/Baichuan2-Explained
Baichuan2代码的逐行解析版本,适合小白
qiantianwen/NuScenes-QA
[AAAI 2024] NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving Scenario.
OpenDriveLab/DriveLM
[ECCV 2024 Oral] DriveLM: Driving with Graph Visual Question Answering
Xwin-LM/Xwin-LM
Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment
wudongming97/Prompt4Driving
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.