Pinned Repositories
openvla
OpenVLA: An open-source vision-language-action model for robotic manipulation.
VAR
[NeurIPS 2024 Best Paper] [GPT beats diffusion🔥] [scaling laws in visual generation📈] Official implementation of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
DocLayout-YOLO
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
PDF-Extract-Kit
A Comprehensive Toolkit for High-Quality PDF Content Extraction
sglang
SGLang is a fast serving framework for large language models and vision language models.
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
luciaganlulu's Repositories
luciaganlulu doesn’t have any repository yet.