Lwt-diamond's Stars
Liangyh18/COD_survey
Zhaozixiang1228/GDSR-DCTNet
[CVPR 2022 Oral] Official implementation for "Discrete Cosine Transform Network for Guided Depth Map Super-Resolution."
kaix90/DCTNet
bghira/SimpleTuner
A general fine-tuning kit geared toward diffusion models.
krahets/hello-algo
《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version ongoing
vita-epfl/ttt-plus-plus
[NeurIPS21] TTT++: When Does Self-supervised Test-time Training Fail or Thrive?
opendatalab/MinerU
A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
conda-forge/miniforge
A conda-forge distribution.
ashawkey/stable-dreamfusion
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
ocrmypdf/OCRmyPDF
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
MarkMoHR/Awesome-Referring-Image-Segmentation
:books: A collection of papers about Referring Image Segmentation.
henghuiding/MeViS
[ICCV 2023] MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions
verlab/accelerated_features
Implementation of XFeat (CVPR 2024). Do you need robust and fast local feature extraction? You are in the right place!
Atten4Vis/MS-DETR
[CVPR 2024] The official implementation for "MS-DETR: Efficient DETR Training with Mixed Supervision"
Lordog/dive-into-llms
《动手学大模型Dive into LLMs》系列编程实践教程
usr922/vgtr
[ICME'22] Visual Grounding with Transformers
ashishpatel26/LLM-Finetuning
LLM Finetuning with peft
Janspiry/Image-Super-Resolution-via-Iterative-Refinement
Unofficial implementation of Image Super-Resolution via Iterative Refinement by Pytorch
OUCVisionGroup/CLIP-UIE
Underwater Image Enhancement by Diffusion Model with Customized CLIP-Classifier
NielsRogge/Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
sfzhang15/ATSS
Bridging the Gap Between Anchor-based and Anchor-free Detection via Adaptive Training Sample Selection, CVPR, Oral, 2020
beichenzbc/Long-CLIP
[ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"
hszhao/semseg
Semantic Segmentation in Pytorch
dvlab-research/PFENet
PFENet: Prior Guided Feature Enrichment Network for Few-shot Segmentation (TPAMI).
dvlab-research/ControlNeXt
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
dvlab-research/TagCLIP
dvlab-research/LISA
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
dvlab-research/MGM
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
google-research/omniglue
Code release for CVPR'24 submission 'OmniGlue'