Ericforfun's Stars
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
THUDM/AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
bazingagin/npc_gzip
Code for Paper: “Low-Resource” Text Classification: A Parameter-Free Classification Method with Compressors
One-2-3-45/One-2-3-45
[NeurIPS 2023] Official code of "One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization"
YehLi/xmodaler
X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).
zhengchen1999/DAT
PyTorch code for our ICCV 2023 paper "Dual Aggregation Transformer for Image Super-Resolution"
liyunsheng13/micronet
Srameo/LED
[ICCV 2023] Lighting Every Darkness in Two Pairs: A Calibration-Free Pipeline for RAW Denoising && [Arxiv 2023] Make Explicit Calibration Implicit: Calibrate Denoiser Instead of the Noise Model
jiggy-ai/pair
REPL environment for GPT pair programming
zhihou7/BatchFormer
CVPR2022, BatchFormer: Learning to Explore Sample Relationships for Robust Representation Learning, https://arxiv.org/abs/2203.01522
AmineDiro/cria
OpenAI compatible API for serving LLAMA-2 model
UMass-Foundation-Model/Co-LLM-Agents
Source codes for the paper "Building Cooperative Embodied Agents Modularly with Large Language Models"
3d-vista/3D-VisTA
Official implementation of ICCV 2023 paper "3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment"
rgbkrk/chatlab
⚡️🧪 Fast LLM Tool Calling Experimentation, big and smol
AIR-DISCOVER/TOIST
[NeurIPS 2022] TOIST: Task Oriented Instance Segmentation Transformer with Noun-Pronoun Distillation
wyf0912/ExposureDiffusion
[ICCV 2023] ExposureDiffusion: Learning to Expose for Low-light Image Enhancement
kernelmachine/silo-lm
SILO Language Models code repository
bo-miao/SgMg
[ICCV 2023] Spectrum-guided Multi-granularity Referring Video Object Segmentation.
zyhbili/LivelySpeaker
[ICCV-2023] The official repo for the paper "LivelySpeaker: Towards Semantic-aware Co-Speech Gesture Generation".
dki-lab/Pangu
Code for reproducing the ACL'23 paper: Don't Generate, Discriminate: A Proposal for Grounding Language Models to Real-World Environments
lyx0208/3dSwap
Code and project page for "3D-aware Face Swapping" in CVPR 2023
wpy1999/BAS
[CVPR2022] PyTorch implementation of ''Background Activation Suppression for Weakly Supervised Object Localization''.
rpl-cmu/neusis
Release code for Neural Implicit Surface Reconstruction using Imaging Sonar (ICRA 2023)
bytedance/DQ-Det
Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation
islamamirul/PermuteNet
KoMyeongJin/SpecDiff-GAN
Adversarial Training of Denoising Diffusion Model Using Dual Discriminators for High-Fidelity Multi-Speaker TTS
ziweiWWANG/EFR
Code and dataset for paper "A Linear Comb Filter for Event Flicker Removal", ICRA 2022. An asynchronous linear filter to preprocess event data to remove unwanted flicker events from an event stream.
clear-nus/NCDSSM
PyTorch implementation of the NCDSSM models presented in the ICML '23 paper "Neural Continuous-Discrete State Space Models for Irregularly-Sampled Time Series".
NeurAI-Lab/DoGo
This is the official repo for the CVPR 2021 L2ID paper "Distill on the Go: Online knowledge distillation in self-supervised learning"
luilui97/DSPP
ACCV2022 Source Code of paper "Feature Decoupled Knowledge Distillation via Spatial Pyramid Pooling"