Pinned Repositories
garfield
[CVPR'24] Group Anything with Radiance Fields
VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
SciCode
A benchmark that challenges language models to code solutions for scientific problems
battle_game
Chat-UniVi
[CVPR 2024 Highlightš„] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
DrM
DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements in sample efficiency and asymptotic performance across diverse domains.
SciCode
A benchmark that challenges language models to code solutions for scientific problems
segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
textgrad
textgrad
TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.
XuGW-Kevin's Repositories
XuGW-Kevin/DrM
DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements in sample efficiency and asymptotic performance across diverse domains.
XuGW-Kevin/battle_game
XuGW-Kevin/Chat-UniVi
[CVPR 2024 Highlightš„] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
XuGW-Kevin/SciCode
A benchmark that challenges language models to code solutions for scientific problems
XuGW-Kevin/segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
XuGW-Kevin/textgrad