bowen-upenn

Ph.D. candidate in Computer and Information Science

GRASP Lab, University of Pennsylvania, Philadelphia, United States

Pinned Repositories

Agent_Rationality
This is the official repository of the paper "Towards Rationality in Language and Multimodal Agents: A Survey"
25 1 00
AnyText
Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>
Language: Python 00
Awesome-LLM-Reasoning
Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought and OpenAI o1 🍓
0 0 00
CCD
[ICCV2023] Self-supervised Character-to-Character Distillation for Text Recognition
Language: Python 0 0 00
CFR_VQA
Coarse-to-Fine Reasoning for Visual Question Answering (CVPRW'22)
Language: Python 0 0 00
llm_token_bias
[EMNLP 2024] This is the official implementation of the paper "A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners" in PyTorch.
Language: Python 13 3 01
Multi-Agent-VQA
[CVPR 2024 CVinW] Multi-Agent VQA: Exploring Multi-Agent Foundation Models on Zero-Shot Visual Question Answering
Language: Python 9 3 00
Rethinking-Text-Segmentation
[CVPR 2021] Rethinking Text Segmentation: A Novel Dataset and A Text-Specific Refinement Approach
Language: Python 0 0 00
scene_graph_commonsense
[WACV 2025] This is the official implementation of the paper "Enhancing Scene Graph Generation with Hierarchical Relationships and Commonsense Knowledge" in PyTorch.
Language: Python 23 2 52
WildfireGPT
Language: Python 11

bowen-upenn's Repositories

bowen-upenn/scene_graph_commonsense
[WACV 2025] This is the official implementation of the paper "Enhancing Scene Graph Generation with Hierarchical Relationships and Commonsense Knowledge" in PyTorch.
Language: Python 23 2 52
bowen-upenn/MMMA_Rationality
This is the official repository of the paper "Multi-Modal and Multi-Agent Systems Meet Rationality: A Survey"
16 1 00
bowen-upenn/llm_token_bias
[EMNLP 2024] This is the official implementation of the paper "A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners" in PyTorch.
Language: Python 13 3 01
bowen-upenn/Multi-Agent-VQA
[CVPR 2024 CVinW] Multi-Agent VQA: Exploring Multi-Agent Foundation Models on Zero-Shot Visual Question Answering
Language: Python 9 3 00
bowen-upenn/AnyText
Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>
Language: Python 00
bowen-upenn/Awesome-LLM-Reasoning
Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought and OpenAI o1 🍓
0 0 00
bowen-upenn/CCD
[ICCV2023] Self-supervised Character-to-Character Distillation for Text Recognition
Language: Python 0 0 00
bowen-upenn/CFR_VQA
Coarse-to-Fine Reasoning for Visual Question Answering (CVPRW'22)
Language: Python 0 0 00
bowen-upenn/Rethinking-Text-Segmentation
[CVPR 2021] Rethinking Text Segmentation: A Novel Dataset and A Text-Specific Refinement Approach
Language: Python 0 0 00
bowen-upenn/SeeAct
[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).
Language: Python 0 0 00
bowen-upenn/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Language: Python 0 0 00
bowen-upenn/VLSAT
CVPR2023 : VL-SAT: Visual-Linguistic Semantics Assisted Training for 3D Semantic Scene Graph Prediction in Point Cloud
Language: Python 0 0 00
