bowen-upenn
Ph.D. candidate in Computer and Information Science
GRASP Lab, University of Pennsylvania
Philadelphia, United States
bowen-upenn's Stars
The-Run-Philosophy-Organization/run
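The official worldwide GitHub for Run Philosophy (润学): collects its aims, guiding program, theory, and real-world examples of "running" (emigrating); addresses three questions: why run, where to run to, and how to run; and aims to become the core religion and core belief of the new ** people.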
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supports a number of candidate inference solutions, such as HF TGI and vLLM, for local or cloud deployment. Includes demo apps to showcase Meta Llama for WhatsApp & Messenger.
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
atfortes/Awesome-LLM-Reasoning
Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought and OpenAI o1 🍓
cvlab-columbia/viper
Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"
WLiK/LLM4Rec-Awesome-Papers
A list of awesome papers and resources on recommender systems built on large language models (LLMs).
srush/awesome-o1
A bibliography and survey of the papers surrounding o1
opendilab/awesome-diffusion-model-in-rl
A curated list of Diffusion Model in RL resources (continually updated)
OSU-NLP-Group/SeeAct
[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).
wenbowen123/BundleTrack
[IROS 2021] BundleTrack: 6D Pose Tracking for Novel Objects without Instance or Category-Level 3D Models
randyzwitch/streamlit-folium
Streamlit Component for rendering Folium maps
SHI-Labs/Rethinking-Text-Segmentation
[CVPR 2021] Rethinking Text Segmentation: A Novel Dataset and A Text-Specific Refinement Approach
smartyfh/LLM-Uncertainty-Bench
Benchmarking LLMs via Uncertainty Quantification
yflv-yanxia/scene_text
nuster1128/LLM_Agent_Memory_Survey
TongkunGuan/CCD
[ICCV2023] Self-supervised Character-to-Character Distillation for Text Recognition
schen149/sub-sentence-encoder
The official code repo for "Sub-Sentence Encoder: Contrastive Learning of Propositional Semantic Representations".
Storia-AI/font-classify
font-classify
Not-Diamond/awesome-ai-model-routing
A curated list of awesome approaches to AI model routing
LCS2-IIITD/SPARTA_WSDM2022
This repository contains the code and dataset for our paper "Speaker and Time-aware Joint Contextual Learning for Dialogue-act Classification in Counselling Conversations", accepted at the WSDM Conference, 2022.
Yuqifan1117/CaCao
This is the official repository for the paper "Visually-Prompted Language Model for Fine-Grained Scene Graph Generation in an Open World" (Accepted by ICCV 2023)
bowen-upenn/Agent_Rationality
This is the official repository of the paper "Towards Rationality in Language and Multimodal Agents: A Survey"
EternityYW/TRAM-Benchmark
TRAM: Benchmarking Temporal Reasoning for Large Language Models (Findings of ACL 2024)
taeho-kil/Scene-Text-Rectification
Scene text rectification using glyph and character alignment properties
Harryqu123/LMC
[NeurIPS 2023] LMC: Large Model Collaboration with Cross-assessment for Training-Free Open-Set Object Recognition
voxel51/reconstruction-error-ratios
Estimate dataset difficulty and detect label mistakes using reconstruction error ratios!
tyxsspa/AnyText2
denabazazian/scene_text_segmentation
PyTorch implementation for pixel-wise scene text segmentation based on DeepLabV3+
zzjun725/Scene-Graph-Benchmark.pytorch
A new codebase for popular Scene Graph Generation methods (2020). Visualization & Scene Graph Extraction on custom images/datasets are provided. It's also a PyTorch implementation of the paper "Unbiased Scene Graph Generation from Biased Training" (CVPR 2020).
Xieyangxinyu/ClimRRGPT-beta