Pinned Repositories
lifelong-memory
Code for LifelongMemory: Leveraging LLMs for Answering Queries in Long-form Egocentric Videos
bb-handwriting-model
CRNN handwritten ledger recognition
episodic-memory
LifelongMemory
LLM-Inner-Speech
The third place solution to Ego4d NLQ Challenge 2023
lmms-finetune
A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, llama-3.2-vision, qwen-vl, qwen2-vl, phi3-v etc.
M2IB
Code for the paper Visual Explanations of Image–Text Representations via Multi-Modal Information Bottleneck Attribution
mais-bootcamp-w2019
Course content for MAIS 202 (Winter 2019) based off of Machine Learning UC Berkeley material: https://github.com/mlberkeley/Machine-Learning-Decal-Fall-2018
McGill_ComputerScience
xMDETR
Adapting Grounded Visual Question Answering Models to Low Resource Languages
YingWANGG's Repositories
YingWANGG/M2IB
Code for the paper Visual Explanations of Image–Text Representations via Multi-Modal Information Bottleneck Attribution
YingWANGG/xMDETR
Adapting Grounded Visual Question Answering Models to Low Resource Languages
YingWANGG/LLM-Inner-Speech
The third place solution to Ego4d NLQ Challenge 2023
YingWANGG/bb-handwriting-model
CRNN handwritten ledger recognition
YingWANGG/episodic-memory
YingWANGG/LifelongMemory
YingWANGG/lmms-finetune
A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, llama-3.2-vision, qwen-vl, qwen2-vl, phi3-v etc.
YingWANGG/mais-bootcamp-w2019
Course content for MAIS 202 (Winter 2019) based off of Machine Learning UC Berkeley material: https://github.com/mlberkeley/Machine-Learning-Decal-Fall-2018
YingWANGG/McGill_ComputerScience
YingWANGG/McGill_Finance
YingWANGG/McGill_Math
YingWANGG/YingWANGG.github.io