tianyu-z
A Ph.D. student in machine learning with Dr. Yoshua Bengio at Mila and an everlasting learner
Mila, Montreal
Pinned Repositories
climate-cooperation-competition
AI for Global Climate Cooperation: Modeling Global Climate Negotiations, Agreements, and Long-Term Cooperation in RICE-N. ai4climatecoop.org
a-github-copy-of-FRB-US-Model
A copy of code from https://www.federalreserve.gov/econres/us-models-python.htm
Andrew_Ng_Deep_Learning_Specialization
Andrew Ng's Deep Learning (Machine Translation with GRU Attention, Car Detection with YOLO, Face Recognition with Siamese)
CLAP
Contrastive Language-Audio Pretraining
CPP_Programming_for_MFE
Kritzman-Regime-Detection
An HMM application in Kritzman Regime Detection
NYU-Rob-Fergus-Computer-Vision
Projects from the Computer Vision course taught by Rob Fergus at NYU Courant
pettingzoo_dilemma_envs
Tactical_Asset_Allocation_with_Ensemble_Learning_Using_Walk_forward_Optimization
Capstone Research Project in NYU Courant
VCR
Official Repo for the paper: VCR: Visual Caption Restoration. Check arxiv.org/pdf/2406.06462 for details.
tianyu-z's Repositories
tianyu-z/VCR
Official Repo for the paper: VCR: Visual Caption Restoration. Check arxiv.org/pdf/2406.06462 for details.
tianyu-z/cloudimage
Personal
tianyu-z/alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
tianyu-z/VCR-wiki-en-easy-test-500
Raw data for VCR-wiki-en-easy-test-500 from https://huggingface.co/datasets/vcr-org/VCR-wiki-en-easy-test-500
tianyu-z/VCR-wiki-zh-easy-test-500
Raw data for VCR-wiki-zh-easy-test-100 from https://huggingface.co/datasets/vcr-org/VCR-wiki-zh-easy-test-100
tianyu-z/VCR-wiki-zh-hard-test-500
Raw data for VCR-wiki-zh-hard-test-500 from https://huggingface.co/datasets/vcr-org/VCR-wiki-zh-hard-test-500
tianyu-z/CogVLM2
GPT4V-level open-source multi-modal model based on Llama3-8B
tianyu-z/decentralized
tianyu-z/DeepSeek-Prover-V1.5
tianyu-z/DLS
decentralized learning scheduler
tianyu-z/EfficientZeroV2
[ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data
tianyu-z/Grounded-Segment-Anything
Grounded-SAM: Marrying Grounding-DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
tianyu-z/InternLM-XComposer
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
tianyu-z/LightZero
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
tianyu-z/LLM_Tree_Search
(ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training
tianyu-z/lmms-eval
Accelerating the development of large multimodal models (LMMs) with lmms-eval
tianyu-z/MCTS-LLM
tianyu-z/mergekit
Tools for merging pretrained large language models.
tianyu-z/MergeLM
Codebase for Merging Language Models
tianyu-z/mPLUG-DocOwl
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
tianyu-z/Nordhaus-OPEN
A github copy of https://yale.app.box.com/s/whlqcr7gtzdm4nxnrfhvap2hlzebuvvm from https://williamnordhaus.com/
tianyu-z/pykan
Kolmogorov Arnold Networks
tianyu-z/pymdp
A Python implementation of active inference for Markov Decision Processes
tianyu-z/Richelieu
tianyu-z/surya
OCR, layout analysis, reading order, line detection in 90+ languages
tianyu-z/TinyLLaVA_Factory
A Framework of Small-scale Large Multimodal Models
tianyu-z/UserActivityTracker
A lightweight real-time tracker of user interactions for WPF. Support both mouse and keyboard actions. Able to save the tracked recording to a string value and play the recorded actions for UI/UX analysis. Support full window monitoring or a specified focus on a particular element. Support saving the initial size and other states upon starting.
tianyu-z/VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
tianyu-z/VCR-wiki-en-hard-test-500
Raw data for VCR-wiki-en-hard-test-500 from https://huggingface.co/datasets/vcr-org/VCR-wiki-en-hard-test-500
tianyu-z/VLMEvalKit
Open-source evaluation toolkit of large vision-language models (LVLMs), support GPT-4v, Gemini, QwenVLPlus, 50+ HF models, 20+ benchmarks