yuewang-cuhk
Senior Research Scientist at Salesforce AI Research, building LLMs for Code
Salesforce ResearchSingapore
yuewang-cuhk's Stars
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
bigcode-project/starcoder
Home of StarCoder: fine-tuning & inference!
salesforce/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
salesforce/Merlion
Merlion: A Machine Learning Framework for Time Series Intelligence
salesforce/CodeT5
Home of CodeT5: Open Code LLMs for Code Understanding and Generation
microsoft/CodeBERT
CodeBERT
microsoft/Graphormer
Graphormer is a general-purpose deep learning backbone for molecular modeling.
salesforce/ALBEF
Code for ALBEF: a new vision-language pre-training method
microsoft/CodeXGLUE
CodeXGLUE
salesforce/CodeTF
CodeTF: One-stop Transformer Library for State-of-the-art Code LLM
facebookresearch/vilbert-multi-task
Multi Task Vision and Language
ChenRocks/UNITER
Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"
salesforce/CodeRL
This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (NeurIPS22).
salesforce/PyRCA
PyRCA: A Python Machine Learning Library for Root Cause Analysis
hendrycks/apps
APPS: Automated Programming Progress Standard (NeurIPS 2021)
jasonwu0731/ToD-BERT
Pre-Trained Models for ToD-BERT
wasiahmad/PLBART
Official code of our work, Unified Pre-training for Program Understanding and Generation [NAACL 2021].
danielzuegner/code-transformer
Implementation of the paper "Language-agnostic representation learning of source code from structure and context".
microsoft/methods2test
methods2test is a supervised dataset consisting of Test Cases and their corresponding Focal Methods from a set of Java software repositories
pdlan/OSCAR
Code for ICML 2021 paper: How could Neural Networks understand Programs?
salesforce/QAConv
This repository maintains the QAConv dataset, a question-answering dataset on informative conversations including business emails, panel discussions, and work channels.
eth-sri/TFix
yuewang-cuhk/awesome-programming-language-pretraining-papers
Recent Advances in Programming Language Pre-Trained Models (PL-PTMs)
bdqnghi/awesome-ai4code
A collection of recent papers, benchmarks and datasets of AI4Code domain.
pkuzqh/Recoder
salesforce/VD-BERT
Yifan-Gao/explicit_memory_tracker
[ACL 2020] Explicit Memory Tracker with Coarse-to-Fine Reasoning for Conversational Machine Reading
Yifan-Gao/Discern
[EMNLP 2020] Discern: Discourse-Aware Entailment Reasoning Network for Conversational Machine Reading
guoday/CodeBERT
CodeBERT
yuewang-cuhk/CMKP
Official code and data for EMNLP 2020 paper "Cross-Media Keyphrase Prediction: A Unified Framework with Multi-Modality Multi-Head Attention and Image Wordings"