Pinned Repositories
MMMU
This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"
Pangea
This is the repo for the paper "Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages"
OpenCodeInterpreter
OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophisticated proprietary systems like the GPT-4 Code Interpreter. It significantly enhances code generation capabilities by integrating execution and iterative refinement functionalities.
AttrScore
Code, datasets, and models for the paper "Automatic Evaluation of Attribution by Large Language Models"
MAmmoTH
Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" (ICLR 2024)
MAmmoTH2
Official code for "MAmmoTH2: Scaling Instructions from the Web" (NeurIPS 2024)
BioNEV
Graph Embedding Evaluation / Code and Datasets for "Graph Embedding on Biomedical Networks: Methods, Applications, and Evaluations" (Bioinformatics 2020)
CliniRC
Code for the paper "Clinical Reading Comprehension: A Thorough Analysis of the emrQA Dataset" (ACL 2020)
DP-Forward
SanText
Code for Findings of ACL 2021 "Differential Privacy for Text Analytics via Natural Text Sanitization"
xiangyue9607's Repositories
xiangyue9607/BioNEV
Graph Embedding Evaluation / Code and Datasets for "Graph Embedding on Biomedical Networks: Methods, Applications, and Evaluations" (Bioinformatics 2020)
xiangyue9607/SanText
Code for Findings of ACL 2021 "Differential Privacy for Text Analytics via Natural Text Sanitization"
xiangyue9607/DP-Forward
xiangyue9607/CliniRC
Code for the paper "Clinical Reading Comprehension: A Thorough Analysis of the emrQA Dataset" (ACL 2020)
xiangyue9607/QVE
Code for the ACL 2022 paper "Synthetic Question Value Estimation for Domain Adaptation of Question Answering"
xiangyue9607/SCMFDD
Code and dataset for "Predicting drug-disease associations by using similarity constrained matrix factorization"
xiangyue9607/Sentence-LDP
Code for the WWW'23 paper "Sanitizing Sentence Embeddings (and Labels) for Local Differential Privacy"
xiangyue9607/C-MORE
Code for the ACL 2022 paper "C-MORE: Pretraining to Answer Open-Domain Questions by Consulting Millions of References"
xiangyue9607/single-cell-classification
We use two kinds of neural networks, a Multilayer Perceptron (MLP) and a Recurrent Neural Network (RNN), to predict the single-cell cycle stage from transcriptome data.
xiangyue9607/xiangyue9607.github.io
xiangyue9607/iManager
An amazing online platform to display campus information based on students' personal interests.
xiangyue9607/medical-data
xiangyue9607/NRLPapers
Must-read papers on network representation learning (NRL) / network embedding (NE)
xiangyue9607/awesome-rnn
Recurrent Neural Network - A curated list of resources dedicated to RNN
xiangyue9607/ciss2_materials
Let's put all materials into this repository
xiangyue9607/ClarityNLP
An NLP framework for clinical phenotyping. Docker | Python | Solr | OMOP. http://claritynlp.readthedocs.io/en/latest/
xiangyue9607/DrQA
Reading Wikipedia to Answer Open-Domain Questions
xiangyue9607/embeddings
Code for AMIA CRI 2016 paper "Learning Low-Dimensional Representations of Medical Concepts" (http://cs.nyu.edu/~dsontag/papers/ChoiChiuSontag_AMIA_CRI16.pdf)
xiangyue9607/google-10000-english
This repo contains a list of the 10,000 most common English words in order of frequency, as determined by n-gram frequency analysis of Google's Trillion Word Corpus.
xiangyue9607/Interactive-Semantic-Parsing
Interactive Semantic Parsing for If-Then Recipes via Hierarchical Reinforcement Learning
xiangyue9607/MaterialDrawer
The flexible, easy to use, all in one drawer library for your Android project.
xiangyue9607/miband-sdk-android
Xiaomi Mi Band SDK
xiangyue9607/natural-language-processing
Resources for "Natural Language Processing" Coursera course.
xiangyue9607/nvm
Node Version Manager - Simple bash script to manage multiple active node.js versions
xiangyue9607/OpenBookQA
Code for experiments on OpenBookQA from the EMNLP 2018 paper "Can a Suit of Armor Conduct Electricity? A New Dataset for Open Book Question Answering"
xiangyue9607/OpenDevin
xiangyue9607/PHICON
xiangyue9607/StockPredictionRNN
High Frequency Trading Price Prediction using LSTM Recurrent Neural Networks
xiangyue9607/Text_Crawl
An attempt to crawl the main text of web pages and save it locally
xiangyue9607/uts-rest-api
A repository of code samples in various languages that show how to use the UMLS Terminology Services REST API