EdwardZH

Tsinghua UniversityBeijing, China

Pinned Repositories

1-billion-word-language-modeling-benchmark
Formerly known as code.google.com/p/1-billion-word-language-modeling-benchmark
Language:Perl0 2 00
2019-nCov-api
本项目通过爬取腾讯、新浪、丁香园等疫情数据，获取新冠肺炎相关数据，并整合为api数据，做法简单粗暴，类似于端口转发。数据包含口罩预约、同乘车辆、疫情小区、数据分析、国内外详细数据、实时新闻动态、确诊人员信息流动轨迹、疫情谣言等。
Language:Java0 1 00
acl2020-openqa-tutorial
ACL2020 Tutorial: Open-Domain Question Answering
0 1 00
AGIEval
Language:Python0 0 00
bob-plugin-openai-translator
基于 ChatGPT API 的文本翻译、文本润色、语法纠错 Bob 插件，让我们一起迎接不需要巴别塔的新时代！
Language:JavaScript1 0 00
ContrastQG
Language:Python1 2 00
EntityLinkingRetrieval-ELR
Exploiting entity linking in queries for entity retrieval
Language:Python2 2 00
OpenMatch
An Open-Source Package for Information Retrieval
Language:Python1 0 00
OpenMatch_docs
Language:Python2 4 11
QASurvey
3 2 00

EdwardZH's Repositories

EdwardZH/QASurvey
3 2 00
EdwardZH/OpenMatch_docs
Language:Python2 4 11
EdwardZH/bob-plugin-openai-translator
基于 ChatGPT API 的文本翻译、文本润色、语法纠错 Bob 插件，让我们一起迎接不需要巴别塔的新时代！
Language:JavaScript1 0 00
EdwardZH/ContrastQG
Language:Python1 2 00
EdwardZH/OpenMatch
An Open-Source Package for Information Retrieval
Language:Python1 0 00
EdwardZH/2019-nCov-api
本项目通过爬取腾讯、新浪、丁香园等疫情数据，获取新冠肺炎相关数据，并整合为api数据，做法简单粗暴，类似于端口转发。数据包含口罩预约、同乘车辆、疫情小区、数据分析、国内外详细数据、实时新闻动态、确诊人员信息流动轨迹、疫情谣言等。
Language:Java0 1 00
EdwardZH/acl2020-openqa-tutorial
ACL2020 Tutorial: Open-Domain Question Answering
0 1 00
EdwardZH/AGIEval
Language:Python0 0 00
EdwardZH/Automatic-Corpus-Generation
This repository is for the paper "A Hybrid Approach to Automatic Corpus Generation for Chinese Spelling Check"
Language:Python0 1 00
EdwardZH/bert_score
BERT score for text generation
Language:Jupyter Notebook1 0
EdwardZH/cmu-10717-the-art-of-the-paper
Official repository for CMU Machine Learning Department's 10717: "The Art of the Paper".
1 0
EdwardZH/crosentgec
Code for cross-sentence grammatical error correction using multilayer convolutional seq2seq models (ACL 2019)
Language:Python1 0
EdwardZH/DialFact
We construct and introduce DIALFACT, a testing benchmark dataset crowd-annotated conversational claims, paired with pieces of evidence from Wikipedia.
Language:Python0 0
EdwardZH/DiskANN
Graph based indices for approximate nearest neighbor search
Language:C++1 0
EdwardZH/DrQA
Reading Wikipedia to Answer Open-Domain Questions
Language:Python2 0
EdwardZH/EdwardZH.github.io
AcadHomepage: A Modern and Responsive Academic Personal Homepage
Language:SCSS0 02
EdwardZH/fever-2018-team-athene
Language:Python2 0
EdwardZH/GEAR
Source code for ACL 2019 paper "GEAR: Graph-based Evidence Aggregating and Reasoning for Fact Verification"
Language:Python1 0
EdwardZH/GYAFC-corpus
This is the Grammarly's Yahoo Answers Formality Corpus
2 0
EdwardZH/MSMARCOV2
Utilities, Baselines, Statistics and Descriptions Related to the MSMARCO DATASET
Language:Python2 0
EdwardZH/neuqe
Neural quality estimation toolkit for grammatical error correction and other language generation applications.
Language:Python2 0
EdwardZH/NRE
Neural Relation Extraction, including CNN, PCNN, CNN+ATT, PCNN+ATT
Language:C++2 0
EdwardZH/Orca
Orca: A Few-shot Benchmark for Chinese Conversational Machine Reading Comprehension
Language:Python0 0
EdwardZH/PyChatGPT
⚡️ Python client for the unofficial ChatGPT API with auto token regeneration, conversation tracking, proxy support and more.
Language:Python0 0
EdwardZH/pytorch-pretrained-BERT
The Big-&-Extending-Repository-of-Transformers: PyTorch pretrained models for Google's BERT, OpenAI GPT & GPT-2 and Google/CMU Transformer-XL.
Language:Jupyter Notebook2 0
EdwardZH/QAConv
This repository maintains the QAConv dataset, a question-answering dataset on informative conversations including business emails, panel discussions, and work channels.
Language:Python0 0
EdwardZH/Texygen
A text generation benchmarking platform
Language:Python2 0
EdwardZH/tSNE-Animation
Hacking sklearn's t-SNE implementation to animate embedding process
Language:Python1 0
EdwardZH/wit
WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.
1 0
EdwardZH/zmPDSwR
Example R scripts and data for "Practical Data Science with R" 1st edition by Nina Zumel and John Mount (Manning Publications)
Language:HTML1 0