Pinned Repositories
1-billion-word-language-modeling-benchmark
Formerly known as code.google.com/p/1-billion-word-language-modeling-benchmark
2019-nCov-api
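This project crawls epidemic data from Tencent, Sina, DXY (Dingxiangyuan), and other sources to collect COVID-19 data and consolidates it into an API. The approach is simple and crude, similar to port forwarding. The data covers mask reservations, shared-transport trips, affected residential communities, data analysis, detailed domestic and international figures, real-time news, movement trajectories of confirmed cases, epidemic rumors, and more.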
acl2020-openqa-tutorial
ACL2020 Tutorial: Open-Domain Question Answering
AGIEval
bob-plugin-openai-translator
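A Bob plugin for text translation, text polishing, and grammar correction based on the ChatGPT API. Let's welcome a new era that no longer needs a Tower of Babel!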
ContrastQG
EntityLinkingRetrieval-ELR
Exploiting entity linking in queries for entity retrieval
OpenMatch
An Open-Source Package for Information Retrieval
OpenMatch_docs
QASurvey
EdwardZH's Repositories
EdwardZH/QASurvey
EdwardZH/OpenMatch_docs
EdwardZH/bob-plugin-openai-translator
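A Bob plugin for text translation, text polishing, and grammar correction based on the ChatGPT API. Let's welcome a new era that no longer needs a Tower of Babel!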
EdwardZH/ContrastQG
EdwardZH/OpenMatch
An Open-Source Package for Information Retrieval
EdwardZH/2019-nCov-api
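This project crawls epidemic data from Tencent, Sina, DXY (Dingxiangyuan), and other sources to collect COVID-19 data and consolidates it into an API. The approach is simple and crude, similar to port forwarding. The data covers mask reservations, shared-transport trips, affected residential communities, data analysis, detailed domestic and international figures, real-time news, movement trajectories of confirmed cases, epidemic rumors, and more.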
EdwardZH/acl2020-openqa-tutorial
ACL2020 Tutorial: Open-Domain Question Answering
EdwardZH/AGIEval
EdwardZH/Automatic-Corpus-Generation
This repository is for the paper "A Hybrid Approach to Automatic Corpus Generation for Chinese Spelling Check"
EdwardZH/bert_score
BERT score for text generation
EdwardZH/cmu-10717-the-art-of-the-paper
Official repository for CMU Machine Learning Department's 10717: "The Art of the Paper".
EdwardZH/crosentgec
Code for cross-sentence grammatical error correction using multilayer convolutional seq2seq models (ACL 2019)
EdwardZH/DialFact
We construct and introduce DIALFACT, a testing benchmark dataset of crowd-annotated conversational claims, paired with pieces of evidence from Wikipedia.
EdwardZH/DiskANN
Graph based indices for approximate nearest neighbor search
EdwardZH/DrQA
Reading Wikipedia to Answer Open-Domain Questions
EdwardZH/EdwardZH.github.io
AcadHomepage: A Modern and Responsive Academic Personal Homepage
EdwardZH/fever-2018-team-athene
EdwardZH/GEAR
Source code for ACL 2019 paper "GEAR: Graph-based Evidence Aggregating and Reasoning for Fact Verification"
EdwardZH/GYAFC-corpus
This is Grammarly's Yahoo Answers Formality Corpus
EdwardZH/MSMARCOV2
Utilities, Baselines, Statistics and Descriptions Related to the MSMARCO DATASET
EdwardZH/neuqe
Neural quality estimation toolkit for grammatical error correction and other language generation applications.
EdwardZH/NRE
Neural Relation Extraction, including CNN, PCNN, CNN+ATT, PCNN+ATT
EdwardZH/Orca
Orca: A Few-shot Benchmark for Chinese Conversational Machine Reading Comprehension
EdwardZH/PyChatGPT
⚡️ Python client for the unofficial ChatGPT API with auto token regeneration, conversation tracking, proxy support and more.
EdwardZH/pytorch-pretrained-BERT
The Big-&-Extending-Repository-of-Transformers: PyTorch pretrained models for Google's BERT, OpenAI GPT & GPT-2 and Google/CMU Transformer-XL.
EdwardZH/QAConv
This repository maintains the QAConv dataset, a question-answering dataset on informative conversations including business emails, panel discussions, and work channels.
EdwardZH/Texygen
A text generation benchmarking platform
EdwardZH/tSNE-Animation
Hacking sklearn's t-SNE implementation to animate embedding process
EdwardZH/wit
WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.
EdwardZH/zmPDSwR
Example R scripts and data for "Practical Data Science with R" 1st edition by Nina Zumel and John Mount (Manning Publications)