kobi-2's Stars
lab-v2/pyreason-gym
An OpenAI wrapper for PyReason to use in a Grid World reinforcement learning setting
lab-v2/pyreason
An explainable inference software supporting annotated, real valued, graph based and temporal logic
zarif98sjs/how-not-to-do-phd-application
So you want to do a PhD?
Chandra0505/Data-Science-Resources
zhijing-jin/nlp-phd-global-equality
A repo for open resources & information for people to succeed in PhD in CS & career in AI / NLP
dair-ai/ML-YouTube-Courses
📺 Discover the latest machine learning / AI courses on YouTube.
iamadamdev/bypass-paywalls-chrome
Bypass Paywalls web browser extension for Chrome and Firefox.
karlstratos/nlp-class
Lectures on NLP
neubig/nlp-from-scratch-assignment-2022
An assignment for CMU CS11-711 Advanced NLP, building NLP systems from scratch
aws-samples/aws-machine-learning-university-responsible-ai
ml-tooling/best-of-ml-python
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
thunlp/XQA
Dataset and baseline for ACL 2019 paper "XQA: A Cross-lingual Open-domain Question Answering Dataset"
princeton-nlp/MultilingualAnalysis
Repository for the paper titled: "When is BERT Multilingual? Isolating Crucial Ingredients for Cross-lingual Transfer"
siznax/wptools
Wikipedia tools (for Humans): easily extract data from Wikipedia, Wikidata, and other MediaWikis
goldsmith/Wikipedia
A Pythonic wrapper for the Wikipedia API
google-research-datasets/tydiqa
TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and without the use of translation, and is designed for the training and evaluation of automatic question answering systems. This repository provides evaluation code and a baseline system for the dataset.
banglakit/awesome-bangla
A collection of tools, datasets and resources on Bangla computing
huggingface/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
lucidrains/x-transformers
A concise but complete full-attention transformer with a set of promising experimental features from various papers
fastai/fastbook
The fastai book, published as Jupyter Notebooks
Hellisotherpeople/DebateSum
Corresponding code repo for the paper at COLING 2020 - ARGMIN 2020: "DebateSum: A large-scale argument mining and summarization dataset"
google-research-datasets/common-crawl-domain-names
Corpus of domain names scraped from Common Crawl and manually annotated to add word boundaries (e.g. "commoncrawl" to "common crawl").
google-research-datasets/dakshina
The Dakshina dataset is a collection of text in both Latin and native scripts for 12 South Asian languages. For each language, the dataset includes a large collection of native script Wikipedia text, a romanization lexicon of words in the native script with attested romanizations, and some full sentence parallel data in both a native script of the language and the basic Latin alphabet.
google-research-datasets/turkish-treebanks
A human-annotated morphosyntactic treebank for Turkish.
google-research-datasets/Disfl-QA
A Benchmark Dataset for Understanding Disfluencies in Question Answering
google-research-datasets/screen2words
The dataset includes screen summaries that describes Android app screenshot's functionalities. It is used for training and evaluation of the screen2words models (our paper accepted by UIST'21 will be linked soon).
google-research-datasets/QED
QED: A Framework and Dataset for Explanations in Question Answering
google-research-datasets/natural-questions
Natural Questions (NQ) contains real user questions issued to Google search, and answers found from Wikipedia by annotators. NQ is designed for the training and evaluation of automatic question answering systems.
google-research-datasets/aquamuse
AQuaMuSe is a novel scalable approach to automatically mine dual query based multi-document summarization datasets for extractive and abstractive summaries using question answering dataset (Google Natural Questions) and large document corpora (Common Crawl)
nowshintabassum/Random-Headline-Generator
Using Markov chains (Markovify Library) to generate random news headline generator