bwegge's Stars
explosion/spaCy
💫 Industrial-strength Natural Language Processing (NLP) in Python
CSSEGISandData/COVID-19
Novel Coronavirus (COVID-19) Cases, provided by JHU CSSE
google-research/simclr
SimCLRv2 - Big Self-Supervised Models are Strong Semi-Supervised Learners
valentjn/vscode-ltex
LTeX: Grammar/spell checker :mag::heavy_check_mark: for VS Code using LanguageTool with support for LaTeX :mortar_board:, Markdown :pencil:, and others
google-research-datasets/paws
This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, and word order information for the problem of paraphrase identification.
ChunyuanLI/Optimus
Optimus: the first large-scale pre-trained VAE language model
marklagendijk/node-toogoodtogo-watcher
Node.js cli tool for monitoring your favorite TooGoodToGo businesses. Docker image available.
the-markup/location-data-industry
This contains the data and methodology that we used for our story "There’s a Multibillion-Dollar Market for Your Phone’s Location Data ."
florex/resume_corpus
multi-labeled dataset of resumes
areinhardt/tracebase
The tracebase appliance-level power consumption data set
YingzhenLi/VRbound
code release for the NIPS 2016 paper
gireeshkbogu/LAAD
Uses LSTM-based autoencoders to detect abnormal resting heart rate during the coronavirus (SARS-CoV-2) infectious period using the wearables data.
pnbc/how-to-dp-fy-ml
How to DP-fy ML tutorial
mozilla/rappor
RAPPOR: Privacy-Preserving Reporting Algorithms