DrPav's Stars
apache/superset
Apache Superset is a Data Visualization and Data Exploration Platform
apache/airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
amueller/word_cloud
A little word cloud generator in Python
pypa/sampleproject
A sample project that exists for PyPUG's "Tutorial on Packaging and Distributing Projects"
unionai-oss/pandera
A light-weight, flexible, and expressive statistical data testing library
ploomber/ploomber
The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️
scikit-learn-contrib/hdbscan
A high performance implementation of HDBSCAN clustering.
airbnb/airpal
Web UI for PrestoDB.
scikit-learn-contrib/category_encoders
A library of sklearn compatible categorical variable encoders
VerbalExpressions/PythonVerbalExpressions
Python regular expressions made easy
reiinakano/xcessiv
A web-based application for quick, scalable, and automated hyperparameter tuning and stacked ensembling in Python.
glample/tagger
Named Entity Recognition Tool
OxCGRT/covid-policy-tracker
Systematic dataset of Covid-19 policy, from Oxford University
njtierney/naniar
Tidy data structures, summaries, and visualisations for missing data
dgorissen/pycel
A library for compiling excel spreadsheets to python code & visualizing them as a graph
ropensci/stplanr
Sustainable transport planning with R
ukgovdatascience/data_scientist_career_path
Draft Data Scientist career path by the Government Data Science Partnership
minimaxir/amazon-spark
R Code + R Notebook for analyzing millions of Amazon reviews using Apache Spark
deltaDNA/sql-cookbook
Common SQL recipes and best practises
AlexIoannides/pipeliner
Machine learning pipelines for R.
ukgovdatascience/govstyle
Theme for use with ggplot2 for creating government style visualisations
wleepang/shiny-directory-input
An shiny input widget for selecting directories
rs-delve/covid19_datasets
Interfacing several COVID-19 related datasets
johnbaums/hues
Generate palettes of distinct colours through k means clustering of LAB colour space.
alphagov/govuk-lda-tagger
An experiment of using the LDA machine learning algorithm to generate topics from documents and tag them with those topics
mattilyra/glove2h5
A small utility for converting Stanford GloVe vectors to HDF5 / NumPy
moj-analytical-services/pq-tool
Tool to analyse past parliamentary questions with visualisation in RShiny
analytics-scotland/babynames
Robinlovelace/mlCars
tillbe/jsd
R package for Jensen-Shannon Divergence