pydata
There are 103 repositories under pydata topic.
dask/dask
Parallel computing with task scheduling
rapidsai/cudf
cuDF - GPU DataFrame Library
TDAmeritrade/stumpy
STUMPY is a powerful and scalable Python library for modern time series analysis
databricks/koalas
Koalas: pandas API on Apache Spark
pydata/pandas-datareader
Extract data from a wide range of Internet sources into a pandas DataFrame.
dask/distributed
A distributed task scheduler for Dask
pyjanitor-devs/pyjanitor
Clean APIs for data cleaning. Python implementation of R package Janitor
pydata/pydata-sphinx-theme
A clean, three-column Sphinx theme with Bootstrap for the PyData community
DataTau/datascience-anthology-pydata
PyData, The Complete Works of
sgkit-dev/sgkit
Scalable genetics toolkit
data-apis/array-api
RFC document, tooling and other content related to the array API standard
stringfestdata/advancing-into-analytics-book
Resources for Advancing into Analytics: From Excel to R and Python by George Mount (O'Reilly Media, 2021)
JDASoftwareGroup/kartothek
A consistent table management library in python
JasonKessler/Scattertext-PyData
Notebooks for the Seattle PyData 2017 talk on Scattertext
dimgold/pycon_social_networkx
Social network analysis code examples for PyCon 2019 talk
rasbt/pydata-chicago2016-ml-tutorial
Machine learning with scikit-learn tutorial at PyData Chicago 2016
python-graphblas/python-graphblas
Python library for GraphBLAS: high-performance sparse linear algebra for scalable graph analytics
sktime/sktime-tutorial-pydata-amsterdam-2020
Introduction to Machine Learning with Time Series at PyData Festival Amsterdam 2020
WinVector/pyvtreat
vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distributed under a BSD-3-Clause license.
heavyai/pymapd
Python client for OmniSci GPU-accelerated SQL engine and analytics platform
python-graphblas/graphblas-algorithms
Graph algorithms written in GraphBLAS
mattilyra/pydataberlin-2017
Repo for my talk at the PyData Berlin 2017 conference
data-apis/array-api-comparison
Data and tooling to compare the API surfaces of various array libraries.
sktime/sktime-tutorial-pydata-global-2021
Introduction to sktime at the PyData Global 2021
makepath/mapshader
Simple Python GIS Web Services
martinapugliese/tales-science-data
WORK UNDER RESTRUCTURING
jseabold/pandas-selectable
A `select` accessor for easier subsetting of pandas DataFrames and Series
stanleyjzheng/PyData-Pseudolabelling-Keynote
Accompanying notebook and sources to "A Guide to Pseudolabelling: How to get a Kaggle medal with only one model" (Dec. 2020 PyData Boston-Cambridge Keynote)
yinleon/pydata2017
This is the code and presentation for my PyData2017 talk "Reverse Image Search Using Out-of-the-box Machine Learning Libraries
dask-contrib/dask-histogram
Histograms with task scheduling.
gcampanella/pydata-london-2018
Slides and notebooks for my tutorial at PyData London 2018
bweigel/ml_at_awslambda_pydatabln2018
Material for working alongside my workshop session at PyData Berlin 2018
pydataberlin/meetup-slides
Speaker slides from monthly meetups and conference
lucasdurand/network-graph-tutorial
construct a network graph to explore and visualize how people connect in an organisation
josephofiowa/pydata-dc-2018
@matthewbrems and I presented "Recreating, Understanding, and Visualizing FiveThirtyEight's Elections Forecast" at PyData DC 2018
koaning/kadro
A friendly pandas wrapper with a more composable grammar support.