wesm
Principal Architect at https://posit.co. Creator of Python pandas and Ibis. Co-creator Apache Arrow. @apache Member and Apache Parquet PMC
@posit-pbcNashville, TN
Pinned Repositories
arrow
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
ibis
the portable Python dataframe library
pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
statsmodels
Statsmodels: statistical modeling and econometrics in Python
feather
Feather: fast, interoperable binary data frame storage for Python, R, and more powered by Apache Arrow
pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
pandas2
Design documents and code for the pandas 2.0 effort.
pydata-book
Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney, published by O'Reilly Media
vbench
vbench: A tool for benchmarking your code through time, for showing performance improvement or regressions
vldb-2019-apache-arrow-workshop
Materials for Apache Arrow workshop at VLDB 2019
wesm's Repositories
wesm/pydata-book
Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney, published by O'Reilly Media
wesm/feather
Feather: fast, interoperable binary data frame storage for Python, R, and more powered by Apache Arrow
wesm/pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
wesm/vldb-2019-apache-arrow-workshop
Materials for Apache Arrow workshop at VLDB 2019
wesm/dataframe-protocol
An Python object protocol for projects to interchange data frame-like data without forcing pandas.DataFrame as the intermediary
wesm/arrow
Mirror of Apache Arrow
wesm/crossbow
wesm/conbench
General purpose, language-agnostic Continuous Benchmarking (CB) framework
wesm/fastparquet
python implementation of the parquet columnar file format.
wesm/kudu
Mirror of Apache Kudu
wesm/velox
A new C++ vectorized database acceleration library aimed to optimizing query engines and data processing systems.
wesm/wesm
wesm/arrow-rs
Official Rust implementation of Apache Arrow
wesm/arrow-site
Mirror of Apache Arrow site
wesm/infrastructure-puppet
Apache Infrastructure Puppet
wesm/orc-feedstock
A conda-smithy repository for orc.
wesm/turbodbc
Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with the Python Database API Specification 2.0.
wesm/arrow-activity
wesm/arrow-datafusion
Apache Arrow DataFusion and Ballista query engines
wesm/arrow-testing
Auxiliary testing files for Apache Arrow
wesm/dataframe_spec
wesm/distributed
A distributed task scheduler for Dask
wesm/duckdb
DuckDB is an embeddable SQL OLAP Database Management System
wesm/flatbuffers
FlatBuffers: Memory Efficient Serialization Library
wesm/grpc-cpp-feedstock
A conda-smithy repository for grpc-cpp.
wesm/pelican-bootstrap3
Bootstrap 3 theme for Pelican
wesm/xxHash
Extremely fast non-cryptographic hash algorithm
wesm/benchmark
A microbenchmark support library
wesm/benchmark-feedstock
A conda-smithy repository for benchmark.
wesm/libprotobuf-feedstock
A conda-smithy repository for libprotobuf.