dataframe
There are 1128 repositories under dataframe topic.
pola-rs/polars
Extremely fast Query Engine for DataFrames, written in Rust
Kanaries/pygwalker
PyGWalker: Turn your dataframe into an interactive UI for visual analysis
modin-project/modin
Modin: Scale your Pandas workflows by changing a single line of code
rapidsai/cudf
cuDF - GPU DataFrame Library
vaexio/vaex
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
apache/datafusion
Apache DataFusion SQL Query Engine
haifengl/smile
Statistical Machine Intelligence & Learning Engine
javascriptdata/danfojs
Danfo.js is an open source, JavaScript library providing high performance, intuitive, and easy to use data structures for manipulating and processing structured data.
lk-geimfari/mimesis
Mimesis is a robust data generator for Python that can produce a wide range of fake data in multiple languages.
jtablesaw/tablesaw
Java dataframe and visualization library
databricks/koalas
Koalas: pandas API on Apache Spark
adamerose/PandasGUI
A GUI for Pandas DataFrames
sngyai/Sequoia
A股自动选股程序,实现了海龟交易法则、缠中说禅牛市买点,以及其他若干种技术形态
hosseinmoein/DataFrame
C++ DataFrame for statistical, financial, and ML analysis in modern C++
mars-project/mars
Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.
sfu-db/connector-x
Fastest library to load data from DB to DataFrames in Rust and Python
approximatelabs/sketch
AI code-writing assistant that understands data content
apache/hamilton
Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.
alexhallam/tv
📺(tv) Tidy Viewer is a cross-platform CLI csv pretty printer that uses column styling to maximize viewer enjoyment.
man-group/ArcticDB
ArcticDB is a high performance, serverless DataFrame database built for the Python Data Science ecosystem.
apache/datafusion-ballista
Apache DataFusion Ballista Distributed Query Engine
shramos/Awesome-Cybersecurity-Datasets
A curated list of amazingly awesome Cybersecurity datasets
skrub-data/skrub
Machine learning with dataframes
pyjanitor-devs/pyjanitor
Clean APIs for data cleaning. Python implementation of R package Janitor
uwdata/arquero
Query processing and transformation of array-backed data tables.
rocketlaunchr/dataframe-go
DataFrames for Go: For statistics, machine-learning, and data manipulation/exploration
michaelchu/optopsy
A nimble options backtesting library for Python
graphframes/graphframes
GraphFrames is a package for Apache Spark which provides DataFrame-based Graphs
comet-ml/kangas
🦘 Explore multimedia datasets at scale
Kotlin/dataframe
Structured data processing in Kotlin
RedisLabs/spark-redis
A connector for Spark that allows reading and writing to/from Redis cluster
microsoft/Mobius
C# and F# language binding and extensions to Apache Spark
freqtrade/technical
Various indicators developed or collected for the Freqtrade
stitchfix/hamilton
A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton
mrpowers-io/spark-daria
Essential Spark extensions and helper methods ✨😲
techascent/tech.ml.dataset
A Clojure high performance data processing system