dataframe
There are 965 repositories under the dataframe topic.
pola-rs/polars
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
Kanaries/pygwalker
PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis
modin-project/modin
Modin: Scale your Pandas workflows by changing a single line of code
vaexio/vaex
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
rapidsai/cudf
cuDF - GPU DataFrame Library
haifengl/smile
Statistical Machine Intelligence & Learning Engine
apache/datafusion
Apache DataFusion SQL Query Engine
twopirllc/pandas-ta
Technical Analysis Indicators - Pandas TA is an easy-to-use Python 3 Pandas Extension with 150+ Indicators
javascriptdata/danfojs
Danfo.js is an open-source JavaScript library providing high-performance, intuitive, and easy-to-use data structures for manipulating and processing structured data.
lk-geimfari/mimesis
Mimesis is a robust data generator for Python that can produce a wide range of fake data in multiple languages.
jtablesaw/tablesaw
Java dataframe and visualization library
databricks/koalas
Koalas: pandas API on Apache Spark
adamerose/PandasGUI
A GUI for Pandas DataFrames
mars-project/mars
Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.
hosseinmoein/DataFrame
C++ DataFrame for statistical, financial, and ML analysis -- in modern C++ using native types and contiguous memory storage
approximatelabs/sketch
AI code-writing assistant that understands data content
alexhallam/tv
📺(tv) Tidy Viewer is a cross-platform CLI CSV pretty printer that uses column styling to maximize viewer enjoyment.
sfu-db/connector-x
Fastest library to load data from DB to DataFrames in Rust and Python
Eventual-Inc/Daft
Distributed DataFrame for Python designed for the cloud, powered by Rust
sngyai/Sequoia
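An automated stock-picking program for China's A-share market that implements the Turtle Trading rules, the Chan Zhong Shuo Chan (缠中说禅) bull-market buy points, and several other technical patterns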
DAGWorks-Inc/hamilton
Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows that encode lineage and metadata. Runs and scales everywhere Python does.
apache/datafusion-ballista
Apache Arrow Ballista Distributed Query Engine
pyjanitor-devs/pyjanitor
Clean APIs for data cleaning. Python implementation of R package Janitor
shramos/Awesome-Cybersecurity-Datasets
A curated list of amazingly awesome Cybersecurity datasets
uwdata/arquero
Query processing and transformation of array-backed data tables.
man-group/ArcticDB
ArcticDB is a high performance, serverless DataFrame database built for the Python Data Science ecosystem.
rocketlaunchr/dataframe-go
DataFrames for Go: For statistics, machine-learning, and data manipulation/exploration
comet-ml/kangas
🦘 Explore multimedia datasets at scale
microsoft/Mobius
C# and F# language binding and extensions to Apache Spark
RedisLabs/spark-redis
A connector for Spark that allows reading from and writing to a Redis cluster
michaelchu/optopsy
A nimble options backtesting library for Python
stitchfix/hamilton
A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton
MrPowers/spark-daria
Essential Spark extensions and helper methods ✨😲
Kotlin/dataframe
Structured data processing in Kotlin
pdpipe/pdpipe
Easy pipelines for pandas DataFrames.
freqtrade/technical
Various indicators developed or collected for Freqtrade