dataframes
There are 344 repositories under dataframes topic.
pola-rs/polars
Extremely fast Query Engine for DataFrames, written in Rust
unionai-oss/pandera
A light-weight, flexible, and expressive statistical data testing library
TileDB-Inc/TileDB
The Universal Storage Engine
JuliaData/DataFrames.jl
In-memory tabular data in Julia
skrub-data/skrub
Machine learning with dataframes
rocketlaunchr/dataframe-go
DataFrames for Go: For statistics, machine-learning, and data manipulation/exploration
mahmoudparsian/pyspark-tutorial
PySpark-Tutorial provides basic algorithms using PySpark
elixir-explorer/explorer
Series (one-dimensional) and dataframes (two-dimensional) for fast and elegant data exploration in Elixir
graphframes/graphframes
GraphFrames is a package for Apache Spark which provides DataFrame-based Graphs
pdpipe/pdpipe
Easy pipelines for pandas DataFrames.
elastic/eland
Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
capitalone/datacompy
Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more!
polyaxon/traceml
Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.
JuliaData/DataFramesMeta.jl
Metaprogramming tools for DataFrames
static-frame/static-frame
Immutable and statically-typeable DataFrames with runtime type and data validation
stefmolin/pandas-workshop
An introductory workshop on pandas with notebooks and exercises for following along. Slides contain all solutions.
aiguofer/gspread-pandas
A package to easily open an instance of a Google spreadsheet and interact with worksheets through Pandas DataFrames.
rtosholdings/riptable
64bit multithreaded python data analytics tools for numpy arrays and datasets
typedef-ai/fenic
Build reliable AI and agentic applications with DataFrames
RumbleDB/rumble
Quick start: pip install jsoniq ⛈️ RumbleDB 2.0.0 "Lemon Ironwood" 🌳 for Apache Spark | Run queries on your large-scale, messy datasets (JSON, text, CSV, Parquet, Delta...) | Data Lakehouse with Updates, Scripting, Declarative Machine Learning and more
mahmoudparsian/data-algorithms-with-spark
O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian
open2c/bioframe
Genomic interval operations on Pandas DataFrames
alteryx/woodwork
Woodwork is a Python library that provides robust methods for managing and communicating data typing information.
DataHaskell/dh-core
Functional data science
sl-solution/InMemoryDatasets.jl
Multithreaded package for working with tabular data in Julia
JuliaAcademy/DataFrames
Welcome to DataFrames.jl with Bogumił Kamiński
data-apis/dataframe-api
RFC document, tooling and other content related to the dataframe API standard
zbrookle/dataframe_sql
A Python package that parses SQL and interprets it as methods that act upon existing pandas (or other types of) DataFrames that have been declared and registered
Thomas-George-T/Movies-Analytics-in-Spark-and-Scala
Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.
biodatageeks/polars-bio
Blazing-Fast Bioinformatic Operations on Python DataFrames
red-data-tools/red_amber
A dataframe library for Rubyists.
wesselhuising/pandantic
Gone are the days of black-box dataframes in otherwise type-safe code! Pandantic builds off the Pydantic API to enable validation and filtering of the usual dataframe types (i.e., pandas, etc.)
zbrookle/sql_to_ibis
A Python package that parses sql and converts it to ibis expressions
hablapps/sparkOptics
Optics for Spark DataFrames
dlab-berkeley/R-Data-Wrangling-Legacy
D-Lab's 6 hour introduction to data wrangling with R. Learn how to manipulate dataframes using the tidyverse in R.
openweathermap/deker
Multidimensional arrays storage engine