capitalone/datacompy
Pandas, Polars, and Spark DataFrame comparison for humans and more!
PythonApache-2.0
Issues
- 3
- 0
switch to ruff for linting and all the things.
#289 opened by fdosani - 2
Fugue support for extra helper functions from core
#214 opened by fdosani - 2
- 1
Please add Snowpark support
#290 opened by achrusciel - 2
- 1
[Discussion] Deprecate the native Spark implementation in favour of Fugue or Pandas on Spark
#274 opened by fdosani - 0
Just going to add a note here for future, currently seeing a small difference in pandas vs spark report sample rows when there are rows only in one dataframe.
#288 opened by fdosani - 1
SparkCompare [PARSE_SYNTAX_ERROR] if a non-join column name contains unicode symbols
#284 opened by kformanowicz-dotdata - 2
SparkCompare [PARSE_SYNTAX_ERROR] if column name contains unicode symbols
#280 opened by kformanowicz-dotdata - 0
- 8
Add list of dissimilar columns to report
#235 opened by janinebp - 2
Error df1 must have all columns from join_columns
#194 opened by bukreevai - 4
- 3
Abstract base class for native Compare functionality
#260 opened by fdosani - 12
Python 3.11 support
#227 opened by fdosani - 1
Error when comparing pandas string type with NAs
#193 opened by robne1982 - 0
edgetest is broken and needs some investigating.
#267 opened by fdosani - 9
Issue in writing report
#256 opened by rangav07 - 0
Snowflake and SQL support via Fugue
#264 opened by fdosani - 2
Are there plans to support Python 3.12.1?
#262 opened by RicardoEscobar - 2
- 2
who can help make the result significantly
#255 opened by swloveydp - 8
- 0
- 1
- 4
Add mypy to the project
#247 opened by aguiddir - 2
confused about df_unq_rows
#243 opened by swloveydp - 6
- 2
- 3
No objects to concatenate issue with Fugue
#218 opened by fdosani - 3
The intersection logic of Compare has problems.
#221 opened by goodwanghan - 4
Datacompare for Date field is not working
#230 opened by RRajavel - 1
SparkCompare() not working for dask - dropDuplicates
#233 opened by hb0313 - 4
- 2
Speed up spark unit tests
#223 opened by krishnanravi - 9
- 2
- 0
Pandas 2.0 support
#213 opened by fdosani - 1
documentation about fugue functionality
#203 opened by fdosani - 0
modernize docs
#204 opened by fdosani - 2
Fugue Phase 2 functionality
#206 opened by fdosani - 1
convert all docs to markdown from rst.
#196 opened by fdosani - 3
- 4
DataFrame is highly fragmented warning
#188 opened by jpvillemalard - 4
- 3
bump up minimum python version to 3.8
#173 opened by fdosani - 1
Replace called_with with assert_called_with
#182 opened by tirkarthi - 2
all_mismatch method raise exception if row count is different between baseline dataframe with compared dataframe
#177 opened by Sunny08012012 - 2
join_columns is changing the content's column
#163 opened by FrancisMorelli