moj-analytical-services/mojap-arrow-pd-parser
Conforms pandas to "correct" datatypes to ensure data in/out using CSV, JSONL and Parquet is read the same (using arrow).
PythonMIT
Issues
- 0
Pandas to parquet
#4 opened by isichei - 1
Parquet to Pandas via Arrow
#3 opened by isichei - 1
Pyarrow v5
#61 opened by s-ducks - 0
Add Logging to repo?
#109 opened by parminder-thindal-moj - 0
Python 3.11 support
#102 opened by mratford - 2
Not finding file on S3 bucket
#96 opened by mshodge - 1
- 0
- 0
- 0
- 0
Low test coverage
#62 opened by s-ducks - 1
Allow pd_arrow_parser to accept Metadata class
#49 opened by isichei - 1
Add Pandas converter
#48 opened by isichei - 0
Casting and reading in complex data types
#53 opened by s-ducks - 0
- 2
Pandas to CSV/JSON
#5 opened by isichei - 0
Add testing for dependencies updating
#39 opened by gkelly900 - 0
jsonl read fails on decimal type
#40 opened by isichei - 2
github actions to run testing
#16 opened by isichei - 2
- 0
Pass an Arrow schema into JSON and CSV reader
#32 opened by isichei - 0
- 0
Add pandas typing and docstrings to functions
#33 opened by isichei - 2
- 0
- 0
CSV and JSON to Pandas reader
#1 opened by isichei - 1
Standard package layout
#6 opened by isichei - 0
Tests currently failing for boolean type
#17 opened by isichei - 0
- 0
- 1
Interaction with S3
#2 opened by isichei - 0
handle booleans
#10 opened by Thomas-Hirsch