capitalone/DataProfiler
What's in your data? Extract schema, statistics and entities from datasets
Python · Apache-2.0
Issues
certifi-2024.6.2-py3-none-any.whl: 1 vulnerabilities (highest severity is: 7.5)
#1158 opened by mend-for-github-com - 0
urllib3-1.26.18-py2.py3-none-any.whl: 1 vulnerabilities (highest severity is: 4.4)
#1153 opened by mend-for-github-com - 0
Issues with Transfer Learning for Default Labeler
#1155 opened by DylanVig - 0
Numpy2.0 Import Error
#1154 opened by VrajCodes - 1
requests-2.31.0-py3-none-any.whl: 1 vulnerabilities (highest severity is: 5.6) - autoclosed
#1142 opened by mend-for-github-com - 3
Dask Max Version Tag
#1121 opened by taylorfturner - 0
build test warns on __init__ constructor
#1147 opened by gliptak - 7
Can't get the full package to work
#1144 opened by DylanVig - 1
Unhashable type: list when initializing DataLabeler
#1140 opened by js430 - 8
Cannot load DataLabeler due to error in labeler utils.
#1126 opened by JGSweets - 1
Column profiled as int but should be text/string
#1130 opened by carlsonp - 3
Categorical Column Profiling Error
#1048 opened by scottiegarcia - 3
JSON Serialization Error
#1100 opened by scottiegarcia - 1
Add argument to Profiler for samples ratio
#1094 opened by carlsonp - 1
Feature: Support for Polars Data Frame
#1076 opened by taylorfturner - 14
Fix broken hyperlink in the documentation
#972 opened by rakeshgowerneni - 1
Profiles Int64 variables as float
#1075 opened by anxti - 3
Bug while running the merge profile list notebook
#1013 opened by ksneab7 - 1
Confusing name for degree of freedom in Chi2 metrics
#1030 opened by ksneab7 - 2
Support for PySpark
#1055 opened by gracemiguel - 3
urllib3-2.0.4-py3-none-any.whl: 2 vulnerabilities (highest severity is: 8.1) - autoclosed
#1049 opened by mend-for-github-com - 1
pyarrow-12.0.1-cp37-cp37m-manylinux_2_17_x86_64.manylinux2014_x86_64.whl: 1 vulnerabilities (highest severity is: 5.5) - autoclosed
#1062 opened by mend-for-github-com - 0
Implement a PSI calculation that uses the `_calculate_psi` function in categorical columns
#1028 opened by ksneab7 - 0
Preset column profile type
#794 opened by taylorfturner - 1
Running space analysis async rather than synchronous
#783 opened by ksneab7 - 2
Fuse the functionality used in both `_merge_histogram` and the newly created `_assimilate_histogram`
#838 opened by ksneab7 - 0
DataLabeler `from_disk`
#941 opened by taylorfturner - 0
Improving out of generate_dataset_by_class function to include naming convention
#782 opened by ksneab7 - 0
Need to update Categorical for TimeIt Tests
#806 opened by JGSweets - 2
`_assimilate_histogram` and `_regenerate_histogram` refactor into standalone functions
#820 opened by ksneab7 - 0
_assimilate_histogram function not handling reallocation of bucket counts ideally
#1017 opened by ksneab7 - 7
cannot import name 'F1Score' from 'dataprofiler.labelers.character_level_cnn_model'
#773 opened by brijesh1100 - 2
Create option for num_quantiles attribute
#853 opened by ksneab7 - 3
Fix snappy installation on Mac
#970 opened by sharattadimalla - 4
`DATAPROFILER_SEED` global input validation testing
#916 opened by micdavis - 1
Action Required: Fix Mend Configuration File - .whitesource - autoclosed
#975 opened by mend-for-github-com - 2
Scipy Version + Graph Issues
#905 opened by taylorfturner - 1
Add documentation for `sampling_ratio` option
#856 opened by taylorfturner - 1
RowStatisticsOptions: Add null row count
#859 opened by micdavis - 6
update supported version of tensorflow
#775 opened by leos