capitalone/DataProfiler

Unhashable type: list when initializing DataLabeler

js430 opened this issue · 1 comment

General Information:

  • OS: macOS Sonoma 14.4, Intel i5 processor
  • Python version: 3.9
  • Library version: 0.10.9

Describe the bug:
When initializing a DataLabeler, I get the following error:

TypeError: unhashable type: 'list'

To Reproduce:

import dataprofiler as dp
from dataprofiler.data_readers.csv_data import CSVData

data = dp.Data("sample_data.csv")

data_labeler = dp.DataLabeler(labeler_type='structured')

Expected behavior:

I would expect no output, just a successful initialization, after which I could run something like:

predictions = data_labeler.predict(data)

Screenshots:
Full error log: (screenshot attached: unhashable_type_list)
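
Since the full error log above is only available as a screenshot, a text copy of the traceback may be easier to search and compare. The following is a minimal sketch of one way to capture it; `capture_traceback` and `init_labeler` are hypothetical helper names introduced here for illustration, not part of DataProfiler.

```python
import traceback


def capture_traceback(fn):
    """Call fn() and return the formatted traceback as text, or None if fn succeeds."""
    try:
        fn()
    except Exception:
        return traceback.format_exc()
    return None


def init_labeler():
    # Hypothetical wrapper around the failing call from the reproduction steps.
    # The import is done lazily so capture_traceback can be used on its own.
    import dataprofiler as dp

    return dp.DataLabeler(labeler_type="structured")
```

Running `print(capture_traceback(init_labeler))` in the affected environment should print the complete stack trace as text, which can then be pasted into the issue.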

Additional context:

Attached is the sample data file I used: a list of randomly generated MAC addresses, IP addresses, and IMSIs.
sample_data.csv

Hey @js430! Thanks for opening the issue.

I'm unable to replicate this on my end with 0.10.9. The following runs without error for me:

import dataprofiler as dp

data = dp.Data("sample_data.csv")

data_labeler = dp.DataLabeler(labeler_type='structured')

predictions = data_labeler.predict(data)

I'm on Sonoma with an M1 chip myself.
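
Because the error shows up on one machine and not the other, comparing the two environments might help narrow it down. Below is a rough sketch that prints the Python version, platform, and installed versions of a few packages using only the standard library. The package list is an assumption about which dependencies might matter here, and `environment_report` is a hypothetical helper name, so adjust both as needed.

```python
import platform
import sys
from importlib import metadata


def package_version(name):
    """Return the installed version of a distribution, or 'not installed'."""
    try:
        return metadata.version(name)
    except metadata.PackageNotFoundError:
        return "not installed"


def environment_report(packages=("DataProfiler", "tensorflow", "numpy", "pandas")):
    """Collect interpreter, platform, and package versions into a dict."""
    report = {
        "python": sys.version.split()[0],
        "platform": platform.platform(),
        "machine": platform.machine(),  # e.g. x86_64 on Intel, arm64 on Apple silicon
    }
    for name in packages:
        report[name] = package_version(name)
    return report


# Print one "key: value" line per entry so the output can be pasted into the issue.
for key, value in environment_report().items():
    print(f"{key}: {value}")
```

Posting this output from both machines would make any differences in dependency versions or architecture visible side by side.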