data-apis/python-record-api

String literals must not be treated as separate types

rgommers opened this issue · 1 comments

Strings are things like column names; there is no point treating those as separate types, it gives a huge amount of pollution in the generated type annotations. Example: typing/pandas.core.frame.py has 10,000+ lines for __getitem__/__setitem__ overloads.

Related, issue to make results more interpretable: gh-71

This issue seems more relevant for the dataframe analysis. For array analysis, treating strings as unique was useful to see which kwarg values were provided by downstream libraries.