ENH: improved dtype inference for Index.map #44609

jbrockmendel · 2021-11-24T21:44:48Z

closes #xxxx
tests added / passed
Ensure all linting tests pass, see here for how to run them
whatsnew entry

Broken off from #43930

jreback · 2021-11-25T17:03:13Z

looks good. is this a user visible thing? if so can you add a whatsnew

jorisvandenbossche · 2021-11-26T10:30:06Z

An example to illustrate, assume we have an uint32 index:

In [1]: idx = pd.NumericIndex([1, 2, 3], dtype="uint32")

In [2]: idx
Out[2]: NumericIndex([1, 2, 3], dtype='uint32')

In [3]: idx.map(lambda x: x)
Out[3]: NumericIndex([1, 2, 3], dtype='int64')

On master we always get an int64 index back, while with this PR it tries to preserve the original dtype:

In [3]: idx.map(lambda x: x)
Out[3]: NumericIndex([1, 2, 3], dtype='uint32')

Now, I don't know if this is actually a user visible change compared to the released version, since the above example is with a non-int64 dtype, which is only in master (so in that sense you could see it as a bug fix in the new not-yet-released feature)

jorisvandenbossche · 2021-11-26T10:32:21Z

Now, I don't know if this is actually a user visible change compared to the released version, since the above example is with a non-int64 dtype, which is only in master (so in that sense you could see it as a bug fix in the new not-yet-released feature)

Ah, we of course already had UInt64Index, for example, and that will now also preserve the dtype instead of returning Int64Index in the example above.

jorisvandenbossche

The current tests you edited are (all?) simple lambda x: x checks that ensure the dtype is now preseved. Can you ensure we also test the case where a maybe_cast_pointwise_result needs to fallback? Eg idx.map(lambda x: x*1000) with an uint8 index, where the resulting values thus don't fit in the original dtype.

jbrockmendel · 2021-11-26T18:38:47Z

is this a user visible thing? if so can you add a whatsnew

yes, will add whatsnew.

jbrockmendel · 2021-11-26T18:41:31Z

The current tests you edited are (all?) simple lambda x: x checks that ensure the dtype is now preseved. Can you ensure we also test the case where a maybe_cast_pointwise_result needs to fallback? Eg idx.map(lambda x: x*1000) with an uint8 index, where the resulting values thus don't fit in the original dtype.

I do intend to do this (https://github.com/pandas-dev/pandas/pull/44609/files#diff-9090cca4ac914e526dfb58dd84ce7c75e2935e41cdeb3ee1618bb5068f05e2dbR549), the question is when. This PR in its current state is just splitting the Index.map changes off from #43930. I'm happy to take the time to do this The Right Way, but that does mean marginally slowing down the ExtensionIndex PR.

ENH: improved dtype inference for Index.map

c5d9a44

jreback added Dtype Conversions Unexpected or buggy dtype conversions ExtensionArray Extending pandas with custom dtypes or arrays. labels Nov 25, 2021

jreback added this to the 1.4 milestone Nov 25, 2021

jorisvandenbossche mentioned this pull request Nov 26, 2021

ENH: allow storing ExtensionArrays in Index #43930

Merged

7 tasks

jorisvandenbossche reviewed Nov 26, 2021

View reviewed changes

jbrockmendel added 3 commits December 3, 2021 15:14

Merge branch 'master' into bug-index-map

cdd3f8e

whatsnew

ba92469

new test

ebd87e2

jreback merged commit bb50531 into pandas-dev:master Dec 5, 2021

jbrockmendel deleted the bug-index-map branch December 5, 2021 02:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

ENH: improved dtype inference for Index.map #44609

ENH: improved dtype inference for Index.map #44609

Uh oh!

jbrockmendel commented Nov 24, 2021

Uh oh!

jreback commented Nov 25, 2021

Uh oh!

jorisvandenbossche commented Nov 26, 2021

Uh oh!

jorisvandenbossche commented Nov 26, 2021

Uh oh!

jorisvandenbossche left a comment

Uh oh!

jbrockmendel commented Nov 26, 2021

Uh oh!

jbrockmendel commented Nov 26, 2021

Uh oh!

Uh oh!

Uh oh!

ENH: improved dtype inference for Index.map #44609

ENH: improved dtype inference for Index.map #44609

Uh oh!

Conversation

jbrockmendel commented Nov 24, 2021

Uh oh!

jreback commented Nov 25, 2021

Uh oh!

jorisvandenbossche commented Nov 26, 2021

Uh oh!

jorisvandenbossche commented Nov 26, 2021

Uh oh!

jorisvandenbossche left a comment

Choose a reason for hiding this comment

Uh oh!

jbrockmendel commented Nov 26, 2021

Uh oh!

jbrockmendel commented Nov 26, 2021

Uh oh!

Uh oh!