Speed optimization: Blocks should not use their own Index objects #162

Closed
@wesm

A lot of time is spent in Index.__new__ in some computations. I don't think there's an essential need for each Block to create its own Index object when it could just hold an array of integers referring back to the single Index that has all the column names for the DataFrame. This would also make renaming easier, since renaming would only need to happen in one place. Definitely not a trivial refactor, and it would unfortunately break legacy pickles yet again. "For the greater good."
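A rough sketch of the idea, in plain Python rather than actual pandas internals. The names `ColumnIndex`, `Block`, `ref_index`, and `locs` are hypothetical and chosen only for illustration; the point is that each block keeps integer positions into one shared index, so a rename touches a single object.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ColumnIndex:
    """Hypothetical stand-in for the single Index holding all column names."""
    names: List[str]

    def rename(self, old: str, new: str) -> None:
        # Renaming happens here only; blocks never store names themselves.
        self.names[self.names.index(old)] = new


@dataclass
class Block:
    """Hypothetical block: stores integer positions, not its own Index."""
    ref_index: ColumnIndex
    locs: List[int]

    @property
    def items(self) -> List[str]:
        # Names are resolved lazily through the shared index.
        return [self.ref_index.names[i] for i in self.locs]


columns = ColumnIndex(["a", "b", "c", "d"])
float_block = Block(columns, [0, 2])  # refers to columns "a" and "c"
int_block = Block(columns, [1, 3])    # refers to columns "b" and "d"

# One rename on the shared index is visible from every block.
columns.rename("c", "gamma")
renamed_items = float_block.items
```

Constructing a block this way only copies a short list of integers, which is the cost the issue hopes to trade for the per-block Index construction. It assumes the shared index is mutated in place, which may not match how an immutable Index would behave.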

Labels: Enhancement; Internals (related to non-user accessible pandas implementation); Performance (memory or execution speed performance); Refactor (internal refactoring of code)
