Performance issue w/ getiterator(::CSVFile) #30

Closed
@quinnj

I noticed a weird performance issue when playing with CSVFile. Note the following timings (all taken after each function had already been run once, so compilation should not be a factor):

julia> @time f = load("/Users/jacobquinn/Downloads/randoms.csv");
  0.000297 seconds (84 allocations: 5.234 KiB)

julia> @time f = IteratorInterfaceExtensions.getiterator(load("/Users/jacobquinn/Downloads/randoms.csv"));
  6.374569 seconds (36.76 M allocations: 810.608 MiB, 2.06% gc time)

julia> @time DataFrame(f);
  0.070821 seconds (532.63 k allocations: 34.434 MiB, 9.99% gc time)

The getiterator call takes that long (about 6 seconds) every time it's called, not just the first time. I haven't had a chance to dig in further, but wanted to report it.
