Various HashMap optimisations #337

utdemir · 2021-07-12T04:12:55Z

Just some minor optimisations for our HashMap implementation. It contains
a few unrelated small changes, so reviewing each individual commit might
be easier.

When I compare this against Data.HashMap.Strict, the results change based
on the input (likely because of the hash distribution of the keys). I need to investigate
this more. For reference, here are the functions I am comparing (Key is just a
newtype wrapper around Int):

   linear :: [Key] -> HashMap.Linear.HashMap Key () %1-> ()
   linear inp hm = go inp hm `Linear.lseq` ()
    where
     go :: [Key] -> HashMap.Linear.HashMap Key () %1-> HashMap.Linear.HashMap Key ()
     go [] h = h
     go (x:xs) h = go xs Linear.$! HashMap.Linear.insert x () h

   dataHashMap :: [Key] -> Data.HashMap.Strict.HashMap Key () -> ()
   dataHashMap inp hm = go inp hm `seq` ()
    where
     go :: [Key] -> Data.HashMap.Strict.HashMap Key () -> Data.HashMap.Strict.HashMap Key ()
     go [] h = h
     go (x:xs) h = go xs $! Data.HashMap.Strict.insert x () h

I did not include the benchmark in this PR, since it looks a bit different
from our existing HashMap benchmarks. I will have to combine them at
some point.

aspiwack

I guess that the next step is committing these benchmarks.

facundominguez · 2021-07-12T11:41:29Z

Hello! Just a random thought: are alter and alterF supposed to be inlined? Otherwise, the allocation of the Maybe value returned by the callback can't be eliminated. insert in unordered-containers isn't implemented in terms of alter, at least.

utdemir · 2021-07-12T20:15:05Z

> are alter and alterF supposed to be inlined? Otherwise, the allocation of the Maybe value returned by the callback can't be eliminated.

This was a good point; I think they should be inlined. I can see in the core output that both the callback passed to alterF from alter and, e.g., the callback passed to alter from insert disappear. Unfortunately, this does not translate into a measurable speed increase on the insertion benchmarks. I think this is because they are only called once per insertion, while our bottleneck is in the tryInsertAtIndex and probeFrom functions, which are called multiple times per insertion.

But the changes make sense and make the core output prettier, so I committed them.
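
To illustrate the inlining point above, here is a minimal sketch using a toy association-list map. ToyMap, empty, and this alter/insert pair are hypothetical stand-ins written for this example; they are not the linear-base or unordered-containers code. The idea is that when alter carries an INLINE pragma and insert calls it fully applied, GHC can inline the body, see the callback applied directly, and simplify away the intermediate Maybe.

```haskell
module Main where

-- Toy association-list map: a hypothetical stand-in, not the linear-base HashMap.
newtype ToyMap k v = ToyMap [(k, v)]
  deriving (Eq, Show)

empty :: ToyMap k v
empty = ToyMap []

-- General update: the callback decides whether the key ends up present.
-- The callback's Maybe result is what we would like the optimiser to remove.
alter :: Eq k => (Maybe v -> Maybe v) -> k -> ToyMap k v -> ToyMap k v
alter f k (ToyMap kvs) = ToyMap (go kvs)
  where
    go [] = maybe [] (\v -> [(k, v)]) (f Nothing)
    go ((k', v') : rest)
      | k == k' = maybe rest (\v -> (k, v) : rest) (f (Just v'))
      | otherwise = (k', v') : go rest
{-# INLINE alter #-}

-- insert calls alter with all three arguments, so the INLINE pragma can fire
-- and the constant callback becomes visible to the simplifier.
insert :: Eq k => k -> v -> ToyMap k v -> ToyMap k v
insert k v m = alter (\_ -> Just v) k m

main :: IO ()
main = print (insert (1 :: Int) 'b' (insert 2 'c' (insert 1 'a' empty)))
```

Whether the Maybe really disappears depends on the optimisation level and GHC version; compiling with -O and -ddump-simpl is one way to check the core output, as done in the comment above.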

utdemir added 5 commits July 8, 2021 13:44

Use faster array accessors on HashMaps

4a1f629

HashMap: Don't unpack PSL underlying array

ae34151

It somehow ends up being faster, since otherwise we frequently allocate boxed integers from the unpacked field.

Cache underlying 'Array's capacity on HashMap constructor

d919425

It is accessed pretty often, so it is faster to have it ready.

Make strictness explicit on HashMap fields

6ce60ff

Make 'probeFrom' return an unboxed tuple

b9cfc7b
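
As a rough illustration of the ideas named in the last three commits above (a cached capacity, strict fields, and an unboxed-tuple result), here is a minimal sketch. The Table type and this probeFrom are hypothetical and use a plain list of slots; the actual linear-base representation and the real probeFrom signature are not shown in this thread and are likely different.

```haskell
{-# LANGUAGE BangPatterns, UnboxedTuples #-}
module Main where

-- Hypothetical toy table, not the linear-base HashMap.
data Table = Table
  !Int          -- capacity, cached at construction and stored strictly
  ![Maybe Int]  -- slots: Nothing is empty, Just k holds key k

-- Linear probing from a start index. The result is an unboxed tuple
-- (# slot index, probe distance #), so no tuple is allocated on the heap.
-- An index of -1 means the whole table was scanned without success.
probeFrom :: Int -> Int -> Table -> (# Int, Int #)
probeFrom key start (Table cap slots) = go start 0
  where
    go !ix !dist
      | dist >= cap = (# (-1), dist #)
      | otherwise = case slots !! ix of
          Nothing -> (# ix, dist #)
          Just k
            | k == key -> (# ix, dist #)
            | otherwise -> go ((ix + 1) `mod` cap) (dist + 1)

main :: IO ()
main =
  case probeFrom 7 1 (Table 4 [Just 3, Just 5, Just 7, Nothing]) of
    (# ix, dist #) -> print (ix, dist)
```

The caller pattern-matches on the unboxed tuple directly, which is the shape a hot inner-loop function such as a probe would typically take.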

aspiwack approved these changes Jul 12, 2021

Inline alterF and alter

a335473

Fix GHC 9.0 compatibility

f3fd127

utdemir force-pushed the ud/optimise-hashmaps branch from 75192a2 to f3fd127 Compare July 22, 2021 04:04

utdemir merged commit ed59552 into master Jul 22, 2021

utdemir deleted the ud/optimise-hashmaps branch July 22, 2021 04:14
