improvement: Add atomic update on TrieMap #5938

jkciesluk · 2023-12-13T09:42:04Z

connected to #5072

filipwiech · 2023-12-19T07:02:34Z

mtags/src/main/scala/scala/meta/internal/mtags/AtomicTrieMap.scala

+  def updateWith(key: K)(remappingFunc: Option[V] => Option[V]): Unit = {
+    val computeFunction = new java.util.function.BiFunction[K, V, V] {
+      override def apply(k: K, v: V): V = {
+        trieMap.get(key) match {


Not sure, but if this function referenced k instead of key (the input of computeFunction rather than the parameter of updateWith), could we extract it into a constant defined at the AtomicTrieMap class level, so that a new one doesn't have to be allocated on every updateWith call? 🤔

I don't think that's possible, because the function also needs access to remappingFunc.
We could change the type of trieMap to TrieMap[K, Set[V]], since that's the only way we use it, and add an append function that would look the way you suggested 🤔

Edit: I think that won't work either, since we'd still need to pass the value to append.

Sorry, you're absolutely right, somehow I missed the remappingFunc. 👍

tgodzik · 2023-12-20T16:13:48Z

mtags/src/main/scala/scala/meta/internal/mtags/AtomicTrieMap.scala

+        null.asInstanceOf[V]
+      }
+    }
+    concurrentMap.compute(key, computeFunction)


Doesn't this duplicate the amount of data we hold? Should we just use ConcurrentMap instead?

The size of concurrentMap is always 0; it's only used to provide an atomic updateWith for trieMap.
I don't think we should simply replace TrieMap with ConcurrentMap, because get on TrieMap should be faster.
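For comparison, the alternative raised in the question above (holding the data directly in a concurrent Java map) could look roughly like the sketch below. It is only an illustration and not code from this PR: `DirectComputeSketch`, `symbols`, `append`, and the `String` key and value types are hypothetical. Whether it would be faster or slower than TrieMap is not established in this thread.

```scala
import java.util.concurrent.ConcurrentHashMap
import java.util.function.BiFunction

// Hypothetical sketch: store the data in ConcurrentHashMap itself and
// rely on compute for atomic read-modify-write of a single key.
object DirectComputeSketch {
  private val symbols = new ConcurrentHashMap[String, Set[String]]()

  // Atomically adds `value` to the set stored under `key`.
  def append(key: String, value: String): Unit = {
    val addValue = new BiFunction[String, Set[String], Set[String]] {
      // `prev` is null when the key has no mapping yet.
      override def apply(k: String, prev: Set[String]): Set[String] =
        if (prev == null) Set(value) else prev + value
    }
    symbols.compute(key, addValue)
    ()
  }

  // ConcurrentHashMap.get returns null for a missing key; wrap it in Option.
  def get(key: String): Option[Set[String]] = Option(symbols.get(key))
}
```

Note that `addValue` still has to capture `value`, so a new function object is created on each call here as well.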

> Size of concurrentMap is always 0, it's only used to provide atomic updateWith for trieMap.

Can you ELI5 this? :D

> get on TrieMap should be faster

Can you share some source about that, benchmark maybe? I did some reading, found https://medium.com/@igabaydulin/java-concurrenthashmap-vs-scala-concurrent-map-e185e8a0b798 which then led me to scala/scala#10027 and I need to read everything once again because I'm already lost. Best part - those are not even about TrieMap 😅

> Size of concurrentMap is always 0, it's only used to provide atomic updateWith for trieMap.

> Can you ELI5 this? :D

Sure :D
We need an atomic way to update a TrieMap with a remapping function, but its API doesn't provide one.
ConcurrentHashMap can do that with its compute function.
We don't need to store any data in the ConcurrentHashMap; we only use it as a sort of lock on the value being updated.
So our compute function atomically calculates the new value to be stored, but writes it only to the TrieMap. It returns null to the ConcurrentHashMap, which means no mapping is stored there.
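The explanation above can be sketched roughly as follows. This is a minimal illustration and not the code merged in this PR: the class name `AtomicTrieMapSketch`, the field name `lockMap`, and the helper `lockMapSize` are hypothetical, and the body of `apply` is an assumption about how the remapping result is written back. It relies on the documented behavior of `ConcurrentHashMap.compute`: the whole invocation is performed atomically, and returning null leaves the key unmapped.

```scala
import java.util.concurrent.ConcurrentHashMap
import java.util.function.BiFunction
import scala.collection.concurrent.TrieMap

// Hypothetical sketch: TrieMap holds the data, ConcurrentHashMap only
// serializes updates that go through updateWith.
final class AtomicTrieMapSketch[K, V] {
  private val trieMap = new TrieMap[K, V]()
  // Never holds entries: compute always returns null below.
  private val lockMap = new ConcurrentHashMap[K, V]()

  def get(key: K): Option[V] = trieMap.get(key)

  def updateWith(key: K)(remappingFunc: Option[V] => Option[V]): Unit = {
    val computeFunction = new BiFunction[K, V, V] {
      override def apply(k: K, v: V): V = {
        // Read, remap, and write back while compute holds its lock for k.
        remappingFunc(trieMap.get(k)) match {
          case Some(value) => trieMap.update(k, value)
          case None => trieMap.remove(k)
        }
        // null means "no mapping", so lockMap stays empty.
        null.asInstanceOf[V]
      }
    }
    lockMap.compute(key, computeFunction)
    ()
  }

  // Exposed only so the sketch can show that lockMap stays empty.
  def lockMapSize: Int = lockMap.size()
}
```

In this sketch only updates made through `updateWith` are serialized; a write that touched `trieMap` directly would bypass the lock.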

> Can you share some source about that, benchmark maybe? I did some reading, found https://medium.com/@igabaydulin/java-concurrenthashmap-vs-scala-concurrent-map-e185e8a0b798 which then led me to scala/scala#10027 and I need to read everything once again because I'm already lost. Best part - those are not even about TrieMap 😅

To be honest, I'm not sure anymore 😅. I thought using a trie would result in better performance, but from what I've read it could be similar to or slightly worse than ConcurrentHashMap (TrieMap's biggest advantage is snapshots, but we are not using them).
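For context, the snapshot feature mentioned above looks like this in use. The sketch is illustrative only; `SnapshotSketch`, `live`, and `frozen` are hypothetical names, and nothing here is taken from the PR.

```scala
import scala.collection.concurrent.TrieMap

// Hypothetical sketch of TrieMap snapshots: a read-only snapshot keeps
// the contents as they were when it was taken.
object SnapshotSketch {
  val live: TrieMap[String, Int] = TrieMap("a" -> 1)

  // Point-in-time, read-only view of `live`.
  val frozen: scala.collection.Map[String, Int] = live.readOnlySnapshot()

  // Later writes to `live` are not visible through `frozen`.
  live.update("a", 10)
  live.update("b", 2)
}
```

Since the PR does not use snapshots, this advantage does not apply to AtomicTrieMap.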

tgodzik

Thanks for explaining!

jkciesluk marked this pull request as draft December 13, 2023 09:42

jkciesluk changed the title ~~improvement: Add atomix update on TrieMap~~ improvement: Add atomic update on TrieMap Dec 13, 2023

improvement: Add atomix update on TrieMap

07c3946

jkciesluk force-pushed the i5072 branch from d15f542 to 07c3946 Compare December 15, 2023 10:04

jkciesluk marked this pull request as ready for review December 18, 2023 09:09

filipwiech reviewed Dec 19, 2023


jkciesluk requested review from tgodzik and kasiaMarek December 20, 2023 08:55

tgodzik reviewed Dec 20, 2023


tgodzik approved these changes Dec 29, 2023


jkciesluk merged commit 276a02c into scalameta:main Dec 29, 2023
24 of 25 checks passed

jkciesluk deleted the i5072 branch December 29, 2023 13:05


jkciesluk commented Dec 13, 2023

filipwiech Dec 19, 2023

jkciesluk Dec 20, 2023 •

edited

filipwiech Dec 20, 2023

tgodzik Dec 20, 2023

jkciesluk Dec 29, 2023

kpodsiad Jan 8, 2024

jkciesluk Jan 8, 2024

tgodzik left a comment
