Increase readpartial() from 1KB to 16KB #880

Merged
1 commit merged into redis:master on Nov 6, 2019

Conversation

@schanjr commented Nov 6, 2019

I noticed the redis-rb gem reads 1KB per readpartial() call. This is very small compared to Mongrel, Unicorn, Puma, Passenger, and Net::HTTP, all of which use 16KB as their read length.

I wanted to see what your feedback is on this.
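
To illustrate the pattern in question, here is a rough sketch of the kind of chunked read loop being discussed (not the gem's actual code; the constant names and buffer handling are illustrative):

require "socket"

# Illustrative constants only -- not redis-rb's actual source.
OLD_CHUNK_SIZE = 1 * 1024   # previous read length
NEW_CHUNK_SIZE = 16 * 1024  # read length proposed in this PR

# Fill a buffer until it holds at least one complete protocol line.
# A larger chunk means fewer readpartial calls (and fewer syscalls) for
# large replies, at the cost of a bigger temporary string per read.
def fill_buffer(sock, buffer, chunk_size = NEW_CHUNK_SIZE)
  buffer << sock.readpartial(chunk_size) until buffer.include?("\r\n")
  buffer
end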

@byroot (Collaborator) commented Nov 6, 2019

in comparison to something like Mongrel, Unicorn, Puma, Passenger, and Net::HTTP,

All of which deal with HTTP requests / responses, so I'm not sure it's a fair comparison.

However, digging into hiredis shows that some experimentation was done there and that 16k does indeed yield better performance: redis/hiredis@f7f022e

So 👍

@byroot merged commit c7b69ba into redis:master on Nov 6, 2019
@liaden commented Jul 1, 2020

Our production sidekiq instance saw memory usage grow drastically with this change:
[Screenshot: Sidekiq memory usage graph, 2020-07-01 11:11 AM]

We downgraded on 5/20, and memory usage went back to the 8-10GB range, whereas it was at 11.5-14GB on 4.1.4. Would it be reasonable to expose this value as a configurable option with a default of 16KB? I can open a PR if it is.

@byroot (Collaborator) commented Jul 1, 2020

I'm not against making it configurable, but what concerns me more is how it could so significantly increase your memory usage.

This is just a buffer, it's supposed to be consumed quickly, it shouldn't hold memory for that long.

@liaden commented Jul 1, 2020

One of my thoughts is that pulling off 16.5kb of data is going to over-allocate 15.5kb of memory, since the second 16KB read only holds 0.5kb of data (and that waste is accumulating over time for some reason), hence the memory increase. If you have any ideas on where I can investigate to see what is happening, that would help short-circuit things on my end a bit.
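
(One way to check this directly would be to measure what readpartial actually returns for a short read; a rough sketch below, with a throwaway local socket standing in for Redis -- the sizes are arbitrary:)

require "objspace"
require "socket"

# A tiny local server that sends far less than the 16KB read length.
server = TCPServer.new("127.0.0.1", 0)
client = TCPSocket.new("127.0.0.1", server.addr[1])
peer = server.accept
peer.write("x" * 500) # only 500 bytes available

chunk = client.readpartial(16 * 1024)

# Compare bytes actually read vs. memory the returned string retains.
puts "bytesize: #{chunk.bytesize}, memsize: #{ObjectSpace.memsize_of(chunk)}"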

@casperisfine

(answering from another computer)

which is accumulating over time for some reason

Well, if it's accumulating, the bug would be that it accumulates at all. I don't see how 1 chunk of 16K would be "accumulating" faster than 4 blocks of 4K.

I dumped the buffer on my machine to see:

setup:

Redis.new.set("foo", "bar" * 10_000)

After reading from the socket, @buffer is:

{"address":"0x7faf5603eb68", "type":"STRING", "class":"0x7faf520bb5b8", "bytesize":16384, "capacity":24575, "value":"$30000\r\nbarbarb........barbarbar", "encoding":"UTF-8", "memsize":24616, "flags":{"wb_protected":true}}

But just after it gets slice!'d:

{"address":"0x7faf5603eb68", "type":"STRING", "class":"0x7faf520bb5b8", "shared":true, "encoding":"UTF-8", "references":["0x7faf56037070"], "memsize":40, "flags":{"wb_protected":true}}

We see that Ruby is smart and simply points to a shared empty string. We can assume the allocated string buffer was freed (I'll double check, but I highly doubt such an obvious bug wouldn't have been reported and fixed yet).
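
(For reference, a dump like the ones above can be produced with the objspace extension; a minimal sketch, where the way the buffer is reached is an assumption and may differ between redis-rb versions:)

require "objspace"
require "redis"

redis = Redis.new
redis.set("foo", "bar" * 10_000)
redis.get("foo")

# Assumption for illustration: with the Ruby driver, the read buffer lives
# on the socket object as @buffer.
sock = redis._client.connection.instance_variable_get(:@sock)
buffer = sock && sock.instance_variable_get(:@buffer)

# ObjectSpace.dump emits the same JSON shown above (bytesize, capacity,
# memsize, shared flag, ...).
puts ObjectSpace.dump(buffer) if buffer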

So here's a bunch of theories:

1 - Connection leaks

Maybe somehow you are leaking Connection::Ruby instances? I've seen reports and suspicions of such issues, but never described well enough for me to be able to reproduce and fix them.

A way to verify this would be to get a shell into a ruby process using rbtrace, and then check ObjectSpace.each_object(Redis::Connection::Ruby).size to see if it's too high. Or, if rbtrace is complicated, you can write a sidekiq middleware to log that number somewhere.
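
For example, a minimal sketch of such a middleware (the class name and logging style are arbitrary choices, and this assumes the Ruby connection driver is in use):

require "sidekiq"
require "redis"

# Logs the number of live Redis connection driver objects after each job,
# so a leak shows up as a steadily growing count in the logs.
class RedisConnectionCounter
  def call(_worker, _job, _queue)
    yield
  ensure
    count = ObjectSpace.each_object(Redis::Connection::Ruby).count
    Sidekiq.logger.info("Redis::Connection::Ruby instances: #{count}")
  end
end

Sidekiq.configure_server do |config|
  config.server_middleware do |chain|
    chain.add RedisConnectionCounter
  end
end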

If you can prove this is happening, then I'd be extremely interested in debugging this forward with you.

2 - Memory fragmentation

If you are using the default memory allocator, untuned, and sidekiq with many threads, then you might very well experience a lot of memory fragmentation.

  • What version of Ruby?
  • What allocator / what settings?
  • Have you ever tried jemalloc / tcmalloc? They can help a lot with fragmentation. Alternatively, configuring MALLOC_ARENA_MAX=2 can help a lot as well.

Other questions

  • These shark fins are quite concerning. What is the graph showing exactly? I presume RSS?
  • If you deploy a sidekiq worker to only process tons of empty jobs, does it leak the same way?
  • Same thing but with a simple job that does a simple get/set in Redis?
  • Are you seeing similar growth with your web processes?

@liaden commented Jul 1, 2020

Thanks for the thorough answer. I missed today's release, so it will be tomorrow before I can try out rbtrace specifically in production.

Answers

  1. Ruby 2.7.1 with the standard allocator.
  2. Attempted to switch to jemalloc via LD_PRELOAD, as well as using the fullstaq.io container, then ran a heavy Apache Bench run with 5 concurrent requests against 5 puma processes and did not see any memory reduction, which is sad since I've previously used jemalloc to great effect.
  3. MALLOC_ARENA_MAX=2 was also tried by another engineer, with no improvement.
  4. Yes, RSS.
  5. Will try out a bit this afternoon.
  6. Ditto.
  7. Yes, and maybe the same proportional growth:

[Screenshot: web process memory usage graph, 2020-07-01 12:45 PM]

@byroot (Collaborator) commented Jul 1, 2020

That last graph is much less concerning. The RSS seems to stabilize around 1GB, which makes much more sense and is quite typical. The average app has a bunch of code and data that isn't properly eager-loaded, so there is some memory growth on the first few requests. Plus, it takes some time to hit a request that allocates quite a lot, and after that the memory isn't reclaimed.

If anything, this graph makes the previous one much more concerning. Your sidekiq processes are using several times more memory than your web processes, which is not normal (unless they use many more threads), and they don't seem to stabilize. You either have a nasty leak in your sidekiq processes, or excessive fragmentation.

@liaden commented Aug 12, 2020

@byroot I just wanted to follow up that the big issue was a memory leak in sidekiq/sidekiq#4652

I think the above configuration change made it explode for us due to a bunch of Ruby internal objects that look like:

-[ RECORD 1 ]-----+------------------------------------
id                | 576368
time              | ¤
type              | IMEMO
node_type         | ¤
root              | ¤
address           | 0x556f575b0c90
value             | ¤
klass             | 0x556f506f0a58
name              | ¤
struct            | ¤
file              | /usr/local/lib/ruby/2.7.0/socket.rb
line              | 452
method            | __read_nonblock
generation        | 57
size              | ¤
length            | ¤
memsize           | 72
bytesize          | ¤
capacity          | ¤
ivars             | ¤
fd                | ¤
encoding          | ¤
default_address   | ¤
freezed           | ¤
fstring           | ¤
embedded          | ¤
shared            | ¤
flag_wb_protected | t
flag_old          | ¤
flag_long_lived   | ¤
flag_marking      | ¤
flag_marked       | ¤

In total there were 183 of those IMEMO objects:

mem_analysis=# select count(*), o.file from space_objects o join space_objects k on o.klass = k.address where o.generation = 57 and k.name is null group by o.file;
 count |                                      file
-------+---------------------------------------------------------------------------------
     9 | /usr/local/bundle/gems/connection_pool-2.2.2/lib/connection_pool/timed_stack.rb
     7 | /usr/local/bundle/gems/sidekiq-6.0.7/lib/sidekiq/util.rb
   183 | /usr/local/lib/ruby/2.7.0/socket.rb
     1 | eval
(4 rows)

I don't know if any of the above proves useful for things related solely to redis-rb, but I wanted to share just in case. Anyway, thanks a huge bunch for the initial pointers!
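
(For anyone wanting to reproduce this kind of analysis, a rough sketch of producing the heap dump these records came from -- the output path is arbitrary, and loading the dump into Postgres as above needs a separate tool:)

require "objspace"

# Track allocation sites so the dump records file/line/generation, which is
# what makes groupings like "socket.rb:452, generation 57" possible.
ObjectSpace.trace_object_allocations_start

# ... run the workload to inspect (e.g. a batch of Sidekiq jobs) ...

GC.start

# Dump every live object, one JSON document per line.
File.open("/tmp/heap.dump", "w") do |io|
  ObjectSpace.dump_all(output: io)
end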

@casperisfine

Thanks for sharing, glad you figured out your issue.
