Implement io.ReaderFrom/WriterTo for Conn #68

databus23 · 2021-02-19T23:33:40Z

This change increases performance when proxying wrapped connections using io.Copy.
Since go 1.11 copying between tcp connections uses the splice system call on linux yielding considerable performance improvments.
See: https://golang.org/doc/go1.11#net

Signed-off-by: Fabian Ruff fabian.ruff@sap.com

coveralls · 2021-02-19T23:34:14Z

Coverage decreased (-0.05%) to 94.177% when pulling ce59419 on databus23:readerfrom-writerto into fff0abf on pires:main.

This change increase performance when proxying wrapped connections using io.Copy. Since go 1.11 copying between tcp connections uses the splice system call on linux yielding considerable performance improvments. See: https://golang.org/doc/go1.11#net Signed-off-by: Fabian Ruff <fabian.ruff@sap.com>

pires · 2021-02-20T00:44:47Z

Thank you, Fabian. Can you, please, implement example tests so to have code others can learn from while keeping up with current code coverage?

pires · 2021-02-20T00:45:24Z

Maybe even benchmark tests to measure the actual performance gains?

Signed-off-by: Fabian Ruff <fabian.ruff@sap.com>

databus23 · 2021-02-23T13:09:30Z

@pires I added some tests to retain code coverage.
As requested I also added a simple benchmark for the tcp proxy use case I'm seeking to optimise.

> go test -run=XXX -bench=Bench -count 5 -benchmem > old.txt
> go test -run=XXX -bench=Bench -count 5 -benchmem > new.txt
> benchstat old.txt new.txt

name              old time/op    new time/op    delta
TCPProxy16KB-8       458µs ± 4%     454µs ±11%     ~     (p=0.690 n=5+5)
TCPProxy32KB-8       465µs ± 3%     468µs ±15%     ~     (p=0.690 n=5+5)
TCPProxy64KB-8       505µs ± 2%     464µs ± 8%   -8.16%  (p=0.016 n=5+5)
TCPProxy128KB-8     1.11ms ±52%    0.54ms ± 6%  -50.91%  (p=0.008 n=5+5)
TCPProxy256KB-8      823µs ± 5%     628µs ±10%  -23.74%  (p=0.016 n=4+5)
TCPProxy512KB-8     1.17ms ± 9%    0.79ms ± 9%  -32.99%  (p=0.008 n=5+5)
TCPProxy1024KB-8    1.89ms ±19%    1.11ms ± 5%  -41.15%  (p=0.008 n=5+5)
TCPProxy2048KB-8    2.86ms ± 7%    1.67ms ± 6%  -41.46%  (p=0.008 n=5+5)

name              old alloc/op   new alloc/op   delta
TCPProxy16KB-8      72.2kB ± 0%     6.5kB ± 0%  -90.98%  (p=0.008 n=5+5)
TCPProxy32KB-8      72.2kB ± 0%     6.5kB ± 0%  -90.98%  (p=0.008 n=5+5)
TCPProxy64KB-8      72.2kB ± 0%     6.5kB ± 0%  -90.98%  (p=0.008 n=5+5)
TCPProxy128KB-8     72.2kB ± 0%     6.5kB ± 0%  -90.98%  (p=0.008 n=5+5)
TCPProxy256KB-8     72.2kB ± 0%     6.5kB ± 0%  -90.97%  (p=0.008 n=5+5)
TCPProxy512KB-8     72.2kB ± 0%     6.5kB ± 0%  -90.97%  (p=0.008 n=5+5)
TCPProxy1024KB-8    72.2kB ± 0%     6.5kB ± 0%  -90.96%  (p=0.008 n=5+5)
TCPProxy2048KB-8    72.2kB ± 0%     6.5kB ± 0%  -90.97%  (p=0.008 n=5+5)

name              old allocs/op  new allocs/op  delta
TCPProxy16KB-8        65.0 ± 0%      61.0 ± 0%   -6.15%  (p=0.008 n=5+5)
TCPProxy32KB-8        65.0 ± 0%      61.0 ± 0%   -6.15%  (p=0.008 n=5+5)
TCPProxy64KB-8        65.0 ± 0%      61.0 ± 0%   -6.15%  (p=0.008 n=5+5)
TCPProxy128KB-8       65.0 ± 0%      61.0 ± 0%   -6.15%  (p=0.008 n=5+5)
TCPProxy256KB-8       65.0 ± 0%      61.0 ± 0%   -6.15%  (p=0.008 n=5+5)
TCPProxy512KB-8       65.0 ± 0%      61.0 ± 0%   -6.15%  (p=0.008 n=5+5)
TCPProxy1024KB-8      65.0 ± 0%      61.0 ± 0%   -6.15%  (p=0.008 n=5+5)
TCPProxy2048KB-8      65.0 ± 0%      61.0 ± 0%   -6.15%  (p=0.008 n=5+5)

I did those benchmarks in docker on macOS, so there is a VM involved which might introduce some noise. The results I'm getting are pretty consistent though and are in line with the originally reported gains when the splice optimisation was introduced: golang/go#10948 (comment)

pires · 2021-02-23T13:19:21Z

Thanks a lot!!!

databus23 force-pushed the readerfrom-writerto branch from d38c8e3 to 3e5f2ce Compare February 19, 2021 23:36

databus23 force-pushed the readerfrom-writerto branch from 3e5f2ce to 1994c14 Compare February 19, 2021 23:37

Add tests

394c64c

Signed-off-by: Fabian Ruff <fabian.ruff@sap.com>

databus23 force-pushed the readerfrom-writerto branch from 256ec68 to 394c64c Compare February 22, 2021 19:50

Add benchmark for tcp proxy use case

ce59419

Signed-off-by: Fabian Ruff <fabian.ruff@sap.com>

pires approved these changes Feb 23, 2021

View reviewed changes

pires merged commit c4bcea2 into pires:main Feb 23, 2021

pires added this to the 0.5 milestone Mar 2, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement io.ReaderFrom/WriterTo for Conn #68

Implement io.ReaderFrom/WriterTo for Conn #68

databus23 commented Feb 19, 2021 •

edited

coveralls commented Feb 19, 2021 •

edited

pires commented Feb 20, 2021

pires commented Feb 20, 2021

databus23 commented Feb 23, 2021 •

edited

pires commented Feb 23, 2021

Implement io.ReaderFrom/WriterTo for Conn #68

Implement io.ReaderFrom/WriterTo for Conn #68

Conversation

databus23 commented Feb 19, 2021 • edited

coveralls commented Feb 19, 2021 • edited

pires commented Feb 20, 2021

pires commented Feb 20, 2021

databus23 commented Feb 23, 2021 • edited

pires commented Feb 23, 2021

databus23 commented Feb 19, 2021 •

edited

coveralls commented Feb 19, 2021 •

edited

databus23 commented Feb 23, 2021 •

edited