[C#] Added more string read/write methods to the DirectBuffer #729 #845

MFrpurdy · 2021-05-10T18:31:28Z

Address #729

Added DirectBuffer methods for reading and writing string that need encoding.
Added convenience accessors for the various Encodings defined in the schema.
Changed the C# Code generator to create read/write methods using the new DirectBuffer methods.
Changed the sample to use the new methods.

The suggestions that the ticket creator had require .net48+ as far as I can see - using the Encoding.GetString() methods. I kept with the current .net45 version and implmented the 'spirit' of of the request.

More, hopefully efficient, mechanisms to read and write strings to and from the DirectBuffer.

…ry-encoding

mjpt777 · 2021-05-11T11:14:54Z

How is the character encoding accounted for? ASCII vs UTF-16 for example.

MFrpurdy · 2021-05-12T02:59:38Z

How is the character encoding accounted for? ASCII vs UTF-16 for example.

In the generated code the Encoding is resovled and available:

public const string ModelCharacterEncoding = "UTF-8";
public static Encoding ModelResolvedCharacterEncoding = Encoding.GetEncoding(ModelCharacterEncoding);

And used in the generated code like this:
return _buffer.GetStringFromBytes(ModelResolvedCharacterEncoding, limit + sizeOfLengthField, dataLength);

mjpt777 · 2021-05-12T14:18:02Z

@billsegall Do you have a view on this?

billsegall · 2021-05-12T22:21:45Z

@mjpt777 It looks ok but I wanted to find the time to write a couple of benchmark tests so we know it actually fixes any performance issues it is addressing.

MFrpurdy · 2021-05-12T22:23:55Z

@mjpt777 It looks ok but I wanted to find the time to write a couple of benchmark tests so we know it actually fixes any performance issues it is addressing.

I'm happy to do some benchmarking and publish the results here.

billsegall · 2021-05-12T22:32:58Z

MFrpurdy That would be great thankyou

MFrpurdy · 2021-05-19T22:22:25Z

CarBenchmark

In order to compare apples to apples I modifed the CarBenchmark to encode from a string to bytes every call and decode to a string.
In the original CarBenchmark the strings are encoded to byte[] once and those same bytes used in each iteration. Likewise the decoding of string values are only done to byte[] not to a string.
In use cases like this, where raw bytes are written and read, the existing byte based methods are much faster.
However if the use case it to take an arbitrary string and write it to the buffer and read convert a byte[] to a string; the new methods allocate less (or zero) and perform faster. I also hope it makes the client code look a little neater.

BenchmarkDotNet=v0.12.1, OS=ubuntu 20.04
Intel Core i7-10710U CPU 1.10GHz, 1 CPU, 12 logical and 6 physical cores
.NET Core SDK=5.0.202
[Host] : .NET Core 3.1.14 (CoreCLR 4.700.21.16201, CoreFX 4.700.21.16208), X64 RyuJIT
DefaultJob : .NET Core 3.1.14 (CoreCLR 4.700.21.16201, CoreFX 4.700.21.16208), X64 RyuJIT

BEFORE - Modified to Encode and Decode From/To Strings
Encoding example:
private static readonly Encoding VehicleCodeEncoding = Encoding.GetEncoding(Car.VehicleCodeCharacterEncoding);
car.Engine.SetManufacturerCode(ManufacturerCodeEncoding.GetBytes("123"), 0);

Decoding example:
length = car.GetManufacturer(_buffer, 0, _buffer.Length);
var usage = ManufacturerEncoding.GetString(_buffer, 0, length);

Method	Mean	Error	StdDev	Gen 0	Gen 1	Gen 2	Allocated
Encode	408.6 ns	3.10 ns	2.75 ns	0.0458	-	-	288 B
Decode	379.6 ns	1.10 ns	1.03 ns	0.0467	-	-	296 B

AFTER
Using the new string methods.

Encoding example:
car.SetVehicleCode("CODE12");

Decoding example:
var actCode = car.GetActivationCode();

Method	Mean	Error	StdDev	Gen 0	Gen 1	Gen 2	Allocated
Encode	271.0 ns	1.74 ns	1.63 ns	-	-	-	-
Decode	329.9 ns	0.60 ns	0.50 ns	0.0467	-	-	296 B

billsegall · 2021-05-20T12:22:08Z

@MFrpurdy I suspect the answer is to document the performance and leave the choice up to the user. I think the CarExample should change as you suggest as it is simpler.

Could you please add the benchmarking code to the pr. I might have a play.

…hmark which encodes and decodes to/from string; and a version wich uses the new methods to encode and decode strings

MFrpurdy · 2021-05-21T20:45:37Z

@billsegall I've added a modified CarBenchmark file that encodes to and from strings using the original methods. I've also added the same CarBenchamark test but using the new methods.

mjpt777 · 2021-05-23T17:22:38Z

@billsegall When you are happy with this let me know and I'll merge.

billsegall · 2021-05-24T00:10:24Z

I think the simplicity alone is worth it and people can always make performance dependent choices to fit their circumstances.

rca22 · 2021-05-25T09:05:26Z

I'd just like to comment that my company has been using a version of Rob's changes, and they were critical in terms of usability.
Some strings that you send, you know in advance and can pre-compute the bytes, and use the previous methods, but there are cases where this isn't practical, in which case the new methods are much simpler. Similarly if you ever want to parse an SBE message and turn it into an intermediate C# representation, you'll generally want to convert bytes to System.String before storing it.

Rob's changes cut down the amount of boilerplate code we would have had to have written to pull these strings out of complex messages - and if users want to be super careful about allocations etc., they still have the option of using the previously available methods. You can see that because the encoding is pre-computed as a static variable on assembly loading, the new methods are a bit quicker compared to a piece of code that has to work that out on the fly from the encoding string.
In our experience, because after you retrieve these values you'll want to compare them to other strings, it would be a major exercise in .Net (read total PITA) to completely avoid string creation, which should only be undertaken if you really need the latency performance - which of course SBE is designed to allow - but which may not be necessary depending on the application.

rca22 · 2021-05-25T14:36:05Z

@mjpt777 when does Real Logic next plan to do a release of the tool, for my information? Thanks for getting this merged.

mjpt777 · 2021-05-25T14:39:31Z

@rca22 Sometime within the next month.

rca22 · 2021-06-14T11:13:44Z

@mjpt777 thanks very much for doing a release of this and the other changes. Would you mind adding a new package to NuGet? This hasn't been done since 1.20.4.

mjpt777 · 2021-06-14T11:43:59Z

@mjpt777 thanks very much for doing a release of this and the other changes. Would you mind adding a new package to NuGet? This hasn't been done since 1.20.4.

@billsegall has been doing the NuGet releases.

billsegall · 2021-06-14T12:34:11Z

I'll try and freshen a release this week

billsegall · 2021-06-15T00:34:24Z

A new release should now be availble at nuget.org

Rob Purdy and others added 8 commits May 5, 2021 21:53

[C#] Add more string reading/writing methods to DirectBuffer

27b9188

More, hopefully efficient, mechanisms to read and write strings to and from the DirectBuffer.

[C#] Reverted to .net45

1a8e321

Merge branch 'real-logic:master' into master

6cc75dc

[C#] Reverted to .net45

2b494b0

Merge branch 'master' of https://github.com/MarketFactory/simple-bina…

047cdc2

…ry-encoding

[C#] Updated Samples to use the new String methods

262d2a3

[C#] Fixed checkstyle

5f54a0d

[C#] Cleaning

76bf8b6

Added Modified Benchmarks\nBoth a modified version of the the CarBenc…

0143183

…hmark which encodes and decodes to/from string; and a version wich uses the new methods to encode and decode strings

mjpt777 merged commit 7690fae into real-logic:master May 25, 2021

mjpt777 added a commit that referenced this pull request May 25, 2021

[C#] Tidy up after merge of PR #845.

43ee658

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[C#] Added more string read/write methods to the DirectBuffer #729 #845

[C#] Added more string read/write methods to the DirectBuffer #729 #845

MFrpurdy commented May 10, 2021

mjpt777 commented May 11, 2021

MFrpurdy commented May 12, 2021

mjpt777 commented May 12, 2021

billsegall commented May 12, 2021

MFrpurdy commented May 12, 2021

billsegall commented May 12, 2021

MFrpurdy commented May 19, 2021 •

edited

billsegall commented May 20, 2021 •

edited

MFrpurdy commented May 21, 2021

mjpt777 commented May 23, 2021

billsegall commented May 24, 2021

rca22 commented May 25, 2021

rca22 commented May 25, 2021

mjpt777 commented May 25, 2021

rca22 commented Jun 14, 2021

mjpt777 commented Jun 14, 2021

billsegall commented Jun 14, 2021

billsegall commented Jun 15, 2021 •

edited

[C#] Added more string read/write methods to the DirectBuffer #729 #845

[C#] Added more string read/write methods to the DirectBuffer #729 #845

Conversation

MFrpurdy commented May 10, 2021

mjpt777 commented May 11, 2021

MFrpurdy commented May 12, 2021

mjpt777 commented May 12, 2021

billsegall commented May 12, 2021

MFrpurdy commented May 12, 2021

billsegall commented May 12, 2021

MFrpurdy commented May 19, 2021 • edited

billsegall commented May 20, 2021 • edited

MFrpurdy commented May 21, 2021

mjpt777 commented May 23, 2021

billsegall commented May 24, 2021

rca22 commented May 25, 2021

rca22 commented May 25, 2021

mjpt777 commented May 25, 2021

rca22 commented Jun 14, 2021

mjpt777 commented Jun 14, 2021

billsegall commented Jun 14, 2021

billsegall commented Jun 15, 2021 • edited

MFrpurdy commented May 19, 2021 •

edited

billsegall commented May 20, 2021 •

edited

billsegall commented Jun 15, 2021 •

edited