
Pool commands #1590

Open · wants to merge 1 commit into main

Conversation

@benaadams (Contributor)

Rather than creating them per request

@sebastienros (Member)

No change on fortunes, so tried on updates:

db                       updates-baseline   updates-pool   delta
CPU Usage (%)            96                 97             +1.04%
Raw CPU Usage (%)        2,698.85           2,706.41       +0.28%
Working Set (MB)         524                524            0.00%
Build Time (ms)          1,856              1,762          -5.06%
Start Time (ms)          326                322            -1.23%

application              updates-baseline   updates-pool   delta
CPU Usage (%)            70                 70             0.00%
Raw CPU Usage (%)        1,952.18           1,960.89       +0.45%
Working Set (MB)         481                480            -0.21%
Build Time (ms)          4,750              4,964          +4.51%
Start Time (ms)          1,595              1,561          -2.13%
Published Size (KB)      98,006             98,007         +0.00%

load                     updates-baseline   updates-pool   delta
CPU Usage (%)            2                  2              0.00%
Raw CPU Usage (%)        43.98              45.07          +2.49%
Working Set (MB)         7                  7              0.00%
Build Time (ms)          3,316              3,317          +0.03%
Start Time (ms)          0                  0
Published Size (KB)      76,389             76,389         0.00%
First Request (ms)       81                 81             0.00%
Requests/sec             12,737             12,913         +1.38%
Requests                 191,923            194,986        +1.60%
Mean latency (ms)        22.53              22.23          -1.33%
Max latency (ms)         383.46             285.45         -25.56%
Bad responses            0                  0
Socket errors            0                  0
Read throughput (MB/s)   9.20               9.33           +1.41%
Latency 50th (ms)        17.54              17.24          -1.71%
Latency 75th (ms)        28.31              27.93          -1.34%
Latency 90th (ms)        43.44              43.06          -0.87%
Latency 99th (ms)        91.41              91.37          -0.04%

@sebastienros (Member)

@roji I think we have tested this approach in the past

@benaadams (Contributor Author) commented Sep 29, 2020

@roji does the command reparse the command string (to convert from ADO to Postgres format, e.g. @param to $1) on each execute, or only if it has changed?

@benaadams (Contributor Author)

Raised issue npgsql/npgsql#3200; it reduces allocations, but the query is reparsed and a new one is generated in PG format on each execution.

@roji (Member) left a comment

Sorry for disappearing here, personal issues.

Yeah, at some point I tested an approach of pooling the ADO.NET facade objects (we can do the same for NpgsqlConnection BTW), and didn't get convincing results - the way I understood it, the overhead of pooling synchronization isn't worth it for very light-weight objects. It's true that skipping the SQL parsing is a more convincing argument, but see npgsql/npgsql#3200 (comment) about improvements in 6.0 which should obviate all that anyway. Also, when I last benchmarked fortunes, SQL parsing was pretty negligible (SQL is very small and doesn't even contain parameters).

But we can definitely revisit all this.

@@ -237,7 +259,55 @@ private async Task<World> ReadSingleRow(NpgsqlCommand cmd)
}
}

private static readonly object[] _cacheKeys = Enumerable.Range(0, 10001).Select((i) => new CacheKey(i)).ToArray();
internal class SqlFortuneCommand : IDisposable
Member

Are these wrapper classes actually necessary? Why not pool NpgsqlCommand directly? As far as I can tell they're mainly there to enqueue themselves back when disposing, but that could just be done by the code using the command instead of disposing, no?

Contributor Author

Mainly so the call sites are just a using block and don't have to worry about pooling, e.g.:

using (var cmd = CreateReadCommand())
{
    cmd.Connection = db;
    // do something with pooled command
}
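
For illustration, a minimal sketch of the kind of wrapper being described, under the assumption of a simple static ConcurrentQueue pool; the names, SQL and details below are illustrative, not the PR's exact code:

using System.Collections.Concurrent;
using Npgsql;

// Sketch: the wrapper owns an NpgsqlCommand, is rented from a static pool, and
// returns itself to the pool on Dispose, so call sites stay a plain using block.
internal sealed class PooledReadCommand : IDisposable
{
    private static readonly ConcurrentQueue<PooledReadCommand> _pool = new();

    public NpgsqlCommand Command { get; }
    public NpgsqlParameter<int> Parameter { get; }

    public NpgsqlConnection Connection { set => Command.Connection = value; }

    private PooledReadCommand()
    {
        Parameter = new NpgsqlParameter<int>("id", 0);
        Command = new NpgsqlCommand("SELECT id, randomnumber FROM world WHERE id = @id");
        Command.Parameters.Add(Parameter);
    }

    public static PooledReadCommand Rent()
        => _pool.TryDequeue(out var pooled) ? pooled : new PooledReadCommand();

    public void Dispose()
    {
        Command.Connection = null;   // detach before returning to the pool
        _pool.Enqueue(this);
    }
}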

Member

OK. Moving the pooling logic here and removing the wrappers might make a tiny bit of difference too.

@benaadams (Contributor Author)

> SQL is very small and doesn't even contain parameters

All the SQL other than the fortunes benchmark contains parameters?

{
    cmd.Connection = db;
    var param = cmd.Parameter;
Contributor Author

In npgsql/npgsql#3200 (comment) you say

> The typical scenario of reexecuting the same command with the same SQL also does it on the same connection, in which case explicit preparation is the right choice and bypasses everything.

Since this is executed 20 times just changing the value, should it do a prepare here?

@roji (Member) commented Oct 15, 2020

Should definitely give it a try... I don't pay as much attention as I should to the non-Fortunes benchmarks.

Automatic preparation still has the advantage of doing one less roundtrip - the first execution prepares and executes in the same go, whereas with explicit preparation they're split. But as you add more executions for a single initial prepare, the impact of that goes down.
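
For concreteness, a hedged sketch of what explicit preparation would look like for the updates-style loop discussed above; the connection, command and parameter names are assumptions, and PrepareAsync/ExecuteReaderAsync are the standard ADO.NET async APIs:

using System;
using System.Threading.Tasks;
using Npgsql;

// Sketch: prepare once explicitly, then reuse the prepared statement for the
// 20 executions that only change the parameter value. "db" is assumed to be an
// open NpgsqlConnection and "cmd" to have a parameter named "id".
static async Task ReadLoopAsync(NpgsqlConnection db, NpgsqlCommand cmd, Random random)
{
    cmd.Connection = db;
    await cmd.PrepareAsync();                 // one extra roundtrip up front

    for (var i = 0; i < 20; i++)
    {
        cmd.Parameters["id"].Value = random.Next(1, 10001);
        await using var reader = await cmd.ExecuteReaderAsync();
        await reader.ReadAsync();
        // ... consume the row ...
    }
}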

Contributor Author

Ye olde score multiplier for composite scores
[image: composite score multipliers]

> Automatic preparation still has the advantage of doing one less roundtrip - the first execution prepares and executes in the same go, whereas with explicit preparation they're split. But as you add more executions for a single initial prepare, the impact of that goes down.

Which is kinda why I want to bypass the parse; then it's auto prep + parse once, rather than parse 20 times with auto prep?

Member

OK, I'll try to take some time this weekend to play around with bypassing the parse. In any case I need to run (and update) the benchmarks for the newest Npgsql 5.0.0 (just released preview1) - will look into the parsing thing as part of that.

@roji (Member) commented Oct 15, 2020

> All the SQL other than the fortunes benchmark contains parameters?

IIRC yeah (though whether a parameter exists or not doesn't matter that much for SQL parsing, just a little bit).

@benaadams (Contributor Author)

Every little helps; top queries per second is 1.1M and aspnet is 472k:

1,185,480 = 20 * 59,274
472,120 = 20 * 23,606

[image]

Base automatically changed from master to main March 8, 2021 18:29
@DamianEdwards (Member)

@benaadams @roji is this old PR still relevant?

@benaadams (Contributor Author)

> @benaadams @roji is this old PR still relevant?

@roji said he was introducing a better way of doing it in a newer version of the driver; not sure of the status of that.

@roji (Member) commented Mar 18, 2023

Things have changed quite a lot since this was done... Here are some thoughts.

Re SQL parsing, Npgsql 6.0 did introduce support for using (native) positional parameters and not parsing SQL (@p -> $1); for more details, see this write-up. This is automatically triggered when the command parameters are unnamed, but when there are no parameters, we do have to parse for backwards compat. IIRC the only benchmark that doesn't have parameters is fortunes, so we currently do parse there, which is unneeded overhead.
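
For reference, the difference described here looks roughly like this; the conn variable and the SQL are illustrative placeholders:

using Npgsql;

// Named placeholder: Npgsql has to parse the SQL and rewrite "@id" to "$1".
using var namedCmd = new NpgsqlCommand(
    "SELECT id, randomnumber FROM world WHERE id = @id", conn);
namedCmd.Parameters.AddWithValue("id", 42);

// Native positional placeholder with an unnamed parameter: the SQL is sent
// as-is, with no parsing/rewriting (Npgsql 6.0+).
using var positionalCmd = new NpgsqlCommand(
    "SELECT id, randomnumber FROM world WHERE id = $1", conn);
positionalCmd.Parameters.Add(new NpgsqlParameter { Value = 42 });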

We do have an app context switch which allows disabling SQL parsing/rewriting globally in the application, even where there are no parameters. Since that switch is global, and since Dapper and EF don't work with positional parameters, turning it on would break them. But unlike our implementation here, our TechEmpower platforms implementation only has the raw ADO.NET variant (no Dapper/EF), so I'm doing that there (in TechEmpower/FrameworkBenchmarks#8005). Though we'll have to figure out what to do if we unify the implementations (#1815).
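
To the best of my knowledge the switch being referred to is the one below; treat the name as an assumption and verify it against the Npgsql docs. It must be set at startup, before the first command executes, and as noted above it would break Dapper/EF code that still uses named placeholders:

// Believed switch name for globally disabling SQL parsing/rewriting.
AppContext.SetSwitch("Npgsql.EnableSqlRewriting", false);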

@roji (Member) commented Mar 18, 2023

The 2nd thing here is pooling the ADO.NET objects (e.g. NpgsqlCommand). With the introduction of NpgsqlDataSource, we'll soon be switching to creating commands directly from that instead of instantiating connections (https://github.com/aspnet/Benchmarks/pull/1816/files#r1139401127):

// instead of:
using var connection = new NpgsqlConnection(...);
using var command = new NpgsqlCommand("SQL", connection);
// we'll just do this:
using var command = dataSource.CreateCommand();

(We can do this since the benchmarks don't involve any connection state (e.g. transactions), and this models multiplexing much more correctly, i.e. just execute a command against the database without needing to care about which connection it goes through or how. This will also likely bring some optimizations later.)
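
Putting the pieces together, a sketch of the data-source pattern end to end; the connection string and SQL are placeholder assumptions:

using Npgsql;

// Build the data source once at startup and reuse it for the application's lifetime.
await using var dataSource = NpgsqlDataSource.Create(connectionString);

// Per request: create a command straight from the data source; the connection
// is handled internally.
await using var cmd = dataSource.CreateCommand(
    "SELECT id, randomnumber FROM world WHERE id = $1");
cmd.Parameters.Add(new NpgsqlParameter { Value = 42 });

await using var reader = await cmd.ExecuteReaderAsync();
await reader.ReadAsync();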

Currently, NpgsqlDataSource.CreateCommand() doesn't pool. If it's really beneficial to do so, this is an optimization we can and should implement inside Npgsql itself; opened npgsql/npgsql#5001 to track this.

/cc @vonzshik @NinoFloris

@roji (Member) commented Mar 18, 2023

So beyond the above two things, I think this can be closed... We should definitely experiment with command pooling in Npgsql and see what happens.

@DamianEdwards (Member)

@roji

> We do have an app context switch which allows disabling SQL parsing/rewriting globally

Are there plans to enable setting this via a property on NpgsqlCommand directly?

@roji (Member) commented Mar 19, 2023

Not really... This whole thing is tricky and somewhat complex, and comes from the fact that someone in Npgsql's history decided to accept named parameter placeholders (@p) instead of positional ones ($1), and also to support batching by parsing the command's SQL for semicolons and splitting it into multiple batched statements at the wire protocol level. Neither of these is natively supported by PG, so Npgsql has to parse/rewrite in order to support them (here's a writeup).

Now, if the command has parameters, we check whether they're named or not (is DbParameter.ParameterName set). If they're unnamed, we take that as a signal that SQL parsing isn't required, i.e. positional parameters are being used. Since we already have a user gesture here (unset parameter name), we don't need an additional flag on NpgsqlCommand. Note that if you're using unnamed positional parameters, Npgsql also doesn't support semicolons for batching: you must instead use the DbBatch abstraction we introduced in Npgsql 6.0 (partially for this).
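
For completeness, a sketch of what batching looks like with DbBatch and positional parameters instead of semicolon splitting; the conn variable and the SQL are illustrative:

using Npgsql;

// Two statements sent in one roundtrip via DbBatch, with no SQL splitting.
// "conn" is assumed to be an open NpgsqlConnection.
using var batch = new NpgsqlBatch(conn);

var first = new NpgsqlBatchCommand("UPDATE world SET randomnumber = $1 WHERE id = $2");
first.Parameters.Add(new NpgsqlParameter { Value = 123 });
first.Parameters.Add(new NpgsqlParameter { Value = 1 });
batch.BatchCommands.Add(first);

var second = new NpgsqlBatchCommand("UPDATE world SET randomnumber = $1 WHERE id = $2");
second.Parameters.Add(new NpgsqlParameter { Value = 456 });
second.Parameters.Add(new NpgsqlParameter { Value = 2 });
batch.BatchCommands.Add(second);

await batch.ExecuteNonQueryAsync();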

The only corner case is when there are no parameters at all. For this case, there's still the problem of semicolons inside the SQL (batching) - we must parse since there's no user gesture here. We could in theory introduce a bool property on the command just to skip parsing/rewriting in the no-parameters case, but that seems really excessive... I'd rather we made EF (and Dapper) compatible with positional parameters and DbBatch (yet another thing on my list...)

For now we can probably have #if FORTUNES or similar to enable this AppContext switch only when running fortunes...
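
That could look something like the following; FORTUNES is a hypothetical compilation symbol defined only for the fortunes build, and the switch name is the same believed one as above:

#if FORTUNES
// Only the fortunes build skips SQL parsing/rewriting; the Dapper/EF paths
// elsewhere keep the default behaviour.
AppContext.SetSwitch("Npgsql.EnableSqlRewriting", false);
#endif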

(it's all been quite a long journey...)

@benaadams (Contributor Author)

The idea was that the parsed state would remain in the command if the command text didn't change, so reusing the command would skip the reparsing. Alas, that isn't what happens: it reparses each time, even though it's the same command object with unchanged command text.

@roji (Member) commented Mar 19, 2023

@benaadams right. Parsing/rewriting was already disabled in all benchmarks with parameters, since we switched to positional placeholders a while ago; TechEmpower/FrameworkBenchmarks#8005 (comment) does that for Fortunes as well. So I don't think we need to worry about that part any more.

There's another kind of "parsing" which happens every time: to look up the PostgreSQL prepared statement in an internal dictionary. We're planning to add a data-source level API for "globally-available" prepared statements, that would skip this step (npgsql/npgsql#4509).

In the meantime, we could in theory pool commands and assume that a command rented from the pool already has the correct SQL. That assumption would hold only in a single-statement benchmark, so it seems a bit unrealistic/problematic.

@NinoFloris (Contributor) commented Mar 19, 2023

@benaadams Any reuse that we might introduce on the DbDataSource will likely release all query-specific resources when the command is returned.
I would personally like to see ADO.NET support for concurrent executions on a DbDataSource/connectionless command, so that one could be stored in a static field (this would need new ExecuteReader methods accepting parameters as an argument and quite some rework in drivers, so it's not realistic any time soon).

If we really need fast pooling we could store relevant instances on the kestrel connection and pass them down.
