SQLite migration #150

jsign · 2022-06-27T13:30:30Z

This PR migrates from using Postgres to SQLite.

In this PR description, I’ll list a high-level overview of the changes, for more details see the PR comments.

sqlc and migrations tools

sqlc doesn’t support sqlite so unfortunately, we can’t rely on it for automatically generating models and queries. There’s ongoing work in that library to add support but isn’t there yet. When that happens, we could think of moving again to sqlc for convenience.
What I did regarding this was:
1- Switching in the sqlc config to use the database/sql generic driver instead of the Postgres one we were using.
2- Autogenerate again the queries so generated statements and models depend on database/sql types.
3- Remove sqlc from our project.

This removed many postgres dependencies from our generated SQL code. Obviously, the generated code wasn’t working since we had queries with Postgres syntax and similar, but was a good clean-up before removing sqlc.

Regarding migrations, we still use golang-migrate for migrations. Obviously, I had to start from scratch in migrations since our history of migrations was full of Postgres-syntax changes which won’t work in SQLite. So, we started again from 001_ which is also nice to have a fresh starting point with our current schema.

Validator configuration

Regarding the validator configuration flags, two important changes.

Changes in flags:

All the flags related to database connection are gone since there’s no user, password, or URL.
The AdminAPI section is gone. The admin user or admin password wasn’t used since we removed whitelisting, so made no sense to still have this configuration which can add confusion to configuring the validator. If we ever need them again, it’s totally fine to include them again.
The Impl attribute was removed since this was to enable a mocked Mesa service that made sense some months ago.
The EventFeed.MaxBlocksFetchSize was removed since I made a change to dynamically find the right value. Coming up with a good configuration number manually/statically was complicated. This allows cold syncing to be faster to consume the history of the chain that didn’t have events as fast as possible, and later adjust the maximum response size when events starts appearing.

There’s a new --dir flag that indicates the state directory of the validator, by default ${HOME}/.tableland which:

Contains the config.json file, instead of forcing the validator to find it in the same path of executing the validator as it was now.
Will have the SQLite database.

In a nutshell, all the state of the validator will be in the --dir folder.

SQLite setup

Regarding SQLite3 drivers, after digging around with potential SQLite3 drivers, we decided to go with mattn/sqlite3 since looks, like it’s the best, maintained one and battle-tested. There are some other CGO and non-CGO options but we valued more maturity.

For the connection URI we use file://%s?_busy_timeout=5000&_foreign_keys=on&_journal_mode=WAL which I’ll explain:

_busy_timeout is a configuration to manage how long are we willing to wait for a write-access to the database. Recall that SQLite only allows a single-writer; so if there’s already a writer doing actions in the database, a new writer can get a database is locked error. The _busy_timeout allows to wait up to X milliseconds before returning the error. This configuration is only to avoid potential error noise; the application should always handle the error.
_foreign_keys=on simply enables foreign key checking since it’s disabled by default.
_journal_mode=WAL is a new-ish way to run the database which uses a WAL instead of journaling to handle transactions. WAL mode allows for having multiple readers and a single writer at the same time, compared with the default journal mode which a writer blocks excludes any reader. In WAL mode all readers have point-in-time isolation. To understand more about WAL mode, see the docs.

ACL representation in the DB

In the Postgres implementation, we leveraged support for array-type columns and UNNEST() to do some magic of adding or removing privileges in a single-roundtrip.

Since SQLite3 doesn’t have array types we can’t use what we have. Since I wanted to also do the addition or removal of privileges in a single roundtrip, I represented privileges as an INTEGER and used bitwise operations (which are supported in SQLite3) to add and remove privileges in one roundtrip.

See the PR comments for more details.

SQL result to JSON

This logic was greatly simplified since I noticed that the way Scan(..) worked if we gave *interface{} returned the same values used by the underlying driver, and in mattn/sqlite3 case that were exactly the things we expected. More on this in PR comments.

Nested txns

In our event processing pipeline, we use nested txns to do a crash-resistant execution of block-level events which:

Can fail independently, without forcing a rollback on other successful events in the same block
while running everything in block-level txn, so if the validator crashes at any point in time, the database is in a correct state.

The database/sql driver doesn’t have support for nested txns as the previous Postgres driver had. SQLite3 has support for nested txns, so I had to do a helper function to manually run SAVEPOINT . From the perspective of our logic, we do exactly the same but use this new method to wrap whatever logic we want to do in the nested txns.

More about this in PR comments.

Detecting query vs infrastructure errors in the processor

The mattn/sqlite3 driver has a two-level grouping of SQL errors, which is very convenient for what we wanted to do in distinguishing user-related errors vs infrastructure ones. More on this in this PR comment

Tests

Now the tests use in-memory SQLite databases. Locally I’ve noticed a 4x (13s vs ~55s) speed improvement in running tests because of that, plus some other tests parallelization work I’ve done in some places.

In CI, I still see some flakiness probably because now more tests can run in parallel which may cause some memory problems. Since I’m tired of that, I simply made the CI action run the tests up to 3 times. If there are some flaky tests, the second run will have 99% of the tests run instantly due to caching and only retry whatever tests might be failed. It’s a pretty non-invasive change, and it will save us (hopefully) many headaches. This approach sounds better than simply limiting the parallelization run of the tests in CI, which is simply accepting to run things slower.

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

…ID from int64 Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

…sted txns Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

jsign · 2022-06-27T13:35:33Z

.golangci.yml

@@ -14,12 +14,16 @@ linters:
    - whitespace
    - godot
    - lll
+    - sqlclosecheck


Added this extra linter that sounds useful.

jsign · 2022-06-27T13:36:38Z

.golangci.yml

+  skip-dirs:
+    - "pkg/sqlstore/impl/system/internal/db"


The .go files here are where the previous sqlc files existed, which had a top comment saying that the code was automatically generated, which by default the linters ignore. Now that we don't use sqlic, those files aren't autogenerated, so just avoid lints there as usual.

Makefile

jsign · 2022-06-27T13:48:10Z

cmd/api/config.go

-func setupConfig() *config {
-	fileBytes, err := os.ReadFile(configFilename)
-	fileStr := string(fileBytes)
+func setupConfig() (*config, string) {


Some changes here to support the new --dir flag.
It loads the config.json from the directory folder, and also returns the parsed directory folder path so in main.go we can read the database file from there.

jsign · 2022-06-27T13:48:45Z

cmd/api/config.go

+	flagDirPath := flag.String("dir", "${HOME}/.tableland", "Directory where the configuration and DB exist")
+	flag.Parse()
+	if flagDirPath == nil {
+		log.Fatal().Msg("--dir is null")
+		return nil, "" // Helping the linter know the next line is safe.
+	}
+	dirPath := os.ExpandEnv(*flagDirPath)
+
+	_ = os.MkdirAll(dirPath, 0755)


Read the --dir flag via the flag package, and expand env vars.
If the folder doesn't exist, create it.

jsign · 2022-06-27T18:58:21Z

pkg/txn/impl/processor.go

+		`INSERT INTO system_acl ("chain_id","table_id","controller","privileges","created_at")
+		 VALUES (?1, ?2, ?3, ?4, ?5)
+		 ON CONFLICT (chain_id,table_id,controller)
+		 DO UPDATE SET privileges = privileges | ?4, updated_at = ?5`,


We OR the current value in the table with the mask we generated. This will activate the desired bits thus adding the wanted privileges.

jsign · 2022-06-27T18:59:39Z

pkg/txn/impl/processor.go

-		privileges,
-	); err != nil {
+		privilegesMask,
+		time.Now().Unix()); err != nil {


Note here that we use time.Now().Unix() to be used in created_at in case of an insert, but the real intention is to use it in updated_at in the UPDATE case. For other cases of created_at in this file we don't need time.Now().Unix() since we have a default value. But for updated_at, we need an explicit one.

jsign · 2022-06-27T19:00:42Z

pkg/txn/impl/processor.go

+	// Tune the mask to have a 0 in the places we want to disable the bit.
+	// For example, if we want to remove tableland.PrivUpdate, the following
+	// code will transform 111 -> 101.
+	// We'll then use 101 to AND the value in the DB.
+	for _, privilege := range privileges {
+		switch privilege {
+		case tableland.PrivInsert:
+			privilegesMask &^= tableland.PrivInsert.Bitfield
+		case tableland.PrivUpdate:
+			privilegesMask &^= tableland.PrivUpdate.Bitfield
+		case tableland.PrivDelete:
+			privilegesMask &^= tableland.PrivDelete.Bitfield
+		default:
+			return fmt.Errorf("unknown privilege: %s", privilege.Abbreviation)
+		}
 	}


Here we're in executeRevokePrivilegesTxn, so the opposite case of the described one above.
To do this we do the opposite. We start with a full mask of all bits enabled (L796), and we set to 0 the desired privileges we want to disable.

Some docs to explain what &^ is just in case:

The &^ operator is bit clear (AND NOT): in the expression z = x &^ y, each bit of z is 0 if the corresponding bit of y is 1; otherwise it equals the corresponding bit of x.

Exactly what we needed.

jsign · 2022-06-27T19:01:11Z

pkg/txn/impl/processor.go

-				}
+	if _, err := tx.ExecContext(ctx,
+		`UPDATE system_acl 
+	     SET privileges = privileges & ?4, updated_at = ?5


Then we do an AND which will keep all the potentially existing privileges, but it will disable the ones that we wanted (since there's a 0 in those bit positions).

jsign · 2022-06-27T19:11:22Z

pkg/txn/impl/processor.go

+// isErrCausedByQuery detects if the query execution failed because of possibly expected
+// bad queries from users. If that's the case the call might want to accept the failure
+// as an expected event in the flow.
+func isErrCausedByQuery(err error) (string, bool) {
+	// This array contains all the sqlite errors that should be query related.
+	// e.g: inserting a column with the wrong type, some function call failing, etc.
+	// All these errors must be errors that will always happen if the query is retried.
+	// (e.g: a timeout error isn't the querys fault, but an infrastructure problem)
+	//
+	// Each error in sqlite3 has an "Error Code" and an "Extended error code".
+	// e.g: a FK violation has "Error Code" 19 (ErrConstraint) and
+	// "Extended error code" 787 (SQLITE_CONSTRAINT_FOREIGNKEY).
+	// The complete list of extended errors is found in: https://www.sqlite.org/rescode.html
+	// In this logic if we use "Error Code", with some few cases, we can detect a wide range of errors without
+	// being so exhaustive dealing with "Extended error codes".
+	//
+	// sqlite3ExecutionErrors is probably missing values, but we'll keep discovering and adding them.
+	sqlite3ExecutionErrors := []sqlite3.ErrNo{
+		sqlite3.ErrError,      /* SQL error or missing database */
+		sqlite3.ErrConstraint, /* Abort due to constraint violation */
+		sqlite3.ErrTooBig,     /* String or BLOB exceeds size limit */
+		sqlite3.ErrMismatch,   /* Data type mismatch */
+	}
+	var sqlErr sqlite3.Error
+	if errors.As(err, &sqlErr) {
+		for _, ee := range sqlite3ExecutionErrors {
+			if sqlErr.Code == ee {
+				return sqlErr.Error(), true
+			}
+		}
+	}
+	return "", false
+}


OK, I went pretty long in the comments so they already explain the gist.

The good part is that I think we already have all the values we need here, compared to the Postgres case.
In the sqlite driver, the error codes have two levels: an Error code which is pretty general like "error in constraints", and an Extended error code that gives more detail on which kind of constraint error (FK?, CHECK?, etc).

The good part is that the Error code level contains few values, see here. I think the ones I included here should cover all scenarios which are nice.

We'll see with time... in the worst case, we can have a long list of extended errors for some cases, which would be more similar to how the Postgres detection worked. But that style is kind of endless waiting for finding the next unknown error, so pretty bullish on this.

awesome 👏

sanderpick · 2022-06-27T19:42:18Z

Thanks for the great summary 🙌

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

pkg/parsing/impl/validator.go

internal/tableland/impl/mesa_test.go

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

pkg/txn/impl/processor_test.go

pkg/sqlstore/impl/pgx_system_store_instrumented.go

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

brunocalza

Great work! Very excited to see this change coming true. Makes the whole architecture much simpler. 👏👏

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

jsign added 30 commits June 27, 2022 10:27

Makefile: remove sqlc

415d668

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

sqlc: migrate to database/sql style

0c8e560

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

sqlstore: work on json transformations

560695a

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

refactor migrations

ad3eb0e

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

Makefile: remove sqlc rule

13dccb7

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

mod: add sqlite3 dependency

fe8db98

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

internal/tableland: move from pgx to sql and add new helper for Table…

330b511

…ID from int64 Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

pkg/sqlstore: fix some models, queries and DTO transformations

b38a865

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

txnprocessor: start doing sqlite3 migration

8545ced

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

processor: receipts table semi-migration

ee7a32e

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

processor: fix queries, sql runtime error detection, and remaining ne…

058db28

…sted txns Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

eventprocessor: fix tests for sqlite3

7d9bbc1

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

sqlstore: fix impl and tests

615a82d

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

noncetracker: sqlite3 fixes and tests

d716a1a

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

general signatures modifications

eb00877

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

postgres: keep removing dependencies

84b0eaa

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

mod: update to latest mattn/sqlite3

54e3eff

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

mesa: integration tests fixes

60c9b47

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

paralellize more tests

e36dbf0

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

processor: fix update_at valeus

d004214

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

row to json marshaling work & make tests run faster

150226e

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

processor: rename ctids to rowids

1a66b31

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

eventprocessor: add IndexInBlock feature

bfcc349

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

db: create table indexes and unique constraints

de57f19

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

privileges: improve ergonomics of bitfields

1616acc

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

.

39d667e

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

create base directory & event feed optimizations

3ab73bc

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

lint pass

ad7f4a8

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

tools: remove postgres tool

989c898

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

update deps & remove deleted field in configs

11fff11

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

jsign added 2 commits June 27, 2022 10:27

processor: use mutating statement operation to distinguish inserts

c8a08f7

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

cmd: remove unneeded configs

251be4d

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

jsign self-assigned this Jun 27, 2022

jsign changed the title ~~Sqlite migration~~ SQLite migration Jun 27, 2022

jsign force-pushed the jsign/sqlitemigr branch 3 times, most recently from 137e117 to df19b56 Compare June 27, 2022 15:05

cleanups

66ee09d

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

jsign force-pushed the jsign/sqlitemigr branch from df19b56 to 66ee09d Compare June 27, 2022 15:13

jsign added 4 commits June 27, 2022 14:13

ci: test retries

a1ba600

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

Makefile: enable race flag

3c20da5

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

sql: simplify created_at default values

e2cca3e

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

tests: use ElementsMatch to ignore ordering

3e9a90e

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

jsign commented Jun 27, 2022

View reviewed changes

jsign requested a review from brunocalza June 27, 2022 19:19

jsign marked this pull request as ready for review June 27, 2022 19:19

Makefile: remove sqlc var definition

52204ea

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

brunocalza reviewed Jun 28, 2022

View reviewed changes

pkg/parsing/impl/validator.go Outdated Show resolved Hide resolved

brunocalza reviewed Jun 28, 2022

View reviewed changes

internal/tableland/impl/mesa_test.go Outdated Show resolved Hide resolved

comment typo

36c0073

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

brunocalza reviewed Jun 28, 2022

View reviewed changes

pkg/txn/impl/processor_test.go Show resolved Hide resolved

brunocalza reviewed Jun 28, 2022

View reviewed changes

pkg/sqlstore/impl/pgx_system_store_instrumented.go Show resolved Hide resolved

jsign added 2 commits June 28, 2022 11:47

tests: use ElementsMatch in more places

2ee5c71

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

rename files

010d3f8

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

brunocalza approved these changes Jun 28, 2022

View reviewed changes

test: avoid shadowing and use high-order func

c98074b

Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>

jsign merged commit bdfa368 into sqlite Jun 28, 2022

jsign deleted the jsign/sqlitemigr branch July 7, 2022 13:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SQLite migration #150

SQLite migration #150

jsign commented Jun 27, 2022 •

edited

jsign Jun 27, 2022

jsign Jun 27, 2022

jsign Jun 27, 2022

jsign Jun 27, 2022

jsign Jun 27, 2022

jsign Jun 27, 2022

jsign Jun 27, 2022

jsign Jun 27, 2022

jsign Jun 27, 2022

brunocalza Jun 28, 2022

sanderpick commented Jun 27, 2022

brunocalza left a comment

SQLite migration #150

SQLite migration #150

Conversation

jsign commented Jun 27, 2022 • edited

sqlc and migrations tools

Validator configuration

SQLite setup

ACL representation in the DB

SQL result to JSON

Nested txns

Detecting query vs infrastructure errors in the processor

Tests

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

sanderpick commented Jun 27, 2022

brunocalza left a comment

Choose a reason for hiding this comment

jsign commented Jun 27, 2022 •

edited