build 'discovery' ODB with auto-consistency for on-disk files #266

Byron · 2021-11-29T09:13:13Z

The DB spiked in the discovery has all strengths and seemingly no weaknesses (except for complexity). Let's build it to be able to automatically adapt to changes on disk and to handle server loads efficiently.

The radicle-link ODB shows what a fully Sync ODB can look like, and maybe the initial design can be simplified now that it's clear that a fully Sync design is very fast, too.

Put differently, it appears that thread-local caches can be used, but they probably only make sense for object buffers and pack caches, less so for accelerating object access.

Ideally, a quick version can be drafted to get benchmark results early and to see whether this is the right path at all.

What I wish for is more dynamic handling of open file handles. Ideally there is a trigger to release unused ones, and generally a way of handling file limits somewhat gracefully (at least on the server/caller level).
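To make the file-handle wish a bit more concrete, here is a minimal sketch of a pool that caps the number of open handles and drops the least recently used one when the cap is reached. All names (`HandlePool`, `get_or_open`, `release_all`) are hypothetical and not part of any existing crate; the handle type is generic so the idea can be tried without touching the file system.

```rust
use std::collections::VecDeque;

/// Hypothetical pool that keeps at most `max_open` handles alive.
/// The front of the queue is the least recently used entry.
pub struct HandlePool<H> {
    max_open: usize,
    open: VecDeque<(u32, H)>,
}

impl<H> HandlePool<H> {
    pub fn new(max_open: usize) -> Self {
        assert!(max_open > 0, "at least one handle must be allowed");
        HandlePool { max_open, open: VecDeque::new() }
    }

    /// Return the handle for `pack_id`, opening it with `open_fn` if needed.
    /// If the limit is reached, the least recently used handle is dropped first,
    /// which closes it when `H` is something like `std::fs::File`.
    pub fn get_or_open(&mut self, pack_id: u32, open_fn: impl FnOnce() -> H) -> &H {
        if let Some(pos) = self.open.iter().position(|(id, _)| *id == pack_id) {
            // Already open: move it to the back to mark it as recently used.
            let entry = self.open.remove(pos).expect("position is in bounds");
            self.open.push_back(entry);
        } else {
            if self.open.len() >= self.max_open {
                self.open.pop_front();
            }
            self.open.push_back((pack_id, open_fn()));
        }
        &self.open.back().expect("an entry was just pushed").1
    }

    /// The explicit trigger: release every handle that is currently held.
    pub fn release_all(&mut self) {
        self.open.clear();
    }

    /// Ids of the currently open handles, least recently used first.
    pub fn open_ids(&self) -> Vec<u32> {
        self.open.iter().map(|(id, _)| *id).collect()
    }
}
```

A server could call `release_all()` when it gets close to its file limit; whether that belongs in the store or in the caller is left open here.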

Tasks

- Refactor Find traits #267
- PoC of new ODB design #273
- docs
- make the above production ready
  - tests for parallel pack creation with changes made to the disk state while a pack is being created
  - auto-refresh of disk-state when packs are merged, new packs are added.
- delete odb-design experiment



Byron created this issue from a note in Collaboration Board (In progress) Nov 29, 2021

Byron added a commit that referenced this issue Nov 30, 2021

Add 'contains()' method to Find (#266)

dd386ce

The plan is to separate pack location entirely from Object and put the location specific functions into a separate trait.
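A rough sketch of what that separation could look like, assuming a simplified lookup trait with `contains()` and a second trait for anything pack-specific. The trait and type names below (`FindLocation`, `Location`, `MemoryStore`) are made up for illustration and are not the actual `git_odb` signatures.

```rust
use std::collections::HashMap;

/// A 20-byte object id, simplified for this sketch.
pub type ObjectId = [u8; 20];

/// Hypothetical, simplified lookup trait without any pack details.
pub trait Find {
    /// Returns true if the object exists, without decoding it.
    fn contains(&self, id: &ObjectId) -> bool;
    /// Copies the object's bytes into `buffer` and returns them, or `None` if missing.
    fn try_find<'a>(&self, id: &ObjectId, buffer: &'a mut Vec<u8>) -> Option<&'a [u8]>;
}

/// Where an object lives inside a pack (hypothetical field names).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub struct Location {
    pub pack_id: u32,
    pub pack_offset: u64,
}

/// Location-specific queries live in their own trait, so objects carry no pack details.
pub trait FindLocation: Find {
    fn location_by_oid(&self, id: &ObjectId) -> Option<Location>;
}

/// Toy in-memory store used only to exercise the traits.
#[derive(Default)]
pub struct MemoryStore {
    objects: HashMap<ObjectId, (Vec<u8>, Location)>,
}

impl MemoryStore {
    pub fn insert(&mut self, id: ObjectId, data: Vec<u8>, location: Location) {
        self.objects.insert(id, (data, location));
    }
}

impl Find for MemoryStore {
    fn contains(&self, id: &ObjectId) -> bool {
        self.objects.contains_key(id)
    }

    fn try_find<'a>(&self, id: &ObjectId, buffer: &'a mut Vec<u8>) -> Option<&'a [u8]> {
        let (data, _) = self.objects.get(id)?;
        buffer.clear();
        buffer.extend_from_slice(data);
        Some(buffer.as_slice())
    }
}

impl FindLocation for MemoryStore {
    fn location_by_oid(&self, id: &ObjectId) -> Option<Location> {
        self.objects.get(id).map(|(_, location)| *location)
    }
}
```

Callers that only need object data depend on `Find` alone, while pack-creation code can ask for `FindLocation`.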

Byron added a commit that referenced this issue Nov 30, 2021

feat: add Data object (#266)

e8f1912

This is typed data backed by a slice, for conversion into parsed `ObjectRef`s for example. This is usually the result of a `Find` operation on an object database.
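As an illustration of "typed data backed by a slice", the sketch below pairs an object kind with borrowed bytes and parses lazily. The `ObjectRef` enum here is a deliberately tiny stand-in (only blobs and the first line of a commit), and `decode` is an assumed method name rather than the crate's confirmed API.

```rust
/// The four git object kinds.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Kind {
    Tree,
    Blob,
    Commit,
    Tag,
}

/// Typed data backed by a borrowed slice; nothing is parsed until asked for.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub struct Data<'a> {
    pub kind: Kind,
    pub data: &'a [u8],
}

/// A parsed view that still borrows from the original buffer (hypothetical subset).
#[derive(Debug, PartialEq, Eq)]
pub enum ObjectRef<'a> {
    Blob(&'a [u8]),
    Commit { tree_hex: &'a [u8] },
}

impl<'a> Data<'a> {
    /// Convert into a parsed reference without copying any bytes.
    pub fn decode(&self) -> Option<ObjectRef<'a>> {
        let data: &'a [u8] = self.data;
        match self.kind {
            Kind::Blob => Some(ObjectRef::Blob(data)),
            Kind::Commit => {
                // A commit starts with a line of the form "tree <hex id>".
                let first_line = data.split(|b| *b == b'\n').next()?;
                let tree_hex = first_line.strip_prefix(b"tree ")?;
                Some(ObjectRef::Commit { tree_hex })
            }
            // Trees and tags are omitted in this sketch.
            Kind::Tree | Kind::Tag => None,
        }
    }
}
```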

Byron added a commit that referenced this issue Nov 30, 2021

feat: A simplified version of the Find trait (#266)

f788310

It's meant for the next generation of object db handles which keep a local cache of all the details of the actual object database.

Byron added a commit that referenced this issue Nov 30, 2021

refactor!: move git_pack::data::Object to git_object::Data, massively…

e22a710

… alter git_odb::Find trait (#266) This will break a lot, but has to happen to prepare these traits for the next generation of object databases.

Byron added a commit that referenced this issue Dec 1, 2021

Add 'contains()' method to Find (#266)

532e9e0

The plan is to separate pack location entirely from Object and put the location specific functions into a separate trait.

Byron added a commit that referenced this issue Dec 1, 2021

feat: add Data object (#266)

004120f

This is typed data backed by a slice, for conversion into parsed `ObjectRef`s for example. This is usually the result of a `Find` operation on an object database.

Byron added a commit that referenced this issue Dec 1, 2021

feat: A simplified version of the Find trait (#266)

c2ded9d

It's meant for the next generation of object db handles which keep a local cache of all the details of the actual object database.

Byron added a commit that referenced this issue Dec 1, 2021

refactor!: move git_pack::data::Object to git_object::Data, massively…

ccabe2f

… alter git_odb::Find trait (#266) This will break a lot, but has to happen to prepare these traits for the next generation of object databases.

Byron added a commit that referenced this issue Dec 1, 2021

fix docs (#266)

83e1845

Byron added a commit that referenced this issue Dec 1, 2021

feat: linked::Store sorts bundles by modification date, newest first (#…

b54a985

…266)
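The ordering itself is simple to sketch. The `Bundle` struct and `sort_newest_first` function below are hypothetical stand-ins, assuming each bundle records the modification time of its pack file.

```rust
use std::path::PathBuf;
use std::time::SystemTime;

/// Hypothetical stand-in for a pack bundle (index plus data file).
pub struct Bundle {
    pub path: PathBuf,
    pub modified: SystemTime,
}

/// Sort bundles by modification date, newest first.
pub fn sort_newest_first(bundles: &mut [Bundle]) {
    // Comparing `b` against `a` reverses the natural (oldest first) order.
    bundles.sort_by(|a, b| b.modified.cmp(&a.modified));
}
```

In real code the timestamp would presumably come from `std::fs::metadata(path)?.modified()`.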

Byron added a commit that referenced this issue Dec 1, 2021

Clarify that we really need stable pack ids (#266)

fab5fe7

Byron added a commit that referenced this issue Dec 1, 2021

refactor!: remove pack-cache from Find::try_find(…) (#266)

3bd9261

With the new architecture this can be an implementation detail without forcing it to be Sync.

Byron added a commit that referenced this issue Dec 2, 2021

Add 'contains()' method to Find (#266)

cc9df78

The plan is to separate pack location entirely from Object and put the location specific functions into a separate trait.

Byron added a commit that referenced this issue Dec 2, 2021

feat: add Data object (#266)

adfbcee

This is typed data backed by a slice, for conversion into parsed `ObjectRef`s for example. This is usually the result of a `Find` operation on an object database.

Byron added a commit that referenced this issue Dec 2, 2021

feat: A simplified version of the Find trait (#266)

fcc1c31

It's meant for the next generation of object db handles which keep a local cache of all the details of the actual object database.

Byron added a commit that referenced this issue Dec 2, 2021

refactor!: move git_pack::data::Object to git_object::Data, massively…

8f6ff71

… alter git_odb::Find trait (#266) This will break a lot, but has to happen to prepare these traits for the next generation of object databases.

Byron added a commit that referenced this issue Dec 2, 2021

fix docs (#266)

b557550

Byron added a commit that referenced this issue Dec 2, 2021

feat: linked::Store sorts bundles by modification date, newest first (#…

be124ce

…266)

Byron added a commit that referenced this issue Dec 2, 2021

Clarify that we really need stable pack ids (#266)

108c77b

Byron added a commit that referenced this issue Dec 2, 2021

refactor!: remove pack-cache from Find::try_find(…) (#266)

87930a1

With the new architecture this can be an implementation detail without forcing it to be Sync.

Byron added a commit that referenced this issue Dec 2, 2021

Inform when multi-threaded counting is rejected (#266)

bed0e92

Byron added a commit that referenced this issue Dec 2, 2021

Use Deref instead of Borrow in linked ODB iterator (#266)

0f8a9d1

Borrow is the manual form; Deref allows for more automatic use and more idiomatic-looking code.
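The difference shows up at the call site. In this sketch, `Store`, `bundle_count`, and the two helper functions are hypothetical; only `std::borrow::Borrow` and `std::ops::Deref` are real.

```rust
use std::borrow::Borrow;
use std::ops::Deref;

/// Stand-in for a store (hypothetical; a real store holds packs and loose objects).
pub struct Store {
    pub bundles: Vec<String>,
}

impl Store {
    pub fn bundle_count(&self) -> usize {
        self.bundles.len()
    }
}

/// With `Borrow`, the store has to be obtained explicitly before use.
pub fn count_via_borrow<S: Borrow<Store>>(store: S) -> usize {
    let store: &Store = store.borrow();
    store.bundle_count()
}

/// With `Deref`, method calls go through the smart pointer automatically.
pub fn count_via_deref<S: Deref<Target = Store>>(store: S) -> usize {
    store.bundle_count()
}
```

Both functions accept an `Arc<Store>` or an `Rc<Store>`; the `Deref` version also accepts a plain `&Store`.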

Byron added a commit that referenced this issue Dec 2, 2021

feat: add linked::Store::rc_iter() (#266)

565b7d9

For completeness in case of single-threaded operations

Byron added a commit that referenced this issue Dec 2, 2021

git-odb::Find implementation for linked::Store (#266)

11a4550

Byron added a commit that referenced this issue Dec 2, 2021

refactor (#266)

e357ba4

Byron added a commit that referenced this issue Dec 2, 2021

refactor (#266)

f43d985

Byron added a commit that referenced this issue Dec 3, 2021

Add 'contains()' method to Find (#266)

dfdd6fb

The plan is to separate pack location entirely from Object and put the location specific functions into a separate trait.

Byron added a commit that referenced this issue Dec 3, 2021

feat: add Data object (#266)

a0bb652

This is typed data backed by a slice, for conversion into parsed `ObjectRef`s for example. This is usually the result of a `Find` operation on an object database.

Byron added a commit that referenced this issue Dec 18, 2021

Make single-threaded programs possible to use with git-repository (#266)

dde5c6b

It's a bit tricky to use the right kind of handle and transform the Rc<Store> back into an Arc<Store>, but it works.

Byron added a commit that referenced this issue Dec 18, 2021

fix docs (#266)

360bf9d

Byron added a commit that referenced this issue Dec 18, 2021

minor improvements to module layout, docs (#266)

0364f48

Byron added a commit that referenced this issue Dec 18, 2021

change!: move loose::iter::Iter to loose::Iter (#266)

8bb5c9a

Byron added a commit that referenced this issue Dec 18, 2021

adapt to changes in git-odb (#266)

a44dd4b

Byron added a commit that referenced this issue Dec 18, 2021

dynamic store module cleanup (#266)

494772c

Byron added a commit that referenced this issue Dec 18, 2021

change!: move sink::Sink to the top-level exclusively (#266)

ab4e726

Byron added a commit that referenced this issue Dec 18, 2021

refactor (#266)

3da91ce

Byron added a commit that referenced this issue Dec 18, 2021

refactor (#266)

52a4dcd

Byron added a commit that referenced this issue Dec 18, 2021

refactor (#266)

b88f253

Byron added a commit that referenced this issue Dec 19, 2021

chore: update sha-1 dependency to 0.10 (#266)

361892c

Byron added a commit that referenced this issue Dec 19, 2021

upgrade git-ref's os_str_bytes crate to 6.0.0 (#266)

0cfba57

Byron added a commit that referenced this issue Dec 19, 2021

upgrade dashmap to latest version (#266)

52d4fe5

Byron added a commit that referenced this issue Dec 19, 2021

chore: upgrade all dependencies (#266)

a3caf39

Byron added a commit that referenced this issue Dec 19, 2021

chore: upgrade dependencies (#266)

322b290

Byron added a commit that referenced this issue Dec 19, 2021

chore: remove unused dependencies (#266)

c800fdd

Byron added a commit that referenced this issue Dec 19, 2021

upgrade dependencies (#266)

8adf0d8

Byron added a commit that referenced this issue Dec 19, 2021

upgrade dependencies (#266)

c301abe

Byron added a commit that referenced this issue Dec 19, 2021

Default handle refresh mode is the least surprising, with option to c…

1b74c14

…onfigure (#266)

Byron added a commit that referenced this issue Dec 19, 2021

Revert "chore: upgrade all dependencies (#266)"

0dfe4a7

This reverts commit a3caf39.

Byron added a commit that referenced this issue Dec 19, 2021

upgrade dependencies except for crates-index (#266)

c77c0d6

The latter needs the notion of the index not existing, and shouldn't fetch it by default, or else each sandbox run fetches the entire index.

Byron added a commit that referenced this issue Dec 19, 2021

feat!: upgrade to crates-index 0.18 (#266)

15e60b2

It now assumes that the crates-index must exist, which might not always be the case, and rightfully so. Now we wrap it to get back to the original behavior.

Byron added a commit that referenced this issue Dec 19, 2021

docs for dynamic object store (#266)

2c2a2e9

Byron added a commit that referenced this issue Dec 19, 2021

refactor (#266)

c499843

Byron added a commit that referenced this issue Dec 19, 2021

a failing test to show the handle-stability doesn't quite work yet (#266

5562e88

)

Byron added a commit that referenced this issue Dec 19, 2021

assure stable handles can actually access the indices they need (#266)

9474a43

This works by bypassing the central index, which doesn't know about the garbaged indices anymore, to obtain the indices directly.
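One way to picture this, as a sketch only: a stable handle keeps its own reference-counted snapshot of the indices, so they stay reachable after the central list has moved on. This is an assumed mechanism for illustration (the actual store uses a slotmap), and `Store`, `StableHandle`, and all method names are hypothetical.

```rust
use std::sync::{Arc, RwLock};

/// Stand-in for a loaded pack index.
pub struct Index {
    pub name: String,
}

/// Central state: the list of indices currently known to be on disk.
#[derive(Default)]
pub struct Store {
    indices: RwLock<Vec<Arc<Index>>>,
}

/// A handle that must keep seeing the indices it started with.
pub struct StableHandle {
    snapshot: Vec<Arc<Index>>,
}

impl Store {
    pub fn add_index(&self, name: &str) {
        let index = Arc::new(Index { name: name.to_string() });
        self.indices.write().unwrap().push(index);
    }

    /// Simulate a refresh after a repack: the central list forgets the old indices.
    pub fn replace_all(&self, names: &[&str]) {
        let fresh: Vec<Arc<Index>> = names
            .iter()
            .map(|name| Arc::new(Index { name: name.to_string() }))
            .collect();
        *self.indices.write().unwrap() = fresh;
    }

    /// Clone the `Arc`s so the handle owns references to the indices directly.
    pub fn stable_handle(&self) -> StableHandle {
        StableHandle { snapshot: self.indices.read().unwrap().clone() }
    }

    pub fn current_names(&self) -> Vec<String> {
        self.indices.read().unwrap().iter().map(|index| index.name.clone()).collect()
    }
}

impl StableHandle {
    /// Reads from the handle's own snapshot, bypassing the central list.
    pub fn index_names(&self) -> Vec<&str> {
        self.snapshot.iter().map(|index| index.name.as_str()).collect()
    }
}
```

The old indices are freed once the last handle holding them is dropped.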

Byron added a commit that referenced this issue Dec 19, 2021

More explicit information about how much garbaged is in the slotmap (#…

cfd36ee

…266)

Byron added a commit that referenced this issue Dec 19, 2021

delete now unused experiment (#266)

c44f72e

Byron closed this as completed Dec 19, 2021

Collaboration Board automation moved this from In progress to Done Dec 19, 2021
