
Implement secure boost scheme - secure evaluation and validation (during training) without local feature leakage #10079

Open
wants to merge 38 commits into base: vertical-federated-learning

Conversation


@ZiyueXu77 ZiyueXu77 commented Feb 27, 2024

For implementing Vertical Federated Learning with Secure Features, as discussed in
#9987
This part is independent of the encryption work and the alternative vertical pipeline. Its purpose is to avoid leaking the real cut-value information from participants; hence it is added as a separate PR.
This PR is based on #10037, which should be reviewed and merged first.

ZiyueXu77 and others added 29 commits January 31, 2024 10:48
…lobal best split, but need to further apply split correctly
@ZiyueXu77 ZiyueXu77 marked this pull request as draft March 4, 2024 16:07
@ZiyueXu77 ZiyueXu77 marked this pull request as ready for review March 4, 2024 19:00
@ZiyueXu77 (Author)

Hi @trivialfis , the method implementation for secure inference is ready. I added detailed information to our RFC under the section "Design for Secure Inference - avoid leakage of feature cut value". @YuanTingHsieh will add / modify the unit tests. Thanks!

@ZiyueXu77 ZiyueXu77 changed the title Implement secure boost scheme - secure inference without local feature leakage Implement secure boost scheme - secure evaluation and validation without local feature leakage Mar 5, 2024
@ZiyueXu77 ZiyueXu77 changed the title Implement secure boost scheme - secure evaluation and validation without local feature leakage Implement secure boost scheme - secure evaluation and validation (during training) without local feature leakage Mar 5, 2024
@@ -445,29 +449,27 @@ void SketchContainerImpl<WQSketch>::MakeCuts(Context const *ctx, MetaInfo const
max_cat = std::max(max_cat, AddCategories(categories_.at(fid), p_cuts));
Member
Based on my understanding, categorical features are not yet supported right?

@ZiyueXu77 (Author) Mar 5, 2024

Right, we will need to find a proper use case / testing data with categorical features to add the support. Categorical features seem to be "experimental" according to some of last year's release notes; is that still the case? Maybe we can add the support later when we find it really necessary.

src/common/quantile.cc (outdated; resolved)
src/tree/common_row_partitioner.h (outdated; resolved)
src/tree/common_row_partitioner.h (outdated; resolved)
Comment on lines +307 to 315
if (!is_secure_) {
split_pt = cut_val[i]; // not used for partition based
best.Update(loss_chg, fidx, split_pt, d_step == -1, false, left_sum, right_sum);
} else {
// secure mode: record the best split point, rather than the actual value
// since it is not accessible at this point (active party finding best-split)
best.Update(loss_chg, fidx, i, d_step == -1, false, left_sum, right_sum);
}
} else {
@trivialfis (Member) Mar 5, 2024

Do you think a policy class might help here? Or maybe there are other efficient ways to handle these conditions? I'm losing track of these conditions, considering that we have three enumeration functions:

  • numeric
  • partition
  • one hot

Then we have three split modes:

  • column
  • row
  • column + secure

So, in combination, there are 9 potential cases, and we haven't counted vector leaf yet. We need to find a better way to manage these conditions.

@ZiyueXu77 (Author) Mar 5, 2024

It can be tricky to consolidate, since the 9 cases overlap heavily (e.g. the same enumeration logic applies to all split modes except secure + passive party), and some further processing applies only to col_split (with or without secure) but is irrelevant to enumeration.

@ZiyueXu77 (Author)

Another thing regarding these mode combinations: with the upcoming processor interface we will potentially be able to enable encrypted horizontal training. Shall we add a row + secure mode, i.e. a 4th entry to
enum class DataSplitMode : int { kRow = 0, kCol = 1, kColSecure = 2 };? (Or maybe there are better options?)

Member

My preference would be to put whether the channel is encrypted in the CommunicatorContext configuration.

@trivialfis (Member)

Hi, could you please share how to run some high level tests?

@ZiyueXu77 (Author)

Hi, could you please share how to run some high level tests?

Sure, this is what I am using for testing:
https://github.com/ZiyueXu77/NVFlare/tree/secureboost/examples/advanced/xgboost_secure
I will share the data link with you.

@ZiyueXu77 (Author)

Another general challenge for any vertical pipeline: at inference time, all parties need to be online, and since our model records the "global feature index", the "order" of the clients needs to remain the same. We may need some mechanism to ensure this order.

@trivialfis (Member)

the "order" of the clients need to remain the same. We may need some mechanisms to ensure this order.

I will leave that to nvflare.

@trivialfis (Member) left a comment

The code looks good to me overall. We can merge it once we have some basic unittests.

As for integration tests in Python with nvflare (in future PRs), we can assert that

  • models are different for different workers
  • predictions are the same
  • evaluation results are the same
  • the pipeline only works if the 0th worker has the label

I highly recommend using the hypothesis test framework (see python tests in xgboost and search the term hypothesis).

@ZiyueXu77 (Author)


Thanks! @YuanTingHsieh, could you add the unit tests according to @trivialfis's suggestions?

@trivialfis (Member)

could you add the unit tests according to @trivialfis 's suggestions

Those points are all for integration tests, not for small unit tests. I think the integration tests in Python with nvflare will take more effort; we don't need to rush them in this PR.

@trivialfis (Member)

Hi, is there any update?

@ZiyueXu77 (Author)

Hi, is there any update?

Thanks for asking! :) @YuanTingHsieh has been busy with a related NVFlare release for the past two weeks; now that the release is close to finished, he will have time to work on this soon.

@ZiyueXu77 (Author) commented Apr 15, 2024

@trivialfis Yuanting just added some unit tests. There is a failed R test, but I am not sure whether it is related to our modifications; the error message is:

* checking package namespace information ... OK
* checking package dependencies ... ERROR
Packages suggested but not available: 'ggplot2', 'DiagrammeR', 'igraph'
.........
Ncpus: 4
Traceback (most recent call last):
  File "/__w/xgboost/xgboost/tests/ci_build/test_r_package.py", line 359, in <module>
    main(args)
  File "/__w/xgboost/xgboost/tests/ci_build/test_utils.py", line 52, in inner
    r = func(*args, **kwargs)
        ^^^^^^^^^^^^^^^^^^^^^
  File "/__w/xgboost/xgboost/tests/ci_build/test_r_package.py", line 307, in main
    check_rpackage(tarball)
  File "/__w/xgboost/xgboost/tests/ci_build/test_utils.py", line 31, in inner
    return func(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^
  File "/__w/xgboost/xgboost/tests/ci_build/test_utils.py", line 52, in inner
    r = func(*args, **kwargs)
        ^^^^^^^^^^^^^^^^^^^^^
  File "/__w/xgboost/xgboost/tests/ci_build/test_r_package.py", line 166, in check_rpackage
    with open(rcheck_dir / "00install.out", "r") as fd:
         ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
FileNotFoundError: [Errno 2] No such file or directory: 'xgboost.Rcheck/00install.out'

@trivialfis (Member)

That should be unrelated; I will look into this PR today.

@ZiyueXu77 (Author)

Hi @trivialfis , thanks for the updates, just merged it.
Everything passed except an error for R regarding "Matrix":
2024-04-29T13:40:51.6394593Z make: *** [Makefile:291: Matrix.ts] Error 1

@trivialfis (Member)

Triggered the rest of the CI.

@ZiyueXu77 (Author)

Hi @trivialfis , there are 3 failed checks, but I think they align with the rebase merge. Shall we just merge this? Thanks!

3 participants