Skip to content

Adding extra metadata. Postprocessing Filter or Dispatcher? #307

Answered by cerveada
diegoeche asked this question in Q&A
Discussion options

You must be logged in to vote

LineageDispatcher

The goal of the dispatcher is to send the lineage data to the server. Currently we support kafka dispatcher, http dispatcher and some others for debuging and testing like logging dispatcher and console dispatcher. Typically you would implement your own dispatcher when you need another method of sending data like RabbitMq or store the data to S3 for some reason.

PostProcesingFilter

The goal of the filter is to change the lineage data before it is sent to the dispatcher. This might mean filtering out some sensitive data or adding additional information. Filters have access to the the original Spark LogicalPlan and the SparkSession as means of metadata.

UserExtraMetaDataPro…

Replies: 5 comments 6 replies

Comment options

You must be logged in to vote
1 reply
@zacayd
Comment options

Answer selected by diegoeche
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
5 replies
@cerveada
Comment options

@zacayd
Comment options

@wajda
Comment options

@zacayd
Comment options

@wajda
Comment options

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
4 participants