rcap107/README.md

Hello! I'm Riccardo Cappuzzo (@rcap107). I am a postdoctoral researcher at Inria Saclay and Dataiku. I work on wrangling tabular data so that it can be used for machine learning tasks.

I work mostly with Python and its data science libraries (PyTorch, pandas, NumPy, scikit-learn, matplotlib, seaborn).

I am interested in word embeddings, graph embeddings, graph neural networks and how they can be applied to data curation tasks.

I have implemented EmbDI, a data integration system based on tabular embeddings.

Pinned

  1. embdi (Public)

    EmbDI is a table embeddings algorithm that solves data integration problems by converting tabular data into graphs, then applying word2vec to the graph to obtain embeddings.

    Jupyter Notebook 4 2

  2. retrieve-merge-predict (Public)

    Jupyter Notebook 3 2
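
The EmbDI description above (table to graph, then word2vec over the graph) can be illustrated with a minimal sketch. This is not the EmbDI code: the node naming (`row__`, `col__`, `val__` prefixes), the uniform random walks, and the helper names `table_to_graph` and `random_walks` are all assumptions made for illustration. The sketch uses only the standard library and stops at the walk-generation step; the walks play the role of "sentences" that a word2vec implementation would consume.

```python
import random
from collections import defaultdict


def table_to_graph(rows):
    """Link every cell value to the row and the column it appears in.

    Hypothetical helper: node prefixes are an illustrative convention,
    not taken from the EmbDI repository.
    """
    graph = defaultdict(set)
    for i, row in enumerate(rows):
        row_node = f"row__{i}"
        for column, value in row.items():
            col_node = f"col__{column}"
            val_node = f"val__{value}"
            # Undirected edges: value <-> row and value <-> column.
            graph[row_node].add(val_node)
            graph[val_node].add(row_node)
            graph[col_node].add(val_node)
            graph[val_node].add(col_node)
    # Sorted neighbor lists keep the walks reproducible for a fixed seed.
    return {node: sorted(neighbors) for node, neighbors in graph.items()}


def random_walks(graph, walks_per_node=5, walk_length=8, seed=0):
    """Generate uniform random walks; each walk is a list of node names."""
    rng = random.Random(seed)
    walks = []
    for start in sorted(graph):
        for _ in range(walks_per_node):
            walk = [start]
            while len(walk) < walk_length:
                walk.append(rng.choice(graph[walk[-1]]))
            walks.append(walk)
    return walks


# Toy table: two rows that share the value "Paris".
rows = [
    {"name": "Alice", "city": "Paris"},
    {"name": "Bob", "city": "Paris"},
]
graph = table_to_graph(rows)
walks = random_walks(graph)
```

Under these assumptions, the walks could then be passed to a word2vec implementation, for example gensim's `Word2Vec(sentences=walks, vector_size=..., window=..., min_count=1)`, to obtain one embedding per row, column, and value node. That step is left out here to keep the sketch dependency-free.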