Skip to content

๐Ÿ“Š Results from the reproducibility and benchmarking studies presented in "Bringing Light Into the Dark: A Large-scale Evaluation of Knowledge Graph Embedding Models Under a Unified Framework" (http://arxiv.org/abs/2006.13365)

License

Notifications You must be signed in to change notification settings

pykeen/benchmarking

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

PyKEEN Benchmarking Results

This repository contains the results from the reproducibility and benchmarking studies described in

Bringing Light Into the Dark: A Large-scale Evaluation of Knowledge Graph Embedding Models Under a Unified Framework.
Ali, M., Berrendorf, M., Hoyt, C. T., Vermue, L., Galkin, M., Sharifzadeh, S., Fischer, A., Tresp, V., & Lehmann, J. (2020).
arXiv, 2006.13365.

This repository itself is archived on Zenodo at DOI.

Reproducibility Study

In this study, we use the KGEMs reimplemented in PyKEEN and the authors' best reported hyper-parameters to make reproductions of past experiments. The experimental artifacts from the reproducibility study can be found here.

Benchmarking (Ablation) Study

In this study, we conduct a large number of hyper-parameter optimizations to investigate the effects of certain aspects of models (training assumption, loss function, regularizer, optimizer, negative sampling strategy, HPO methodology, training strategy). The experimental artifacts from the ablation study can be found here.

We provide an additional tool to search through these configurations at ablation/search.py, by finding configurations with optimal validation H@10 for a number of different queries. You can also run this script without full installation, as long as click and pandas are available. General usage information can be obtained by python3 ablation/search.py --help. Moreover, here are a few examples:

  • the overall best configuration
python3 ablation/search.py
  • the best configuration for the dataset FB15k-237
python3 ablation/search.py --dataset fb15k237
  • the best configuration for the dataset FB15k-237 using the distmult model
python3 ablation/search.py --dataset fb15k237 --model distmult
  • the top-3 configuration for the dataset WN18-RR using LCWA training loop
python3 ablation/search.py --dataset wn18rr --training-loop lcwa --at-most 3

Regeneration of Charts

All configuration for installation of relevant code, collation of results, and generation of charts is included in the tox.ini configuration that can be run with:

$ pip install tox
$ tox

About

๐Ÿ“Š Results from the reproducibility and benchmarking studies presented in "Bringing Light Into the Dark: A Large-scale Evaluation of Knowledge Graph Embedding Models Under a Unified Framework" (http://arxiv.org/abs/2006.13365)

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published