Lightning IR

Your one-stop shop for fine-tuning and running neural ranking models.

Lightning IR is a library for fine-tuning and running neural ranking models. It is built on top of Lightning to provide a simple and flexible interface to interact with neural ranking models.

Two types of models are supported: cross-encoders and bi-encoders. Cross-encoders are models that encode a query and a document together (monoBERT, monoT5, RankT5, etc.), while bi-encoders encode queries and documents separately (DPR, ColBERT, SPLADE, etc.).

Both types of models are usually trained on the same types of data: triples of queries, positive documents, and negative documents. Therefore, the library provides a unified interface to fine-tune and run both types of models. See the Fine-tuning section for more details.

Regarding inference, since bi-encoders encode queries and documents separately, they can be used to index documents and search for relevant documents. Lightning IR provides a simple interface for indexing and searching with bi-encoders. See the Indexing and Searching sections for more details. Cross-encoders, on the other hand, encode queries and documents together, making them only suitable for re-ranking. Lightning IR provides a simple interface for re-ranking with cross-encoders and bi-encoders. See the Re-ranking section for more details.

Installation

We're currently in the process of setting up the package on PyPI. In the meantime, you can install the package from source.

git clone
cd lightning-ir
pip install .

Model Zoo

Cross-encoders

Model Name	TREC DL 19/20 nDCG@10 (BM25)	TIREx nDCG@10
monoelectra-base	0.715	0.416
monoelectra-large	0.730	0.434
monoT5 (Coming soon)	--	--

Bi-encoders

Model Name	TREC DL 19/20 nDCG@10
ColBERT (Coming soon)	--
DPR (Coming soon)	--
SPLADE (Coming soon)	--

Usage

Command Line Interface

Lightning IR uses the Lightning CLI and adds some additional options to provide a unified interface for fine-tuning and running neural ranking models. After installation, the CLI can be accessed via the lightning-ir command.

The CLI offers four subcommands:

$ lightning-ir -h
Lightning Trainer command line tool

subcommands:
  For more details of each subcommand, add it as an argument followed by --help.

  Available subcommands:
    fit                 Runs the full optimization routine.
    index               Index a collection of documents.
    search              Search for relevant documents.
    re_rank             Re-rank a set of retrieved documents.

Name		Name	Last commit message	Last commit date
Latest commit History 265 Commits
configs		configs
lightning_ir		lightning_ir
tests		tests
.gitignore		.gitignore
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

configs

configs

lightning_ir

lightning_ir

tests

tests

.gitignore

.gitignore

README.md

README.md

setup.py

setup.py

Repository files navigation

Lightning IR

Installation

Model Zoo

Cross-encoders

Bi-encoders

Usage

Command Line Interface

Configuration

Data Formats

Examples

Fine-tuning

Indexing

Searching

Re-ranking

About

Releases

Packages

Languages

webis-de/lightning-ir

Folders and files

Latest commit

History

Repository files navigation

Lightning IR

Installation

Model Zoo

Cross-encoders

Bi-encoders

Usage

Command Line Interface

Configuration

Data Formats

Examples

Fine-tuning

Indexing

Searching

Re-ranking

About

Resources

Stars

Watchers

Forks

Languages