Skip to content

goliaro/ms_thesis

Repository files navigation

ExpertFlow: Enabling Low-Latency Asynchronous Inference for Mixture of Expert Models

This repository contains the LaTeX source and PDF of my Tsinghua MS thesis, submitted in May 2023.

You can download the PDF here.

For more information, check out the project page here.

The code is available here.

Citation

Please cite as:

@masterthesis{oliaro2023expertflow,
    title        = {ExpertFlow: Enabling Low-Latency Asynchronous Inference for Mixture of Expert Models},
    author       = {Gabriele Oliaro},
    year         = 2023,
    month        = {May},
    address      = {Beijing, China},
    school       = {Tsinghua University},
    type         = {Master's thesis}
}

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages