Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use multiprocessing for hyperopt operation #688

Open
osma opened this issue Apr 14, 2023 · 1 comment
Open

Use multiprocessing for hyperopt operation #688

osma opened this issue Apr 14, 2023 · 1 comment
Milestone

Comments

@osma
Copy link
Member

osma commented Apr 14, 2023

As noted in the "Potential future work" section of PR #681:

The threading performance of annif hyperopt was already bad, and now it got worse. The solution could be to switch to process-based multiprocessing. This has been difficult to do with Optuna (needs an external relational database), but the Optuna FAQ now states that it could also be done with JournalFileStorage, which sounds more promising.

So we should investigate whether it would be possible to use multiprocessing in hyperopt operations, because the current multithreading approach doesn't actually work very well.

@osma osma added this to the Long term milestone Apr 14, 2023
@osma
Copy link
Member Author

osma commented Apr 17, 2023

Here is a blog post with more details about JournalFileStorage.

Using a journal file could perhaps make it possible to extend hyperparameter optimization runs, as suggested in #633.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant