Add model JinaBertForMaskedLM to the supported list. #628

Open
Cris-Maggi opened this issue Nov 6, 2023 · 1 comment
Labels
enhancement New feature or request topic:NLP Issue or PR about NLP model support and eland_import_hub_model

Comments

@Cris-Maggi

Please add the new BERT-based model JinaBertForMaskedLM to the list of supported models; customers are requesting it.
Model page: https://huggingface.co/jinaai/jina-embeddings-v2-base-en
CLI command used for the import:

**docker run -it --rm elastic/eland eland_import_hub_model --url https://elastic:password@elasticsearchlink/ --hub-model-id jinaai/jina-embeddings-v2-base-en --task-type text_embedding --start**

Error observed

Traceback (most recent call last):
  File "/usr/local/bin/eland_import_hub_model", line 8, in <module>
    sys.exit(main())
  File "/usr/local/lib/python3.10/site-packages/eland/cli/eland_import_hub_model.py", line 241, in main
    tm = TransformerModel(
  File "/usr/local/lib/python3.10/site-packages/eland/ml/pytorch/transformers.py", line 642, in __init__
    self._config = self._create_config(es_version)
  File "/usr/local/lib/python3.10/site-packages/eland/ml/pytorch/transformers.py", line 735, in _create_config
    tokenization_config = self._create_tokenization_config()
  File "/usr/local/lib/python3.10/site-packages/eland/ml/pytorch/transformers.py", line 673, in _create_tokenization_config
    _max_sequence_length = self._find_max_sequence_length()
  File "/usr/local/lib/python3.10/site-packages/eland/ml/pytorch/transformers.py", line 730, in _find_max_sequence_length
    raise ValueError("Cannot determine model max input length")
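
The traceback ends in a lookup of the model's maximum input length, so a small sketch of what such a lookup might involve may help readers reason about the failure. This is not eland's actual code: the function name `find_max_sequence_length`, the `REASONABLE_MAX_LENGTH` bound, and the choice of attributes to probe (`model_max_length` on the tokenizer, `max_position_embeddings` on the model config) are assumptions made for illustration. The one fact it relies on is that Hugging Face tokenizers report a very large sentinel (`int(1e30)`) when no limit is configured, which a lookup like this has to reject.

```python
from types import SimpleNamespace
from typing import Any

# Hypothetical upper bound: anything larger is treated as "not configured".
REASONABLE_MAX_LENGTH = 8192


def find_max_sequence_length(tokenizer: Any, config: Any) -> int:
    """Return the first plausible max input length, or raise ValueError.

    Illustrative only; the attributes probed here are assumptions and
    may differ from what eland actually inspects.
    """
    candidates = (
        getattr(tokenizer, "model_max_length", None),
        getattr(config, "max_position_embeddings", None),
    )
    for value in candidates:
        # Skip missing values and sentinel values such as int(1e30).
        if isinstance(value, int) and 0 < value <= REASONABLE_MAX_LENGTH:
            return value
    raise ValueError("Cannot determine model max input length")


if __name__ == "__main__":
    # Stand-in objects with made-up values, not the real Jina tokenizer/config.
    tokenizer = SimpleNamespace(model_max_length=int(1e30))
    config = SimpleNamespace(max_position_embeddings=512)
    print(find_max_sequence_length(tokenizer, config))
```

Under these assumptions, a model whose tokenizer and config expose no usable value through either attribute would fail with the same message as in the traceback, which is one plausible reading of the error reported here.
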
@joshdevins added the labels enhancement (New feature or request) and topic:NLP (Issue or PR about NLP model support and eland_import_hub_model) on Nov 6, 2023
@steveedcast

I also have a need to use Jina for long input sequences. Any idea when Elastic would support this in eland?

3 participants