BLOOM

Overview

The BLOOM model and its various versions were proposed through the BigScience Workshop. BigScience is inspired by other open science initiatives in which researchers pool their time and resources to collectively achieve a higher impact. The architecture of BLOOM is essentially similar to GPT-3 (an auto-regressive model for next-token prediction), but it has been trained on 46 different languages, including code. Several smaller versions of the model have been trained on the same dataset.
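To make "auto-regressive next-token prediction" concrete, here is a minimal sketch of greedy decoding. It assumes `torch` and `transformers` are installed. To avoid downloading a checkpoint, it uses a tiny, randomly initialized model whose sizes (`vocab_size=128`, `hidden_size=32`, and so on) are hypothetical values chosen for illustration, so the generated token ids are meaningless. The helper `greedy_decode` is likewise illustrative and not part of the library.

```python
import torch
from transformers import BloomConfig, BloomForCausalLM

torch.manual_seed(0)

# Hypothetical tiny configuration (illustration only, not a released checkpoint).
config = BloomConfig(vocab_size=128, hidden_size=32, n_layer=2, n_head=4)
model = BloomForCausalLM(config)  # random weights, no download
model.eval()


def greedy_decode(model, input_ids, max_new_tokens):
    """Append one token at a time, always picking the highest-scoring one."""
    generated = input_ids
    with torch.no_grad():
        for _ in range(max_new_tokens):
            # logits has shape (batch, sequence_length, vocab_size)
            logits = model(input_ids=generated).logits
            # The prediction for the next token is read from the last position.
            next_token = logits[:, -1, :].argmax(dim=-1, keepdim=True)
            generated = torch.cat([generated, next_token], dim=-1)
    return generated


prompt = torch.tensor([[5, 17, 42]])  # arbitrary token ids
output = greedy_decode(model, prompt, max_new_tokens=4)
print(output.shape)  # 3 prompt tokens + 4 generated tokens
```

In practice you would load pretrained weights and a tokenizer, and call the model's `generate` method rather than writing the loop by hand; the loop is spelled out here only to show the mechanism.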

BloomConfig

[[autodoc]] BloomConfig - all
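As a rough sketch of how the configuration class can be used, the snippet below builds a `BloomConfig` with deliberately small, hypothetical sizes (they do not correspond to any released BLOOM checkpoint) and round-trips it through a plain dictionary. It assumes `transformers` is installed.

```python
from transformers import BloomConfig

# Hypothetical small sizes, chosen for illustration only.
config = BloomConfig(vocab_size=128, hidden_size=32, n_layer=2, n_head=4)

# BLOOM-specific names and the generic aliases refer to the same values.
print(config.n_layer, config.num_hidden_layers)
print(config.n_head, config.num_attention_heads)

# A configuration can be serialized to a dict and rebuilt from it.
restored = BloomConfig.from_dict(config.to_dict())
print(restored.hidden_size)
```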

BloomModel

[[autodoc]] BloomModel - forward

BloomTokenizerFast

[[autodoc]] BloomTokenizerFast - all

BloomForCausalLM

[[autodoc]] BloomForCausalLM - forward
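The following sketch shows one way to call `forward` with `labels` to obtain a language-modeling loss. It assumes `torch` and `transformers` are installed and again uses a tiny, randomly initialized model with hypothetical sizes, so the loss value itself carries no meaning. Passing the input ids as labels is the usual pattern for causal language modeling, since the model shifts the labels internally.

```python
import torch
from transformers import BloomConfig, BloomForCausalLM

torch.manual_seed(0)

# Hypothetical tiny configuration (illustration only).
config = BloomConfig(vocab_size=128, hidden_size=32, n_layer=2, n_head=4)
model = BloomForCausalLM(config)

# A batch of 2 sequences with 6 random token ids each.
input_ids = torch.randint(0, config.vocab_size, (2, 6))

# With labels supplied, the output carries both the loss and the logits.
outputs = model(input_ids=input_ids, labels=input_ids)
print(outputs.logits.shape)  # (batch, sequence_length, vocab_size)
print(outputs.loss.item())

# The loss is a scalar tensor that can be backpropagated as usual.
outputs.loss.backward()
```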