[New Model] Donut: Document Understanding Transformer #18530

WaterKnight1998 · 2022-08-08T15:33:10Z

Model description

Donut (Document understanding transformer) is a new method of document understanding that uses an OCR-free, end-to-end Transformer model. Donut does not require off-the-shelf OCR engines or APIs, yet it shows state-of-the-art performance on various visual document understanding tasks, such as visual document classification and information extraction (a.k.a. document parsing). In addition, we present SynthDoG (Synthetic Document Generator), which helps make model pre-training flexible across various languages and domains.

Open source status

The model implementation is available
The model weights are available

Provide useful links for the implementation

Code @clovaai : https://github.com/clovaai/donut

Weights:

NielsRogge · 2022-08-08T16:07:15Z

See #18488

WaterKnight1998 · 2022-08-08T16:27:47Z

Cool to see you working there, thank you very much =D

WaterKnight1998 added the New model label Aug 8, 2022

WaterKnight1998 closed this as completed Aug 8, 2022
