{nb} breaks if text shaping is turned on with certain fonts #1090

catsclaw · 2024-01-12T19:37:23Z

The special {nb} code fails with some fonts when text shaping is turned on.

Minimal code
Please include some minimal Python code reproducing your issue:

from fpdf import FPDF

pdf = FPDF(format='letter')
pdf.add_font('gentium', style='', fname='GenBkBasR.ttf')
pdf.add_page()
pdf.set_font('gentium', '', 24)
# First line: text shaping enabled
pdf.set_text_shaping(True)
pdf.write(text='Pages {nb}')
pdf.ln()
# Second line: text shaping disabled, for comparison
pdf.set_text_shaping(False)
pdf.write(text='Pages {nb}')
pdf.output('test.pdf')

Result

Environment
Please provide the following information:

Operating System: Ubuntu
Python version: 11
fpdf2 version used: 2.7.7


Lucas-C · 2024-01-12T20:24:36Z

Thank you for the report @catsclaw

You are right, those two features are currently incompatible.

The reason is that with text shaping, each character is rendered individually in the PDF (with a dedicated Tj operator for each letter). But FPDF._substitute_page_number() expects the sequence {nb} to be present inside a single "PDF string" (rendered by a single Tj operator).
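To illustrate the explanation above, here is a minimal sketch in plain Python. It is a toy model, not fpdf2's actual code: `substitute_page_count` is a hypothetical stand-in for the substitution step, and the content streams are simplified (real ones also carry positioning operators and encoded glyphs rather than raw characters).

```python
ALIAS = "{nb}"


def substitute_page_count(content: str, total_pages: int) -> str:
    """Hypothetical post-rendering step: plain search and replace on the page content."""
    return content.replace(ALIAS, str(total_pages))


# One PDF string shown by one Tj operator: the alias is contiguous.
single_run = "BT (Pages {nb}) Tj ET"

# One PDF string per glyph: the alias characters are separated by operators.
per_glyph = "BT " + " ".join(f"({ch}) Tj" for ch in "Pages {nb}") + " ET"

print(substitute_page_count(single_run, 3))  # alias replaced
print(substitute_page_count(per_glyph, 3))   # alias never found, stream unchanged
```

In the second stream the four characters of the alias never sit next to each other, so a search for the literal sequence cannot match.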

As a consequence, this is currently a limitation in fpdf2.
We should mention it in our documentation (in docs/PageBreaks.md).
And PRs are welcome to implement this feature also when text shaping is enabled!

Would you be interested in contributing to this @catsclaw? (docs improvement and/or implementation)

andersonhc · 2024-01-12T20:32:51Z

The characters are rendered as a single sequence as long as each one only moves along the x axis by its own character width. If there is any offset (kerning, etc.), we need to adjust the text matrix and emit individual Tj operators. That's why only some fonts have this problem.
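A rough sketch of that rule, as a simplified model rather than fpdf2's real logic. `ShapedGlyph`, `needs_own_tj` and `group_into_runs` are hypothetical names, and the advances are made-up numbers:

```python
from dataclasses import dataclass


@dataclass
class ShapedGlyph:
    char: str
    default_advance: float  # advance from the font's width table (hypothetical units)
    x_advance: float        # advance reported by the shaper
    x_offset: float = 0.0
    y_offset: float = 0.0


def needs_own_tj(glyph: ShapedGlyph) -> bool:
    """True if the shaper moved the glyph or changed its advance."""
    return (
        glyph.x_offset != 0
        or glyph.y_offset != 0
        or glyph.x_advance != glyph.default_advance
    )


def group_into_runs(glyphs):
    """Group glyphs into strings, one string per Tj operator."""
    runs, current = [], ""
    for glyph in glyphs:
        if needs_own_tj(glyph):
            if current:
                runs.append(current)
                current = ""
            runs.append(glyph.char)
        else:
            current += glyph.char
    if current:
        runs.append(current)
    return runs


# A font with no adjustments keeps the alias in one string.
plain = [ShapedGlyph(c, 500, 500) for c in "{nb}"]
# A font that kerns the "n" splits the alias across several strings.
kerned = [ShapedGlyph(c, 500, 480 if c == "n" else 500) for c in "{nb}"]

print(group_into_runs(plain))
print(group_into_runs(kerned))
```

Under this model a single adjusted glyph inside the alias is enough to break it apart, which matches the observation that only some fonts are affected.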

andersonhc · 2024-01-27T14:48:50Z

Adding to this issue:

When the alias ({nb}) appears in the text of a multi cell with justified alignment, the line width is calculated with the width of the alias instead of the final number, so the text won't be correctly justified.
If the multi cell splits the alias across 2 different lines, it won't be replaced by the number of pages.

Example:

from fpdf import FPDF
text="Lorem ipsum dolor sit amet, {nb} {nb} {nb} {nb} {nb} {nb} consectetur adipiscing elit. {nb}{nb}{nb}{nb}{nb}{nb}{nb}{nb}{nb}{nb}{nb}{nb}{nb}{nb}{nb}{nb}{nb}{nb}{nb}{nb}Mauris sit amet lacus ut ex tincidunt vulputate non nec mauris. Lorem ipsum dolor sit amet, consectetur adipiscing elit."
pdf = FPDF()
pdf.add_page()
pdf.set_font("helvetica", "", 24)
pdf.multi_cell(w=pdf.epw, text=text, align="J", new_x="LEFT")
pdf.output('test_nb.pdf')

Result:

The problem is that the replacement is done directly in the page content, after all the rendering is done.
I don't see an obvious way to fix it, and it will probably require a lot of rework of how output works.
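Both effects described in this comment can be reproduced with a toy model that does not depend on fpdf2. The fixed character width and the 6-character line length are assumptions chosen only for illustration, and `text_width` and `substitute` are hypothetical helpers:

```python
ALIAS = "{nb}"
CHAR_WIDTH_EM = 0.5  # hypothetical metric: every character is half an em wide


def text_width(text, font_size):
    """Width in points under the hypothetical fixed-width metric above."""
    return len(text) * CHAR_WIDTH_EM * font_size


def substitute(lines, total_pages):
    """Toy post-rendering pass: replace the alias line by line."""
    return [line.replace(ALIAS, str(total_pages)) for line in lines]


# 1. Justification: the layout measures the alias, the reader sees the number.
line = "Page count: {nb}"
width_at_layout = text_width(line, 24)
width_in_output = text_width(substitute([line], 3)[0], 24)
print(width_at_layout - width_in_output)  # amount by which the final line falls short

# 2. Line break inside the alias: the alias cut by the break is left as is.
text = ALIAS * 3
broken_lines = [text[i:i + 6] for i in range(0, len(text), 6)]
print(substitute(broken_lines, 3))
```

Since the substitution runs after layout, neither the measured width nor the line breaks can take the final number into account.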

gmischler · 2024-02-22T14:08:37Z

The underlying problem here is that an otherwise legitimate sequence of text characters is given a special meaning under certain circumstances. This was bound to result in conflicts somewhere down the line.

The clean solution would be to use a reserved Unicode character for this purpose, which can't otherwise appear in renderable text.
A practical approach might be to convert self.str_alias_nb_pages into a special Glyph() subtype (say NbGlyph()) during text parsing. When rendering, NbGlyph() then inserts a sequence of three or four of this reserved Unicode character. And before writing the file, the reserved character sequences get replaced with the right sequence of digit glyphs.

Or am I missing some basic obstacle here?
Yes, various places in the code need to learn about this special case, but that is kind of inevitable if we want to avoid conflicts.
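A minimal sketch of the placeholder idea, under several assumptions: the private-use code point U+E000 stands in for the reserved character, rendered output is modeled as a list of strings (one per Tj operator), and `parse` and `finalize` are hypothetical names. It does not model a Glyph() subtype and is not taken from fpdf2.

```python
ALIAS = "{nb}"
PLACEHOLDER = "\ue000"  # assumption: a private-use code point as the reserved character
PLACEHOLDER_RUN = 3     # each alias becomes a run of this many placeholders


def parse(text):
    """Text parsing step: swap the alias for a run of reserved characters."""
    return text.replace(ALIAS, PLACEHOLDER * PLACEHOLDER_RUN)


def finalize(strings, total_pages):
    """Before writing the file: turn each placeholder into a digit (or padding)."""
    digits = str(total_pages).ljust(PLACEHOLDER_RUN)
    if len(digits) > PLACEHOLDER_RUN:
        raise ValueError("page count does not fit in the placeholder run")
    out, pos = [], 0
    for s in strings:
        chars = []
        for ch in s:
            if ch == PLACEHOLDER:
                # Position within the current run of PLACEHOLDER_RUN placeholders
                chars.append(digits[pos % PLACEHOLDER_RUN])
                pos += 1
            else:
                chars.append(ch)
        out.append("".join(chars))
    return out


parsed = parse("Pages {nb}")
as_one_string = [parsed]       # rendered by a single Tj
per_glyph = list(parsed)       # rendered with one Tj per glyph

print(finalize(as_one_string, 12))
print("".join(finalize(per_glyph, 12)))
```

Because every placeholder is replaced individually, the result no longer depends on whether the run ends up in one string or is split across several. The fixed run length still leaves the width mismatch for justified text open.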


catsclaw added the bug label Jan 12, 2024

Lucas-C added the text-shaping label Jan 12, 2024

andersonhc mentioned this issue May 21, 2024

Number of pages appears to not function correctly #71

Closed

evilaliv3 added a commit to globaleaks/GlobaLeaks that referenced this issue May 22, 2024

Adopt courier font for headers and footers

0464d2a

Address issue: py-pdf/fpdf2#1090
