[Trojan Source Attacks] New feature: ban the use of text directionality control characters #750

Lucas-C · 2021-11-04T13:40:01Z

Is your feature request related to a problem? Please describe.
The vulnerability is detailed here: https://trojansource.codes

adversaries can attack the encoding of source code files to inject vulnerabilities
The trick is to use Unicode control characters to reorder tokens in source code at the encoding level.

Describe the solution you'd like
Could a new check be added to bandit to detect those characters?

Describe alternatives you've considered
Using a language-agnostic linter tool detecting this vulnerability,
but I do not know any existing one so far.

Additional context

The text was updated successfully, but these errors were encountered:

kleph · 2021-11-04T14:02:42Z

May be a bit less strict, as the article mention, unterminated bidirectionnal control character ?

I still haven't use those type of character, but I presume they are usefull for right to left languages. I assume banning all control chars would prevent writing comment in those languages ?

It's probably harder to implement though.

Appart from that, I think it's a great idea to implent those checks in tools, to not rely on human eye !

Lucas-C · 2021-11-05T08:06:53Z

It is recommended as part of the PDF white paper, section "VII. F - Defenses":

The simplest defense is to ban the use of text directionality control characters
If an application wishes to print text that requires Bidi overrides, developers can generate those characters using escape sequences rather than embedding potentially dangerous characters into source code.

CarliJoy · 2021-11-09T08:53:57Z

Please note #749 where I already opened an issue for the same topic.
Especially have a look at the linked ticket there that explains the issue in detail for python!

Lucas-C · 2021-11-09T10:52:39Z

Indeed, those issues are duplicates.
I'm closing this as it came after yours @CarliJoy

Lucas-C added the enhancement New feature or request label Nov 4, 2021

This was referenced Nov 8, 2021

Unpaired Bidi Control Character Detection PyCQA/pycodestyle#1037

Closed

Unpaired Bidi Control Character Detection pylint-dev/pylint#5281

Closed

Lucas-C closed this as completed Nov 9, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Trojan Source Attacks] New feature: ban the use of text directionality control characters #750

[Trojan Source Attacks] New feature: ban the use of text directionality control characters #750

Lucas-C commented Nov 4, 2021

kleph commented Nov 4, 2021

Lucas-C commented Nov 5, 2021

CarliJoy commented Nov 9, 2021 •

edited

Lucas-C commented Nov 9, 2021

[Trojan Source Attacks] New feature: ban the use of text directionality control characters #750

[Trojan Source Attacks] New feature: ban the use of text directionality control characters #750

Comments

Lucas-C commented Nov 4, 2021

Additional context

kleph commented Nov 4, 2021

Lucas-C commented Nov 5, 2021

CarliJoy commented Nov 9, 2021 • edited

Lucas-C commented Nov 9, 2021

CarliJoy commented Nov 9, 2021 •

edited