Resolve XSS vulnerability in local Wordnet browser #3096

tomaarsen · 2022-12-28T13:34:39Z

Hello!

Pull Request overview

Resolve a cross-site scripting (XSS) vulnerability in nltk.app.wordnet_app. This only affected users of this browser interface to Wordnet, and not other users of Wordnet. If the following image is not familiar to you, then you are not affected:

[Screenshot of the Wordnet browser interface]

Details

Whenever an unknown path was requested from the local server, e.g. http://localhost:8000/<script>alert(1)</script>.html, the wordnet app would try to find a file called <script>alert(1)</script>.html, fail to do so, and then respond with an error page saying "Internal error: Path for static page '<script>alert(1)</script>' is unknown". However, this response was served as HTML, so the browser would execute the script.
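
To make the mechanism concrete, here is a minimal sketch of how a request path can end up reflected in an error message. The helper name error_message_for and the exact decoding steps are assumptions for illustration; this is not the code in nltk.app.wordnet_app.

```python
from urllib.parse import unquote_plus


def error_message_for(request_path: str) -> str:
    """Build the error text for an unknown static page (hypothetical helper)."""
    # Drop the leading "/" and decode percent-escapes such as %3C for "<".
    name = unquote_plus(request_path[1:])
    # Drop a trailing ".html", matching the error message quoted above.
    if name.endswith(".html"):
        name = name[: -len(".html")]
    return f"Internal error: Path for static page '{name}' is unknown"


# The markup from the URL survives unchanged in the message. If this text is
# served as text/html, the browser parses the <script> element and runs it.
message = error_message_for("/%3Cscript%3Ealert(1)%3C/script%3E.html")
print(message)
```

The point of the sketch is that nothing between the URL and the response body neutralizes the markup, so the declared content type alone decides whether the browser treats it as code.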

I don't believe that there is a real attack vector here, as the pages that are normally seen are directly from Wordnet, so one of the Wordnet URLs would need to be modified into a malicious link. That said, there is no reason not to fix this.

Reproducing

The wordnet browser app can be started like so:

import nltk
nltk.app.wordnet()

Then, browsing to http://localhost:8000/<script>alert(1)</script>.html would cause the following popup to appear:

[Screenshot of the alert popup]

The fix

By setting the Content-type to text/plain when an unknown path is used, we prevent any code from being executed.
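
As a rough illustration of this approach, the sketch below uses Python's standard http.server module rather than the actual wordnet_app code. The handler class SketchHandler, the KNOWN_PAGES table, and the render helper are hypothetical names; only the idea of answering unknown paths with text/plain comes from this PR.

```python
from http.server import BaseHTTPRequestHandler
from typing import Tuple
from urllib.parse import unquote_plus

# Hypothetical table of pages the server knows how to render as HTML.
KNOWN_PAGES = {"index.html": "<h1>Index</h1>"}


def render(path: str) -> Tuple[str, str]:
    """Return (content_type, body) for a request path."""
    name = unquote_plus(path.lstrip("/"))
    if name in KNOWN_PAGES:
        return "text/html", KNOWN_PAGES[name]
    # Unknown path: the body echoes user input, so declare it as plain text.
    # The browser then displays the characters instead of parsing markup.
    return "text/plain", f"Internal error: Path for static page '{name}' is unknown"


class SketchHandler(BaseHTTPRequestHandler):
    """Illustrative handler, not the one used by nltk.app.wordnet_app."""

    def do_GET(self):
        content_type, body = render(self.path)
        payload = body.encode("utf-8")
        # Status handling is simplified for this sketch.
        self.send_response(200)
        self.send_header("Content-type", f"{content_type}; charset=utf-8")
        self.send_header("Content-Length", str(len(payload)))
        self.end_headers()
        self.wfile.write(payload)
```

To try it locally, one could pass SketchHandler to http.server.HTTPServer(("localhost", 8000), SketchHandler) and call serve_forever(). The error text still contains the raw <script> markup, but the browser shows it as text rather than executing it.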

After the fix

When running the reproduction code, we now see:

[Screenshot of the error message displayed as plain text]

And no popup.

This vulnerability was disclosed according to our security policy, and we are thankful for that.

Tom Aarsen


arademaker · 2022-12-28T14:15:31Z

Where can I find the tutorial/doc about how to use the wordnet app?

tomaarsen · 2022-12-28T14:22:45Z

There's no tutorial on it, and the only documentation is this: https://www.nltk.org/api/nltk.app.wordnet_app.html
But honestly, that's quite vague and doesn't really seem to correspond with my knowledge of the wordnet app. I would just run

import nltk
nltk.app.wordnet()

And mess around with the web browser that should automatically pop up. There's a help button in the interface.

kylemcmearty · 2023-01-06T14:39:42Z

Hey, is this fix going into version 3.9?

tomaarsen · 2023-01-06T14:42:41Z

We're still discussing this internally. We will either:

include this fix into NLTK 3.9, or
remove wordnet_app from NLTK altogether for version 3.9 onwards.

One thing is certain: once NLTK 3.9 is released, it should not have this vulnerability.

arademaker · 2023-01-13T12:48:59Z

Thank you, @tomaarsen. I could finally see the wordnet app in action with nltk.app.wordnet() in the Python prompt. I noticed that the example of searching for multiple words on the help page needs to be fixed. The current word field is also not enabled.

tomaarsen · 2023-01-13T12:59:32Z

There may still be some small issues with it. Steven and I have discussed potentially deprecating it and moving it to nltk_contrib instead of paying the maintenance cost for it.

Resolve XSS in Wordnet app

2e0525a

By setting the Content-type to text/plain when an unknown path is used.

tomaarsen added bug wordnet GUI labels Dec 28, 2022

github-actions bot removed the wordnet label Dec 28, 2022

tomaarsen merged commit c8cedf1 into nltk:develop Dec 28, 2022

tomaarsen deleted the vuln/wordnet_app_xss branch December 28, 2022 13:58
