Improve Gson and JsonParser trailing data handling #2123

Marcono1234 · 2022-05-22T17:04:07Z

Improves the trailing data handling for Gson and JsonParser, mainly addressing the following issues:

The old code was not detecting trailing data after a JSON null, e.g. null[], because it erroneously considered a null to always mean that the input was empty
The existing check for trailing data in the form of multiple top-level JSON elements was never working (?). While the parsing was done in lenient mode, the check reader.peek() != JsonToken.END_DOCUMENT was done separately, where the JsonReader was in strict mode again and therefore peek() itself threw a MalformedJsonException.
With the new code added by this pull request the custom JsonSyntaxException is now thrown for multiple top-level JSON elements; however, a MalformedJsonException could likely still occur if the trailing data is not valid JSON data, but that is probably acceptable.

Marcono1234 · 2022-05-22T17:05:03Z

gson/src/test/java/com/google/gson/functional/GsonParsingTest.java

 *
 * @author Inderjeet Singh
 * @author Joel Leitch
 */
-public class JsonParserTest extends TestCase {
+public class GsonParsingTest extends TestCase {


Have renamed this class because there exists already a JsonParserTest (in a different package), and this test class is mainly checking the behavior of Gson methods.

eamonnmcmanus

I ran this against all of Google's tests and did not find any problems, except:

references to a now-deleted internal method, which I think needs to be undeleted
tests that check for specific exceptions that are now different

I did have to merge with HEAD, and that implied adding a couple of @Test annotations to the now-JUnit4 JsonParserTest.

eamonnmcmanus · 2022-07-22T15:01:57Z

gson/src/main/java/com/google/gson/internal/Streams.java

   */
-  public static JsonElement parse(JsonReader reader) throws JsonParseException {


I'm finding a surprisingly large number of references to this method in Google's source base (including third-party code). In some cases it is because people have copied RuntimeTypeAdapterFactory into their own projects. So I think we probably need to keep this overload. Then of course we could continue to call it from RuntimeTypeAdapterFactory instead of the new (..., false) version.

Right, before #1959 RuntimeTypeAdapterFactory was using the internal Streams class. I have now added back that method, but marked it as deprecated to make users aware that it is declared in an internal Gson class, I hope that is ok.

Then of course we could continue to call it from RuntimeTypeAdapterFactory instead of the new (..., false) version.

The RuntimeTypeAdapterFactory variant which was still using this method was actually a copy of the class from the extras module. I have now removed that copy (and the enclosing test class RuntimeTypeAdapterFactoryFunctionalTest) and have moved the adjusted test code to the RuntimeTypeAdapterFactoryTest class in the extras module to avoid code duplication.

…check

Marcono1234 · 2022-07-23T16:32:37Z

tests that check for specific exceptions that are now different

Are these tests related to JSON with trailing data / multiple top level JSON values? And are the newly thrown exceptions more reasonable?
If not, which exceptions are now thrown? I thought I did not change any of the other exception handling logic.

eamonnmcmanus · 2022-07-23T22:48:05Z

Are these tests related to JSON with trailing data / multiple top level JSON values? And are the newly thrown exceptions more reasonable?
If not, which exceptions are now thrown? I thought I did not change any of the other exception handling logic.

Thanks for prompting me to look at this in more detail. I think there are problems with the new code. I haven't looked at all the test failures, but the one I did look at failed because this:

JsonParser.parseString("$$$$////")

gets a JsonSyntaxException with the current code but returns a JsonPrimitive for the string "$$$$" with this change. That doesn't seem right.

Marcono1234 · 2022-07-24T15:03:46Z

gets a JsonSyntaxException with the current code but returns a JsonPrimitive for the string "$$$$" with this change. That doesn't seem right.

The problem is that JsonParser parses leniently. However, before this PR the JsonReader.peek() call to determine whether the end of the document has been reached was (erroneously?) done after the original lenient mode was restored (therefore the JsonReader was already strict again). You will also notice that in the message of the original exception you get for that input:

Use JsonReader.setLenient(true) to accept malformed JSON at line 1 column 6 path $

This indicates that the first / caused the exception and not the $ (the location information is slightly incorrect, see also #1764).

With the changes of this PR the peek() call is now (correctly?) done in lenient mode as well, which therefore parses the JSON as:

String $$$$
End of line comment start //
- End of line comment //

If you don't want this to be changed, then the end of document checks can most likely be removed (or reduced to a peek() call discarding the result) because they are effectively dead code in their current form. Though explaining to a user when and how Gson is lenient becomes then slightly more confusing.

…check

Marcono1234 commented May 22, 2022

View reviewed changes

Improve Gson and JsonParser trailing data handling

ee70912

Marcono1234 force-pushed the marcono1234/Gson-JsonParser-trailing-data-check branch from 0da3a56 to ee70912 Compare June 29, 2022 19:55

Marcono1234 mentioned this pull request Jun 29, 2022

Improve lenient mode documentation #2122

Merged

eamonnmcmanus requested changes Jul 22, 2022

View reviewed changes

Marcono1234 added 4 commits July 23, 2022 17:24

Merge branch 'master' into marcono1234/Gson-JsonParser-trailing-data-…

f37bdc2

…check

Add missing Test annotations for JsonParserTest

0938138

Add back Streams.parse(JsonReader)

4ab2af9

Move RuntimeTypeAdapterFactoryFunctionalTest to extras module

d4fc470

Marcono1234 added 2 commits October 8, 2022 19:35

Merge branch 'master' into marcono1234/Gson-JsonParser-trailing-data-…

0dc79b2

…check

Merge branch 'master' into marcono1234/Gson-JsonParser-trailing-data-…

17b3826

…check

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve Gson and JsonParser trailing data handling #2123

Improve Gson and JsonParser trailing data handling #2123

Marcono1234 commented May 22, 2022 •

edited

Marcono1234 May 22, 2022

eamonnmcmanus left a comment

eamonnmcmanus Jul 22, 2022

Marcono1234 Jul 23, 2022

Marcono1234 commented Jul 23, 2022

eamonnmcmanus commented Jul 23, 2022

Marcono1234 commented Jul 24, 2022 •

edited

		*/
		public static JsonElement parse(JsonReader reader) throws JsonParseException {

Improve Gson and JsonParser trailing data handling #2123

Are you sure you want to change the base?

Improve Gson and JsonParser trailing data handling #2123

Conversation

Marcono1234 commented May 22, 2022 • edited

Marcono1234 May 22, 2022

Choose a reason for hiding this comment

eamonnmcmanus left a comment

Choose a reason for hiding this comment

eamonnmcmanus Jul 22, 2022

Choose a reason for hiding this comment

Marcono1234 Jul 23, 2022

Choose a reason for hiding this comment

Marcono1234 commented Jul 23, 2022

eamonnmcmanus commented Jul 23, 2022

Marcono1234 commented Jul 24, 2022 • edited

Marcono1234 commented May 22, 2022 •

edited

Marcono1234 commented Jul 24, 2022 •

edited