Updating and moving the grammar to a separate file #2

mtdowling · 2014-12-07T02:08:18Z

This commit simplifies the JMESPath grammar and moves the grammar to a
separate downloadable file.

This is a WIP PR. I'm finding other places in the grammar that I think could be improved.


mtdowling · 2014-12-09T04:15:47Z

Actually, "||" and "|" are still ambiguous. What's the desired precedence here?

I found more ambiguities with pipes and ors: a | b | c was ambiguous, so I've updated the grammar. I also noticed an issue with my grammar where projections were consuming pipes and ors. I've made a change to address this.

I've also added slicing and grouping to the grammar.

jamesls · 2014-12-22T19:36:51Z

I had an updated grammar somewhere that had part of the precedence baked into the grammar. It wasn't complete, but it had the or/pipe precedence worked out. I'll see if I can find that branch. We should double-check that it's consistent with this.

The desired precedence is that a || b | c is parsed as ((a || b) | c). I believe we have a compliance test somewhere that verifies this. I confirmed that this is how the Python lib parses this expression.

>>> import jmespath
>>> from pprint import pprint
>>> pprint(jmespath.compile('a || b | c').parsed)
{'children': [{'children': [{'children': [], 'type': 'field', 'value': 'a'},
                            {'children': [], 'type': 'field', 'value': 'b'}],
               'type': 'or_expression'},
              {'children': [], 'type': 'field', 'value': 'c'}],
 'type': 'pipe'}

mtdowling · 2014-12-24T23:02:21Z

Good catch. I've pushed a commit that ensures that or expressions bind more tightly than pipe expressions.

Added an "and-expression". Added a "not" expression. not and binary expressions can be at the root level. Filter expressions now support unary conditions. Simplified function argument grammar. Fixed a bug in how literals were described.

mtdowling · 2015-01-03T21:03:33Z

I've now shortened the majority of the rule names (e.g., expression is now expr). This shortening of the rule names seems to be in line with most other grammars I've seen in the wild (e.g., http://www.lua.org/manual/5.2/manual.html#9 and https://docs.python.org/3/reference/grammar.html).

I updated "or" and "and" to have an explicit precedence so that "and" binds more tightly than "or".

I split subexpr into object-subexpr and array-subexpr to better distinguish object-style access using a "." from array-style access using a "[". This change fixes a bug in the current grammar that allows dot-style object access off of a multi-list (e.g., [a, b].c is no longer allowed because you cannot access an array as an object). I think splitting these rules out will help us specify very granularly how subexpressions descend into data.

I think this grammar update is pretty close to being a fully functional and unambiguous replacement for the current grammar. I'm using this grammar verbatim in the Clojure implementation, so you can use that to play around with how it parses and see how everything works. I'll continue to flesh out the Clojure implementation and try to get it fully compliant with the test suite to ensure that the grammar is backwards compatible.

Renaming function-arg* to arg and arg-list

mtdowling · 2015-02-01T04:23:50Z

I pushed a change that builds insignificant whitespace into the grammar (similar to the JSON ABNF). This makes the grammar much more robust and does not require implementation details to be part of parsing (i.e., we no longer need to say "whitespace is insignificant except inside of quotes and literals").

jamesls · 2015-04-10T23:53:24Z

I'm starting to pick up work again on JEP-9 (the improved filters with ands, nots, parens, etc) and I want to incorporate this grammar as part of that work. I'm starting to take a look at this.

jamesls · 2015-04-10T23:58:31Z

docs/jmespath-grammar.txt

+wildcard-values = "*"
+current-node    = "@"
+
+number            = *"-" 1*DIGIT


Shouldn't have the *. It should just be optional instead of zero or more.

Ah yeah, should be ["-"] 1*DIGIT
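
To make the difference concrete, here is a minimal sketch that translates the two ABNF forms into Python regular expressions. The regex translation and the names LOOSE_NUMBER and STRICT_NUMBER are illustrative assumptions, not part of the grammar file.

```python
import re

# *"-" 1*DIGIT  : zero or more minus signs, then one or more digits.
LOOSE_NUMBER = re.compile(r"-*[0-9]+")

# ["-"] 1*DIGIT : at most one minus sign, then one or more digits.
STRICT_NUMBER = re.compile(r"-?[0-9]+")

for text in ("42", "-42", "--42"):
    loose = LOOSE_NUMBER.fullmatch(text) is not None
    strict = STRICT_NUMBER.fullmatch(text) is not None
    print(f"{text!r}: loose={loose} strict={strict}")
```

Only the loose form accepts "--42", which is the input the review comment is guarding against.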

mtdowling · 2015-04-11T00:23:38Z

I'll send an updated PR shortly

jamesls · 2015-04-11T15:47:53Z

docs/jmespath-grammar.txt

+current-node    = "@"
+
+number            = *"-" 1*DIGIT
+literal           = "`" 1*(unescaped-literal / escaped-literal) "`"


What was the motivation for this change? I would prefer to have the JSON grammar in here. Otherwise this will allow invalid strings such as:

`[`

which is invalid.

I thought [ was valid (currently) because JMESPath allows you to pass in unquoted strings here. It will try to parse and see that it failed and then parse it as "[".
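
The fallback behaviour described here can be sketched roughly as follows. This is an illustration under assumptions, not the actual code of any JMESPath implementation; parse_literal is a hypothetical helper that receives the text between the backticks.

```python
import json


def parse_literal(text):
    """Hypothetical helper: interpret the text found between backticks."""
    try:
        # First attempt: the text is a valid JSON value.
        return json.loads(text)
    except ValueError:
        # Fallback assumed from the discussion: treat the raw text
        # as if it had been written as a quoted string.
        return text


for source in ("[1, 2]", '"a"', "[", "foo"):
    print(repr(source), "->", repr(parse_literal(source)))
```

With a fallback like this, `[` silently becomes the string "[", whereas a grammar that embeds the JSON rules would reject it at parse time.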

mtdowling · 2015-04-11T18:38:02Z

Actually, I've got an updated copy of the grammar that includes JSON and fixes a bunch of stuff from the Clojure implementation... Update coming.

Incorporates JSON back into the grammar. Much better use of insignificant whitespace. Uses `[optional]` syntax as it is clearer than `1* optional`.

mtdowling · 2015-04-11T19:10:40Z

I've updated the grammar to include the changes from jmespath.clj. This includes adding JSON back into the grammar and better whitespace control.

jamesls · 2015-04-12T15:25:39Z

docs/jmespath-grammar.txt

+
+; "&&" binds more tightly than "||"
+; "||" binds more tightly than "|".
+terminal     = pipe / or / and


Just curious, why is this called terminal? I'm thinking of terminals as the leaf nodes in the parsed tree, i.e., they can't be replaced with anything. But everything on the RHS can be broken down into more nodes.

Maybe it isn't a good name. I named it this because it owns the entire side. Do you have a suggestion?
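
For reference, the precedence stated in the grammar comments ("&&" tighter than "||", "||" tighter than "|") can be sketched as a small recursive-descent parser. This is an illustrative toy, not the grammar or any real implementation: it only handles bare identifiers, the node names are made up, and left associativity is an assumption of the sketch.

```python
import re

TOKEN = re.compile(r"\s*(\|\||&&|\||[A-Za-z_][A-Za-z0-9_]*)")

# Lowest precedence first: "|" < "||" < "&&".
LEVELS = [("|", "pipe"), ("||", "or"), ("&&", "and")]
OPERATORS = {op for op, _ in LEVELS}


def tokenize(text):
    text = text.rstrip()
    tokens, pos = [], 0
    while pos < len(text):
        match = TOKEN.match(text, pos)
        if match is None:
            raise ValueError(f"unexpected input at offset {pos}")
        tokens.append(match.group(1))
        pos = match.end()
    return tokens


def parse(text):
    tokens = tokenize(text)
    pos = 0

    def parse_level(level):
        nonlocal pos
        if level == len(LEVELS):
            # Operand: this sketch only supports bare identifiers.
            if pos >= len(tokens) or tokens[pos] in OPERATORS:
                raise ValueError("expected an identifier")
            name = tokens[pos]
            pos += 1
            return ("field", name)
        op, node_type = LEVELS[level]
        # Parse the tighter-binding level first, then fold operators
        # of this level left to right.
        left = parse_level(level + 1)
        while pos < len(tokens) and tokens[pos] == op:
            pos += 1
            right = parse_level(level + 1)
            left = (node_type, left, right)
        return left

    tree = parse_level(0)
    if pos != len(tokens):
        raise ValueError("unexpected trailing input")
    return tree


print(parse("a || b | c"))
print(parse("a || b && c"))
```

Each level of the loop corresponds to one rule in a layered grammar, which is why the loosest operator ends up at the root of the tree.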

mtdowling · 2015-04-19T18:22:09Z

I've updated the grammar to support JEP 12 and added support to the Clojure implementation for raw string literals.

jamesls · 2015-06-09T07:26:53Z

docs/jmespath-grammar.txt

+; Strings, identifiers, and characters.
+raw-string        = "'" *raw-string-char "'"
+raw-string-char   = (%x20-26 / %x28-5B / %x5D-10FFFF) / raw-string-escape
+raw-string-escape = escape ["'"]


Just a heads up, I synced with this latest version that includes raw string literals and abnfstress caught this. The ["'"] means that the closing single quote is optional so strings such as '\' would be valid. This should be raw-string-escape = escape "'"
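
A rough way to reproduce the problem is to translate both versions of the rule into Python regular expressions. The sketch assumes that escape is a backslash (%x5C), which is not shown in the excerpt above; the pattern names are illustrative.

```python
import re

# Characters allowed unescaped: %x20-26 / %x28-5B / %x5D-10FFFF,
# i.e. everything from space upward except "'" and "\".
PLAIN = r"[\x20-\x26\x28-\x5B\x5D-\U0010FFFF]"

# raw-string-escape = escape ["'"]  (quote after the backslash is optional)
LOOSE_RAW_STRING = re.compile(r"'(?:" + PLAIN + r"|\\'?)*'")

# raw-string-escape = escape "'"    (quote after the backslash is required)
STRICT_RAW_STRING = re.compile(r"'(?:" + PLAIN + r"|\\')*'")

for text in ("'abc'", r"'it\'s'", "'\\'"):
    loose = LOOSE_RAW_STRING.fullmatch(text) is not None
    strict = STRICT_RAW_STRING.fullmatch(text) is not None
    print(f"{text}: loose={loose} strict={strict}")
```

Under the optional form the string '\' is accepted, because the backslash can stand alone and the following quote closes the string. Under the required form the backslash consumes the quote and the string is left unterminated.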

Simplifying grammar and moving to a separate file

d291d11

This commit simplifies the JMESPath grammar and moves the grammar to a separate downloadable file.

mtdowling force-pushed the grammar-updates branch from a486562 to d291d11 on December 8, 2014 02:38

Renaming root to root-expression and wildcard to wildcard-values

5bf09b7

Adding slices, removing pipe/or ambiguity

987a660

mtdowling force-pushed the grammar-updates branch from 29a4433 to d89729e on December 24, 2014 23:05

Fixing precedence of or and pipe

d5d3cac

mtdowling force-pushed the grammar-updates branch from d89729e to d5d3cac on December 24, 2014 23:14

Adding binary and unary expressions.

da2b1b0

Added an "and-expression". Added a "not" expression. not and binary expressions can be at the root level. Filter expressions now support unary conditions. Simplified function argument grammar. Fixed a bug in how literals were described.

mtdowling force-pushed the grammar-updates branch from a9b0bd7 to 8698eb1 on January 3, 2015 21:12

mtdowling added 2 commits January 3, 2015 13:13

Shortening rule names. Removing ambiguities and invalid syntax.

8698eb1

Renaming function-arg* to arg and arg-list

Allow literal array-subexr and fix for unescaped-literal

8b4be8a

Adding whitespace into the grammar

7a82416

mtdowling force-pushed the grammar-updates branch from e782dda to 7a82416 on February 1, 2015 06:00

jamesls reviewed Apr 10, 2015

jamesls reviewed Apr 11, 2015

mtdowling added 2 commits April 11, 2015 12:01

Updating with latest jmespath.clj changes.

d75585e

Incorporates JSON back into the grammar. Much better use of insignificant whitespace. Uses `[optional]` syntax as it is clearer than `1* optional`.

Removing instaparse angle bracket notation

aaed9c2

mtdowling changed the title from "Simplifying grammar and moving to a separate file" to "Updating and moving the grammar to a separate file" on Apr 11, 2015

jamesls mentioned this pull request Apr 12, 2015

JEP 12: Raw string literals #11

Merged

jamesls reviewed Apr 12, 2015

jamesls mentioned this pull request Apr 12, 2015

JEP 11: The let() Function #6

Closed

Updating for JEP 12

c4e7b70

mtdowling mentioned this pull request May 11, 2015

"Could not find artifact jmespath:jmespath:jar:1.0.1 in ..." mtdowling/jmespath.clj#1

Open

jamesls reviewed Jun 9, 2015

springcomp mentioned this pull request Aug 31, 2019

Project status #65

Closed
