Parse exponent literal as number #768

Jefffrey · 2022-12-22T10:42:28Z

Resolve part of #610

Allows parsing of exponent literals as numbers, provided the dialect does not allow an identifier to begin with a number (as the Hive dialect does). That specific case will need more thought.

alamb · 2022-12-28T13:12:15Z

src/test_utils.rs

@@ -144,6 +144,7 @@ pub fn all_dialects() -> TestedDialects {
            Box::new(RedshiftSqlDialect {}),
            Box::new(MySqlDialect {}),
            Box::new(BigQueryDialect {}),
+            Box::new(SQLiteDialect {}),


alamb · 2022-12-28T13:16:09Z

src/tokenizer.rs

@@ -617,6 +618,36 @@ impl<'a> Tokenizer<'a> {
                        return Ok(Some(Token::Period));
                    }

+                    // Parse exponent as number
+                    if chars.peek() == Some(&'e') || chars.peek() == Some(&'E') {
+                        let mut char_clone = chars.peekable.clone();


Why is this copy needed?

Given chars is already peekable I don't see why it can't be used directly

I needed a way to peek more than just the next char, since it is only a valid exponent if the e is followed by an optional sign and an actual number. The easiest way I found was to clone the iterator and look ahead on the clone; if it turns out not to be an exponent, the clone is simply discarded and regular behaviour continues with the original iterator.
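The clone-and-look-ahead approach described above can be sketched in isolation. This is a minimal illustration, not the PR's actual code: it works on a plain std `Peekable<Chars>` rather than the tokenizer's own state, and `take_exponent` is a hypothetical helper name.

```rust
use std::iter::Peekable;
use std::str::Chars;

/// Hypothetical helper (not taken from the PR): if `chars` is positioned at a
/// valid exponent suffix (`e`/`E`, optional sign, at least one digit), consume
/// it and return it. Otherwise return `None` and leave `chars` untouched.
fn take_exponent(chars: &mut Peekable<Chars<'_>>) -> Option<String> {
    // Cheap early exit: the next char must be 'e' or 'E'.
    if !matches!(chars.peek().copied(), Some('e') | Some('E')) {
        return None;
    }

    // Look ahead on a clone, so a failed match costs nothing on the original.
    let mut lookahead = chars.clone();
    let mut exponent = String::new();
    exponent.push(lookahead.next()?); // the 'e' or 'E'

    // Optional sign.
    if let Some(sign) = lookahead.peek().copied() {
        if sign == '+' || sign == '-' {
            exponent.push(sign);
            lookahead.next();
        }
    }

    // At least one digit is required for this to be an exponent.
    let mut digits = 0;
    while let Some(c) = lookahead.peek().copied() {
        if !c.is_ascii_digit() {
            break;
        }
        exponent.push(c);
        lookahead.next();
        digits += 1;
    }
    if digits == 0 {
        return None; // e.g. "e", "e-", "ex": not an exponent, original untouched
    }

    // Commit: replace the original iterator with the advanced clone.
    *chars = lookahead;
    Some(exponent)
}
```

Cloning is cheap here because `Chars` is only a view into the underlying string, so the clone copies a cursor rather than the text.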

alamb · 2022-12-28T13:20:17Z

src/tokenizer.rs

+            Token::Comma,
+            Token::Whitespace(Whitespace::Space),
+            Token::Number(String::from("1e-10"), false),
+            Token::make_word("a", None),


I found it very strange that a new token is formed without whitespace after a number. I expected this to be a token error, but this implementation agrees with postgres 🤯

postgres=# select 12e-10a;
      a
--------------
 0.0000000012
(1 row)

postgres=# select 12e-10 a;
      a
--------------
 0.0000000012
(1 row)

Likewise

postgres=# select 1e-10-10;
   ?column?
---------------
 -9.9999999999
(1 row)

postgres=# select 1e-10 -10;
   ?column?
---------------
 -9.9999999999
(1 row)

🤯

I believe this behaviour is part of what bit me when trying to implement for Hive dialect 😅
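The token boundaries discussed in this thread can be reproduced with a toy lexer. This is only a sketch under simplifying assumptions: `Tok` and `tokenize` are hypothetical names, not sqlparser-rs types. Whitespace is dropped, decimals are ignored, and identifiers may not start with a digit (so a Hive-like dialect is deliberately out of scope).

```rust
/// Hypothetical token type for illustration only.
#[derive(Debug, PartialEq)]
enum Tok {
    Number(String),
    Word(String),
    Minus,
    Other(char),
}

fn tokenize(input: &str) -> Vec<Tok> {
    let mut chars = input.chars().peekable();
    let mut out = Vec::new();
    while let Some(c) = chars.peek().copied() {
        if c.is_ascii_digit() {
            // Integer part.
            let mut s = String::new();
            while let Some(d) = chars.peek().copied().filter(char::is_ascii_digit) {
                s.push(d);
                chars.next();
            }
            // Try an exponent on a clone; commit only if digits follow.
            let mut look = chars.clone();
            if let Some(e) = look.next().filter(|ch| *ch == 'e' || *ch == 'E') {
                let mut exp = String::from(e);
                if let Some(sign) = look.peek().copied().filter(|ch| *ch == '+' || *ch == '-') {
                    exp.push(sign);
                    look.next();
                }
                let before = exp.len();
                while let Some(d) = look.peek().copied().filter(char::is_ascii_digit) {
                    exp.push(d);
                    look.next();
                }
                if exp.len() > before {
                    s.push_str(&exp);
                    chars = look;
                }
            }
            // The number ends here; whatever follows starts a new token.
            out.push(Tok::Number(s));
        } else if c.is_alphabetic() || c == '_' {
            let mut w = String::new();
            while let Some(a) = chars.peek().copied().filter(|ch| ch.is_alphanumeric() || *ch == '_') {
                w.push(a);
                chars.next();
            }
            out.push(Tok::Word(w));
        } else {
            chars.next();
            if c == '-' {
                out.push(Tok::Minus);
            } else if !c.is_whitespace() {
                out.push(Tok::Other(c));
            }
        }
    }
    out
}
```

With this sketch, `tokenize("1e-10a")` yields a number followed by the word `a`, and `tokenize("1e-10-10")` yields a number, a minus, and a second number, which matches the postgres output quoted above. An input such as `1ea` falls back to the number `1` followed by the word `ea`, because no digit follows the `e`.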

alamb · 2022-12-28T13:22:46Z

Thank you @Jefffrey

coveralls · 2022-12-28T13:25:23Z

Pull Request Test Coverage Report for Build 3757065788

65 of 69 (94.2%) changed or added relevant lines in 2 files are covered.
No unchanged relevant lines lost coverage.
Overall coverage increased (+0.04%) to 86.373%

Changes Missing Coverage	Covered Lines	Changed/Added Lines	%
src/tokenizer.rs	43	45	95.56%
tests/sqlparser_common.rs	22	24	91.67%

Totals
Change from base Build 3720056662:	0.04%
Covered Lines:	12835
Relevant Lines:	14860

💛 - Coveralls

Parse exponent literal as number

fd124d4

alamb approved these changes Dec 28, 2022

alamb merged commit 2c20ec0 into sqlparser-rs:main Dec 28, 2022

Jefffrey deleted the 610_exponent_literals branch December 28, 2022 19:56

This was referenced Jan 2, 2023

support scientific notation for SQL literals apache/datafusion#3448

Closed

[Datafusion] Datafusion queries involving a column name that begins with a number produces unexpected results apache/datafusion#108

Closed

Jefffrey mentioned this pull request Feb 6, 2023

Incorrect parsing literals that consisting of digits and letters, beginning with digits #804

Closed
