Commits · 082f04b5208748be41acca61e1f89290cc686b3e · Sonia Zorba / vollt

Oct 19, 2020
- [ADQL] Fix the "try-fix" feature with regular identifiers. · 082f04b5
  Grégory Mantelet authored Oct 19, 2020
```
Fixes #121
```
  082f04b5
Aug 21, 2020

[ADQL,TAP] Allow to set a simple string as translation pattern for UDFs. · 51691725

Grégory Mantelet authored Aug 21, 2020

This avoids having to extend UserDefinedFunction for UDFs whose translation
is very simple (e.g. a different name in SQL, a different argument order, etc.).

Fixes #115
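
How such a string pattern could be expanded can be sketched as follows. This is only a minimal illustration: the `$1`, `$2`, ... placeholder syntax, the class `UdfPatternSketch` and its `translate` method are assumptions made for this example, not the syntax or API actually introduced by this commit.

```java
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical helper expanding "$1", "$2", ... in a UDF translation pattern. */
final class UdfPatternSketch {
    // Assumed placeholder syntax: a dollar sign followed by a 1-based argument index.
    private static final Pattern ARG_REF = Pattern.compile("\\$(\\d+)");

    /** Replaces every placeholder by the already-translated SQL argument it refers to. */
    static String translate(String pattern, List<String> sqlArgs) {
        Matcher m = ARG_REF.matcher(pattern);
        StringBuffer out = new StringBuffer();
        while (m.find()) {
            int index = Integer.parseInt(m.group(1));
            if (index < 1 || index > sqlArgs.size())
                throw new IllegalArgumentException("No argument #" + index);
            // quoteReplacement() protects '$' and '\' occurring inside the argument itself.
            m.appendReplacement(out, Matcher.quoteReplacement(sqlArgs.get(index - 1)));
        }
        m.appendTail(out);
        return out.toString();
    }
}
```

With this sketch, a different SQL name and a swapped argument order are both expressed by the pattern string alone, e.g. `my_sql_func($2, $1)`.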

51691725

Jul 02, 2019

Revert "[ADQL,TAP] New parser for ADQL-2.1." · f4ffbf1d

Grégory Mantelet authored Jul 02, 2019

This commit reverts commit 89418d13.

The reverted commit will be applied in another branch (probably 'adql-2.1') as
it is part of the next release of ADQL-Lib.

f4ffbf1d

May 10, 2019

[ADQL,TAP] New parser for ADQL-2.1. · 89418d13

Grégory Mantelet authored May 10, 2019

- Now, `ADQLParserFactory.createParser(...)` should be used to create a parser
- Only the new function `LOWER` is supported for the moment
- Not yet possible to manage the optional features _(next dev to come)_
=> 1st step for ADQL-Lib v2.0

- TAP adapted so that it uses the last stable version of the ADQL language
  (i.e. 2.0 for the moment)
  - but it is not yet possible to set the ADQL version to use in the
    configuration file

89418d13

Mar 13, 2019

[ADQL] Add to the parser a function attempting to quickly fix an ADQL query. · 15cd5944

Grégory Mantelet authored Mar 13, 2019

This new function - ADQLParser.tryQuickFix(...) - fixes the most common issues
with ADQL queries:

- replace Unicode confusable characters by their ASCII/UTF-8 version,
- double-quote SQL reserved words/terms (e.g. `public`, `year`, `date`),
- double-quote ADQL function names used as a column name/alias (e.g. `distance`,
  `min`, `avg`),
- double-quote invalid regular identifiers (e.g. `_RAJ2000`, `2mass`).

The last point is far from being perfect but should work at least for
identifiers starting with a digit or an underscore, or identifiers including
one of the following characters: `?`, `!`, `$`, `@`, `#`, `{`, `}`, `[`, `]`,
`~`, `^` and '`'.

It should also be noted that double-quoting a column/table name makes it
case-sensitive. It is therefore possible that the query still does not pass
after the double-quote operation; the case would have to be checked by the user.

Finally, there is no attempt to fix column and table names (i.e. case
sensitivity and/or typos) using tables/columns list/metadata. That could be a
possible evolution of this function or an additional feature to implement in the
parser.
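
The last kind of fix (double-quoting invalid regular identifiers) can be illustrated with a minimal sketch. It assumes the ADQL rule that a regular identifier is an ASCII letter followed by letters, digits or underscores; the class and method names are hypothetical and this is not the code of `ADQLParser.tryQuickFix(...)`. Reserved words and confusable characters are deliberately not handled here.

```java
/** Hypothetical illustration of double-quoting identifiers that are not regular. */
final class QuickFixSketch {
    private static boolean isAsciiLetter(char c) {
        return (c >= 'a' && c <= 'z') || (c >= 'A' && c <= 'Z');
    }

    /** True if the name is a letter followed only by letters, digits or underscores. */
    static boolean isRegularIdentifier(String id) {
        if (id.isEmpty() || !isAsciiLetter(id.charAt(0)))
            return false;
        for (int i = 1; i < id.length(); i++) {
            char c = id.charAt(i);
            if (!(isAsciiLetter(c) || (c >= '0' && c <= '9') || c == '_'))
                return false;
        }
        return true;
    }

    /** Leaves regular identifiers untouched and double-quotes all the others. */
    static String quoteIfNeeded(String id) {
        if (isRegularIdentifier(id))
            return id;
        // Inner double quotes are doubled so that they do not end the identifier.
        return "\"" + id.replace("\"", "\"\"") + "\"";
    }
}
```

For example, `quoteIfNeeded("2mass")` gives `"2mass"` (with the quotes) while `quoteIfNeeded("ra")` stays `ra`. As the message above warns, the quoted name then becomes case-sensitive.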

15cd5944

Mar 05, 2019

[ADQL] Fix the SQL translation of concatenations for MySQL and MS-SQLServer. · e136017d

Grégory Mantelet authored Mar 05, 2019

As @vforchi said:

> The ANSI standard `||` is supported only by Oracle and Postgres: MySQL uses
> `CONCAT` and SQLServer uses `+`.

_This commit resolves the issue #70 ._
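
The difference between the three syntaxes quoted above can be sketched as follows. The `Dialect` enum and the `concat` method are hypothetical names chosen for this illustration; the actual translator classes of the library are not reproduced here.

```java
import java.util.List;

/** Hypothetical illustration of a dialect-dependent concatenation. */
final class ConcatSketch {
    enum Dialect { POSTGRESQL, MYSQL, SQLSERVER }

    /** Joins already-translated SQL operands with the concatenation syntax of the dialect. */
    static String concat(Dialect dialect, List<String> operands) {
        switch (dialect) {
            case MYSQL:
                return "CONCAT(" + String.join(", ", operands) + ")";
            case SQLSERVER:
                return String.join(" + ", operands);
            default:
                // ANSI operator, as used for PostgreSQL in this sketch.
                return String.join(" || ", operands);
        }
    }
}
```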

e136017d

Mar 21, 2018
- [ADQL] Finally, do not test exactly the weirdly encoded character · 4dea14fc
  gmantele authored Mar 21, 2018
```
Follow up to the commits 33a790a4
and 5e0f82de
```
  4dea14fc
- [ADQL] Complete the previous commit 33a790a4 · 5e0f82de
  gmantele authored Mar 21, 2018
  
  5e0f82de
- [ADQL] Fix character encoding in JUnit test for ADQLParser. · 33a790a4
  gmantele authored Mar 21, 2018
  
  33a790a4
- Test: Compare only the first 8 digits in string comparison · 8656c313
  Ole Streicher authored Mar 21, 2018
```
In TestPgSphereTranslator, two strings are compared
containing (double) floating point numbers. These numbers are slightly
different with different Java versions. To overcome this, only the
first eight fractional digits are compared.
```
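
One way to perform such a comparison is sketched below. The regular expression and the names `TruncatedCompareSketch`, `truncateFractions` and `sameUpTo8Digits` are assumptions made for this illustration; the actual change in TestPgSphereTranslator may be written differently.

```java
/** Hypothetical illustration: compare two strings while ignoring digits after the 8th fractional one. */
final class TruncatedCompareSketch {
    /** Cuts every decimal number of the string after its 8th fractional digit. */
    static String truncateFractions(String s) {
        return s.replaceAll("(\\d+\\.\\d{8})\\d+", "$1");
    }

    /** True if both strings are equal once their decimal numbers are truncated. */
    static boolean sameUpTo8Digits(String a, String b) {
        return truncateFractions(a).equals(truncateFractions(b));
    }
}
```

Note that the numbers are truncated, not rounded: two values that straddle a digit boundary (e.g. `0.1234567899` and `0.1234567901`) would still be reported as different by this sketch.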
  8656c313
Jan 12, 2018

[ADQL] Fix the parsing and translation of a concatenation expression. · e4f38c95

gmantele authored Jan 12, 2018

* The parsing did not allow unsigned numerics and SQL SET functions as
  specified in the ADQL 2.0 grammar

* It was even forbidden to put a column whose type is not String.

* The translation of a concatenation expression was always prefixed by the
  ADQLList's name: CONCAT_STR. Of course, no database likes that...

Regarding this last point, this commit fixes the GitHub issue #54

e4f38c95

Nov 30, 2017

[ADQL] Put column aliases in lower case while translating into SQL · 3d96c9d9
gmantele authored Nov 30, 2017
```
if the alias is not delimited in ADQL.

This commit fixes the GitHub issue #56
```
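
The rule can be sketched as follows. The class and method names are hypothetical, and the sketch assumes that aliases are always written between double quotes in the generated SQL, which is an assumption of this example rather than something stated by the commit.

```java
import java.util.Locale;

/** Hypothetical illustration of the alias case rule. */
final class AliasCaseSketch {
    /**
     * Returns the alias as it would be written in SQL:
     * lower-cased unless it was delimited (double-quoted) in the ADQL query.
     */
    static String toSqlAlias(String alias, boolean delimitedInAdql) {
        String name = delimitedInAdql ? alias : alias.toLowerCase(Locale.ROOT);
        return "\"" + name.replace("\"", "\"\"") + "\"";
    }
}
```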
3d96c9d9

[ADQL] Prevent using the real name of a parent table inside subqueries when · d7927d84

gmantele authored Nov 30, 2017

this table is declared with an alias. Instead, the table alias must be used.

Note: This problem occurred only when ADQLParser was used with a DBChecker.

This commit fixes the GitHub issue #53

d7927d84

Nov 10, 2017

[ADQL] Fix escaping of double quotes in delimited identifiers. · 239c7178

gmantele authored Nov 10, 2017

A delimited identifier is any sequence of characters between a pair of
double quotes. For instance: "123 I am a delimited identifier!".

It is of course possible to have double quotes inside this kind of identifier,
but they have to be doubled so as not to be mistaken for the end of the
identifier. For instance: "Cool ""identifier""".

However, this escape option was not taken into account by the ADQL library,
though the same mechanism was already in place for string constants.
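
The escaping mechanism described above can be sketched in a few lines. The class `DelimitedIdentifierSketch` and its methods are hypothetical names for this illustration and do not reproduce the library's own code.

```java
/** Hypothetical illustration of double-quote escaping in delimited identifiers. */
final class DelimitedIdentifierSketch {
    /** Wraps a raw name between double quotes, doubling the inner ones. */
    static String delimit(String rawName) {
        return "\"" + rawName.replace("\"", "\"\"") + "\"";
    }

    /** Reverse operation: strips the surrounding quotes and un-doubles the inner ones. */
    static String undelimit(String delimited) {
        if (delimited.length() < 2 || !delimited.startsWith("\"") || !delimited.endsWith("\""))
            throw new IllegalArgumentException("Not a delimited identifier: " + delimited);
        return delimited.substring(1, delimited.length() - 1).replace("\"\"", "\"");
    }
}
```

Applied to the raw name `Cool "identifier"`, `delimit` produces the form given in the example above, and `undelimit` gives the raw name back.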

239c7178

Sep 13, 2017

[ADQL] Also append an HINT message in the ParseException message when a SQL · fe4c3e97

gmantele authored Sep 13, 2017

reserved word is encountered instead of a column/table/schema name/alias.

Contrary to the previous commit, this time a list of SQL reserved words
has been added into the ADQL grammar. In this way, the parser will ensure that
no word of this list is used in an ADQL query. The raised error is then enriched
with a HINT message stating that this word is part of SQL, is not supported
by ADQL and must be written between double quotes if used as an identifier.

The list of SQL reserved words comes from the ADQL-2.0 standard, after removal
of all potentially used ADQL words, in order to avoid a conflict with the
already existing tokens in the ADQL grammar.

fe4c3e97

[ADQL] Append an HINT message in the ParseException message when an ADQL · db0dfdad

gmantele authored Sep 13, 2017

reserved word is encountered instead of a column/table/schema name/alias.

No list of ADQL reserved words has been added into the ADQL grammar.

However, the ADQL grammar has been slightly changed in order to provide a more
precise location of the REAL wrong part of the query.

Before this commit, if an ADQL reserved word (e.g. 'point') was encountered
outside of its normal syntax (e.g. 'point' not followed by an opening
parenthesis), the next token was highlighted instead of this one. Hence a
confusing error message.

For instance, the following ADQL query:

```sql
SELECT point
FROM aTable
```

returned the following error message:

> Encountered "FROM". Was expecting: "("

Now, it will return the following one:

> Encountered "point". Was expecting one of: "*" <QUANTIFIER> "TOP" [...]
> (HINT: "point" is a reserved ADQL word. To use it as a column/table/schema name/alias, write it between double quotes.)

This error message highlights exactly the source of the problem and even gives
the user a clear explanation of why the query did not parse and how it could
be solved.

db0dfdad

[ADQL] Allow multiple space characters between ORDER/GROUP and BY keywords. · 993ee846
gmantele authored Sep 13, 2017

993ee846

Sep 11, 2017
- [ADQL] Relax JUnit test on an incorrect character in an ADQL query: · caa7f8be
  gmantele authored Sep 11, 2017
```
with a different local charset, the error message will print the incorrect
character differently.
```
  caa7f8be
- [ALL] Add SearchTableApi in ADQLLib, and make functions returning ArrayList · 1a3bd2be
  gmantele authored Sep 11, 2017
```
and HashMap more generic by returning resp. a List and Map instead.
```
  1a3bd2be
Sep 08, 2017

[ADQL] Fix the transformation of NATURAL JOIN and JOIN...USING of MS-SQLServer. · e03e5725

gmantele authored Sep 08, 2017

In the resulting SQL query, if the joined tables have aliases, these
aliases must be used in the ON clause (instead of the full table name).

For instance, the following ADQL query:

```sql
  SELECT *
  FROM tableA AS a NATURAL JOIN tableB AS b;
```

should be translated into the following SQL:

```sql
  SELECT *
  FROM tableA AS a
    INNER JOIN tableB AS b
      ON a.id = b.id
```

This commit completes the resolution of the Pull Request #16
(more details about the issue can be found there)
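
The construction of the ON clause can be sketched as below. The class `NaturalJoinSketch`, its methods and the way the common columns are provided are assumptions of this illustration; the actual SQL Server translator is not reproduced here.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical illustration: building the ON clause replacing a NATURAL JOIN. */
final class NaturalJoinSketch {
    /** The alias must be preferred over the table name when there is one. */
    static String ref(String tableName, String alias) {
        return (alias != null) ? alias : tableName;
    }

    /** Builds one equality per common column, all joined with AND. */
    static String onClause(String leftRef, String rightRef, List<String> commonColumns) {
        List<String> conditions = new ArrayList<>();
        for (String column : commonColumns)
            conditions.add(leftRef + "." + column + " = " + rightRef + "." + column);
        return "ON " + String.join(" AND ", conditions);
    }
}
```

For the query above, `onClause(ref("tableA", "a"), ref("tableB", "b"), ...)` with the single common column `id` yields `ON a.id = b.id`.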

e03e5725

[ADQL] Throwing a ParseException instead of an Error · a382b251

gmantele authored Sep 08, 2017

when an incorrect character that can not be interpreted by
the JavaCC Token Manager is encountered.

Actually, the TokenMgrError thrown by JavaCC is caught by all
ADQLParser.parseQuery(...) functions, wrapped inside a ParseException
which is finally thrown instead of the TokenMgrError. In this way,
ADQL-Lib users just have to care about a single Throwable:
ParseException.

Besides, the error message has been slightly modified from:

> Lexical error at line 1, column 10.  Encountered: "\u00e9" (233), after : \"\"

to:

> Incorrect character encountered at l.1, c.10: \"\\u00e9\" ('é'), after : \"\"

Thus, the error is more user-friendly and easier to understand.
Additionally, the incorrect character is displayed, as before, in its Unicode
expression, but also in its character form (instead of an integer value that
nobody can really understand).

This commit fixes the GitHub issue #17
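
The wrapping pattern described above can be sketched as follows. `LexicalError` and `QueryParseException` are stand-ins invented for this example (they play the roles of JavaCC's TokenMgrError and of ParseException); the toy tokenizer is also an assumption and has nothing to do with the generated token manager.

```java
/** Hypothetical illustration of wrapping a lexer Error into a single checked exception. */
final class ErrorWrappingSketch {
    /** Stand-in for the Error thrown by a generated token manager. */
    static final class LexicalError extends Error {
        LexicalError(String message) { super(message); }
    }

    /** Stand-in for the only Throwable that callers have to care about. */
    static final class QueryParseException extends Exception {
        QueryParseException(String message, Throwable cause) { super(message, cause); }
    }

    /** Toy tokenizer: rejects any non-ASCII character with an Error. */
    static void tokenize(String query) {
        for (int i = 0; i < query.length(); i++) {
            char c = query.charAt(i);
            if (c > 127)
                throw new LexicalError("Incorrect character encountered at c." + (i + 1) + ": '" + c + "'");
        }
    }

    /** Entry point: the Error is caught and re-thrown as a QueryParseException. */
    static void parse(String query) throws QueryParseException {
        try {
            tokenize(query);
        } catch (LexicalError err) {
            // Keep the original error as the cause so that no information is lost.
            throw new QueryParseException(err.getMessage(), err);
        }
    }
}
```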

a382b251

Jun 01, 2017

[ADQL] Fix nasty infinite loop when wrapping matches with SimpleReplaceHandler. · 66304427

gmantele authored Jun 01, 2017

This infinite loop occurred only when the replacement object was just
a wrapping of the matching object; after replacement, the new object was
inspected for matching objects.

Example: infinite loop if we want to wrap all foo(...) functions with
         the function ROUND in the following query:
    SELECT foo(foo(123)) FROM myTable
Expected result:
    SELECT ROUND(foo(ROUND(foo(123)))) FROM myTable
But generated result was:
    SELECT ROUND(ROUND(ROUND(......foo(foo(123))))) FROM myTable
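
A way to avoid this kind of loop is sketched below on a tiny expression tree: the children of a match are processed first, and the freshly created wrapper is returned without being inspected again. The `Node` class and the `wrapAll` method are hypothetical and much simpler than SimpleReplaceHandler; this is not the fix applied by the commit, only an illustration of the idea.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Hypothetical illustration of a wrapping replacement that terminates. */
final class WrapReplaceSketch {
    /** A function call (args != null) or a literal (args == null). */
    static final class Node {
        final String name;
        final List<Node> args;

        Node(String name, List<Node> args) { this.name = name; this.args = args; }

        @Override public String toString() {
            if (args == null) return name;
            List<String> parts = new ArrayList<>();
            for (Node arg : args) parts.add(arg.toString());
            return name + "(" + String.join(", ", parts) + ")";
        }
    }

    static Node literal(String text) { return new Node(text, null); }

    static Node call(String name, Node... args) {
        return new Node(name, new ArrayList<>(Arrays.asList(args)));
    }

    /** Wraps every call to 'target' inside a call to 'wrapper'. */
    static Node wrapAll(Node node, String target, String wrapper) {
        if (node.args == null) return node;
        // 1. Process the children of the ORIGINAL node first.
        List<Node> newArgs = new ArrayList<>();
        for (Node arg : node.args) newArgs.add(wrapAll(arg, target, wrapper));
        Node rebuilt = new Node(node.name, newArgs);
        // 2. Wrap if needed; the wrapper is returned as is, never re-inspected.
        return node.name.equals(target) ? call(wrapper, rebuilt) : rebuilt;
    }
}
```

On the expression `foo(foo(123))`, `wrapAll(..., "foo", "ROUND")` produces `ROUND(foo(ROUND(foo(123))))`, i.e. the expected result given above.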

66304427

May 10, 2017
- [ADQL] Fix a NullPointerException in SearchColumnOutsiteGroupByHandler.gotInto. · 232bf9c9
  gmantele authored May 10, 2017
```
See the test case TestDBChecker.testClauseADQLWithNameNull() for more details.
```
  232bf9c9
Apr 20, 2017
- [ADQL] Grouping by a SELECTed item's alias was not possible any more · 225c49e1
  gmantele authored Apr 20, 2017
```
since the commit 8e2fa9ff.
```
  225c49e1
Apr 04, 2017

[ADQL] Complete commit "Re-Fix GROUP BY's columns handling" · 8e2fa9ff

gmantele authored Apr 04, 2017

(https://github.com/gmantele/taplib/commit/7a70c6038cef460ab169682bed391bb5ae1de1e9)

It was not possible to use a GROUP BY with a qualified column name.
So now, a GROUP BY is a ClauseADQL<ADQLColumn> instead of
a ClauseADQL<ColumnReference>. Indeed, according to the ADQL BNF,
GROUP BY items are only columns as they would appear in the SELECT
clause (i.e. qualified or not, delimited or not). On the other
hand, an ORDER BY accepts ONLY a column index or a non-qualified column
name/alias.

The class ColumnReference is kept for backward compatibility (or in
case the next version of the ADQL grammar makes items of GROUP BY and
ORDER BY of the same type: index or qualified column). Besides, this
class is still extended for the ORDER BY clause items
(see adql.query.ADQLOrder).

8e2fa9ff

[ADQL] Update a comment in TestFunctionDef · e3be1964
gmantele authored Apr 04, 2017

e3be1964

Apr 03, 2017
- [ADQL] Support J2000 as STC-S' frame (in addition of ICRS, FK4, FK5, ...). · bae21b07
  gmantele authored Apr 03, 2017
  
  bae21b07
- [ADQL] Re-Fix GROUP BY's columns handling: · 7a70c603
  gmantele authored Apr 03, 2017
```
a qualified column name should be allowed, but still no column index should be.
```
  7a70c603
Mar 15, 2017
- removed non ASCII character · 56d6d774
  vforchi authored Mar 15, 2017
  
  56d6d774
Mar 10, 2017
- [ADQL] Fix handling of delimited column references (e.g. items of ORDER BY). · 118f357a
  gmantele authored Mar 10, 2017
```
This error was raised in the issue #32 by Zarquan.
```
  118f357a
Mar 02, 2017
- [ADQL] Fix Error when a query ends by a comment with no ending new line. · 9bc530cd
  gmantele authored Mar 02, 2017
  
  9bc530cd
Feb 22, 2017

[ADQL] Follow-up to the previous commit on CENTROID: · 3306decd

gmantele authored Feb 22, 2017

the automatic datatype detection was missing for CENTROID functions.
--
Additionally, some JUnit test files of the `adql` package have been moved to
the correct location.

3306decd

Feb 20, 2017
- [ADQL] Fix CentroidFunction's type and its PgSphere translation. · 61063906
  gmantele authored Feb 20, 2017
  
  61063906
Sep 20, 2016

[ADQL] Fix the tree generated by the parsing of NATURAL JOINs. · 7ca49f81

gmantele authored Sep 20, 2016

The "normal" JOIN:
    A JOIN B ON A.id = B.id JOIN C ON B.id = C.id
is correctly interpreted as:
    ( (A JOIN B ON A.id = B.id) JOIN C ON B.id = C.id )
But with a NATURAL JOIN, the tree is mirrored:
    A NATURAL JOIN B NATURAL JOIN C
gives:
    ( A NATURAL JOIN (B NATURAL JOIN C) )
instead of:
    ( (A NATURAL JOIN B) NATURAL JOIN C )
This is not a problem when the SQL translation is identical to the ADQL
expression, but for some DBMS a conversion into an INNER JOIN ON is necessary
and in this case we got the following SQL:
    A JOIN B JOIN C ON A.id = B.id ON B.id = C.id
which seems to work, but is syntactically strange.

This commit should fix the generated tree. A "normal" JOIN and a NATURAL JOIN
should now have the same form. A JUnit test has been added into TestADQLParser
to check that: testJoinTree().
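
The expected (left-associative) shape of the tree can be illustrated with a short sketch. The class `JoinTreeSketch` and the parenthesised string representation are assumptions of this example; the real tree is made of ADQL join objects, not strings.

```java
import java.util.List;

/** Hypothetical illustration of a left-deep join tree. */
final class JoinTreeSketch {
    /** Folds the tables from left to right: ((A op B) op C) and so on. */
    static String leftDeep(List<String> tables, String joinKeyword) {
        if (tables.isEmpty())
            throw new IllegalArgumentException("At least one table is required");
        String tree = tables.get(0);
        for (int i = 1; i < tables.size(); i++)
            tree = "(" + tree + " " + joinKeyword + " " + tables.get(i) + ")";
        return tree;
    }
}
```

With the tables A, B and C, both `JOIN` and `NATURAL JOIN` then give the same form, e.g. `((A NATURAL JOIN B) NATURAL JOIN C)`.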

7ca49f81

Jul 13, 2016

[ADQL] Fix a sub-query bug. No DBTable was set on an ADQLTable having a · 04a00fe2

gmantele authored Jul 13, 2016

sub-query. Without this information, it was impossible to resolve columns
making reference to sub-queries of the clause FROM. See the JUnit test case
for a concrete example.
(error raised by Hendrik Heinl - ARI/GAVO)

04a00fe2

[ALL] Restore some sleeping JUnit tests + Allow a reset of custom types · 2c79eb6f

gmantele authored Jul 13, 2016

in DBType.DBDatatype (for UNKNOWN and UNKNOWN_NUMERIC). This reset is performed
after each JUnit test setting a special custom value (otherwise it prevents
other JUnit tests from running correctly)

2c79eb6f

May 25, 2016

[ADQL] Fix recursive replacement using SimpleReplaceHandler. · ad2acca3

gmantele authored May 25, 2016

Before this correction, if an ADQLObject (e.g. a function or a sub-query)
contained another ADQLObject and both (i.e. parent and child) were matching in
a SimpleReplaceHandler and were asked to be replaced, only the parent
seemed to have been replaced. Actually, the child was replaced too, but
in the former instance of the parent; so its replacement was not
visible in the final query.

For instance, if all mathematical functions must be replaced by a dumb UDF
named 'foo' in the ADQL query:
        "SELECT sqrt(abs(81)) FROM myTable"
the result should be:
        "SELECT foo(foo(81)) FROM myTable"
but before this correction it was:
        "SELECT foo(abs(81)) FROM myTable".

ad2acca3

Apr 20, 2016
- [ADQL] Adapt the JUnit test case for ADQLParser according to the last commit. · 9a0f1022
  gmantele authored Apr 20, 2016
  
  9a0f1022
Mar 17, 2016

[ADQL] Add an ADQL translator for MS SQL Server. This particular translator · a2fb29ad

gmantele authored Mar 17, 2016

deals with NATURAL JOINs and JOINs using the keyword USING so that they are
supported by SQL Server. Basically, they are translated as a list of ON
conditions.
Warning: This translator is only guaranteed to solve the NATURAL and USING
issue. Support for datatype conversion and case sensitivity has to be
reviewed. Besides, no geometrical function is translated for SQL Server.

a2fb29ad

Mar 04, 2016

[ADQL] Set a type to a query's resulting column when it is not originally a column. · 0003e343

gmantele authored Mar 04, 2016

This is easily possible for concatenations, string constants and User Defined
Functions having a FunctionDef. A new special datatype was needed for
numeric functions and operations: UNKNOWN_NUMERIC. This special type
cannot be set with FunctionDef.parse(...) and it behaves exactly like the type
UNKNOWN, except that DBType.isNumeric() returns true (as does .isUnknown()).
Thus, while writing the metadata of a result in TAP, nothing changes:
an UNKNOWN_NUMERIC type is processed like an UNKNOWN type, i.e. either the
type returned from the database ResultSet is used or VARCHAR is set
(no modification of TAP was needed for that).
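
The behaviour of this special type can be sketched as follows. The enum, the static methods and especially the `resolve` fallback order (ResultSet type if available, VARCHAR otherwise) are assumptions made for this illustration; they are not the actual DBType implementation.

```java
/** Hypothetical illustration of a numeric flavour of the UNKNOWN datatype. */
final class UnknownNumericSketch {
    enum Datatype { VARCHAR, INTEGER, DOUBLE, UNKNOWN, UNKNOWN_NUMERIC }

    /** Both special types are reported as unknown. */
    static boolean isUnknown(Datatype type) {
        return type == Datatype.UNKNOWN || type == Datatype.UNKNOWN_NUMERIC;
    }

    /** UNKNOWN_NUMERIC is numeric; plain UNKNOWN is not (in this sketch). */
    static boolean isNumeric(Datatype type) {
        return type == Datatype.INTEGER || type == Datatype.DOUBLE
            || type == Datatype.UNKNOWN_NUMERIC;
    }

    /** Type written in the result metadata (assumed fallback order). */
    static Datatype resolve(Datatype declared, Datatype fromResultSet) {
        if (!isUnknown(declared)) return declared;
        return (fromResultSet != null) ? fromResultSet : Datatype.VARCHAR;
    }
}
```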

0003e343