Implement SQL writer #18

andygrove · 2018-09-08T12:59:45Z

It is often useful to be able to generate SQL from the AST. This opens up possibilities for SQL manipulation or translation. The write should recursively walk the AST and write SQL to a writer / output stream.

A secondary benefit of having a writer is that it often simplifies writing unit tests for the parser, since a SQL statement can be parsed and re-written and then the output can be compared with the input (ignoring whitespace differences)

andygrove · 2018-10-06T15:44:32Z

This is partially implemented already by impl ToString for ASTNode but this needs to move into the dialect specific code as each dialect will write SQL differently.

mitar · 2019-11-21T23:41:09Z

I second that a lot! I would love to have a way to translate SQL queries.

ivanceras · 2020-01-01T03:34:24Z

I'm thinking of initial design on how to implement a sql writer for each specific dialect.
I believe the easy way to do it, is to make a trait for each of this specific dialect which also leverage
a call to the ToString to the AST when it is not a dialect specific syntax. Also, we can add a little functionality to extract out all the values into a Vec<Value> to be later used in parameterized query in order to avoid SQL injection attacks.

Here is a POC code I'm testing.

pub trait ToPostgreSQL: ToString {
    fn to_postgresql(&self, args: &mut usize) -> (String, Vec<Value>) {
        (self.to_string(), vec![])
    }
}


impl ToPostgreSQL for Value {
    fn to_postgresql(&self, args: &mut usize) -> (String, Vec<Value>) {
        let ret = (format!("${}", args), vec![self.clone()]);
        *args += 1;
        ret
    }
}

impl ToPostgreSQL for DataType {
    fn to_postgresql(&self, args: &mut usize) -> (String, Vec<Value>) {
        match self {
            DataType::Timestamp => ("timestamp with time zone".into(), vec![]),
            _ => (self.to_string(), vec![]),
        }
    }
}

jmzagorski · 2020-01-25T21:05:55Z

I think this would be great if i understand the goal. I inherited a mess of sql statements for a warehouse and my goal is to create a program to format all queries to one standard, using your lib as the parser. Right now I find myself writing a lot of the same sql as you did in the fmt::Display implementations so I can format a few expression/identifiers based on configurations. Most of my other formatting is at the clause level

Dandandan · 2020-06-03T18:00:32Z

I would like to work on this, as I am working on a tool that needs to export queries to different.

It would be an awesome feature actually to write queries in one SQL dialect and to also convert it (as far as possible of course) to another dialect.

I think the suggestion above makes sense to have a to_postgresql etc. For each dialect. I am not sure about the need to extract Vec<Value> ?

I am also not sure whether it also needs to do (any) formatting. I think that can be out of scope and maybe more for a external library like https://github.com/maxcountryman/forma/blob/master/formation/benches/formation_bench.rs

nickolay · 2020-06-03T21:09:49Z

Frankly, I don't understand the to_postgresql suggestion.

FWIW, my instinct is to think about translations between SQL dialects as of AST transformations, rather than different serializations, but It's hard to think about this without looking at a corpus of translations that need to be supported.

Pretty printing an AST like ours is definitely out of scope, but for tasks requiring minor fixups to the SQL, one could want to retain original formatting and comments as much as possible (cf. #175).

AugustoFKL · 2022-11-03T20:19:55Z

@alamb @nickolay @andygrove do you still think this is a reasonable issue?

Although seems quite useful, possibly would demand a lot of extra maintaining structure, which I don't think the project has right now.

alamb · 2022-11-03T20:56:06Z

There is basically a SQL writer now that converts from an AST to SQL. I didn't read this entire ticket so I am not sure what else is being proposed. Perhaps we can close it as "done" 🤔

AugustoFKL · 2022-11-03T21:11:53Z

@alamb this considers dialect specific implementation as well.

Not sure if that's really feasible. This approach (dialect specific serialisation) was already discussed in another moments and, as always, we agree that that's too much additional code, to such small gains.

aljazerzen · 2022-11-30T12:04:11Z

We use the SQL writer for PRQL and it does work well, but not for dialects as you pointed out. So we have code on our side that checks how idents should be quoted and whether TOP or LIMIT should be used.

So I consider sqlparser-rs to have an SQL writer, just not SQL dialect translator. So this issue could be closed.

All that said, I came here looking for a fix on VALUES in MySQL which requires ROW. But this is an issue targeting both parsing and writing.

…he#18) A customer needs these fixes now, so need to merge these directly into our fork. When the upstream PRs land we'll revert this and incorporate that version in the next release merge.

andygrove added this to the Milestone 1 milestone Oct 6, 2018

mitar mentioned this issue Nov 21, 2019

Support case sensitive parsing #144

Closed

nickolay mentioned this issue Jun 7, 2020

Lossless syntax tree prototype and discussion #189

Closed

This was referenced Dec 3, 2021

Add Span to Tokens, AST nodes #161

Closed

Data model or table summarizing equivalent key - SQL statement pairs #371

Open

serprex pushed a commit to serprex/sqlparser-rs that referenced this issue Nov 6, 2023

Schema mapping support (apache#18)

c8b469d

devinjdangelo mentioned this issue Feb 29, 2024

feat: add support for producing SQL in multiple dialects datafusion-contrib/datafusion-federation#22

Merged

tv42 mentioned this issue Nov 13, 2024

Feature Request: "Unparsing" #674

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement SQL writer #18

Implement SQL writer #18

andygrove commented Sep 8, 2018 •

edited

Loading

andygrove commented Oct 6, 2018

mitar commented Nov 21, 2019

ivanceras commented Jan 1, 2020

jmzagorski commented Jan 25, 2020

Dandandan commented Jun 3, 2020

nickolay commented Jun 3, 2020

AugustoFKL commented Nov 3, 2022

alamb commented Nov 3, 2022

AugustoFKL commented Nov 3, 2022

aljazerzen commented Nov 30, 2022

Implement SQL writer #18

Implement SQL writer #18

Comments

andygrove commented Sep 8, 2018 • edited Loading

andygrove commented Oct 6, 2018

mitar commented Nov 21, 2019

ivanceras commented Jan 1, 2020

jmzagorski commented Jan 25, 2020

Dandandan commented Jun 3, 2020

nickolay commented Jun 3, 2020

AugustoFKL commented Nov 3, 2022

alamb commented Nov 3, 2022

AugustoFKL commented Nov 3, 2022

aljazerzen commented Nov 30, 2022

andygrove commented Sep 8, 2018 •

edited

Loading