Auto-formatting tweaks #47299

pugnascotia · 2019-09-30T14:51:41Z

Follow-up to #46745. Tweak the Eclipse formatter config. It's hard to see what different is from the XML, so here's a summary:

Split method, constructor etc arguments into separate lines when they wrap
Enable "joined wrapped lines", which basically means the formatter will ignore existing line breaks and join and re-wrap statements. I had originally hesitated about enabling this, but without it the codebase's style won't be as consistent as it should be, defeating the point of all this formatting business.

Also tweak some source files so that they aren't adversely affected when we finally format the source.

Convert some ASCII art diagrams that were in line quotes into to block quotes, which the formatter is configured to ignore.
Apply some formatter off/on comments to shield some blocks. There are just a few places I happened to notice.

elasticmachine · 2019-09-30T14:51:43Z

Pinging @elastic/es-core-infra

rjernst

I left some comments

rjernst · 2019-09-30T22:53:15Z

libs/ssl-config/src/main/java/org/elasticsearch/common/ssl/SslConfigurationLoader.java

@@ -68,6 +68,7 @@

    static final List<String> DEFAULT_PROTOCOLS = List.of("TLSv1.3", "TLSv1.2", "TLSv1.1");

+    // @formatter:off


This seems like an important tool we should document in our CONTRIBUTING.md? (separate from this PR)

I can add it to CONTRIBUTING.md, I should have adding it in the last PR to be honest.

rjernst · 2019-09-30T22:54:21Z

server/src/main/java/org/apache/lucene/queries/BinaryDocValuesRangeQuery.java

-                // part of the other range must touch this range
-                // this:    |---------------|
-                // other:               |------|
+                /*


Why is this converted to a multiline comment? Is there some problem with using several single line comments with the formatter?

I observed the formatted mangling these diagrams, hence the change. You can ask the formatter to process block and javadoc comments, but I decided that wasn't worth it.

rjernst · 2019-09-30T22:58:53Z

server/src/test/java/org/elasticsearch/search/suggest/SuggestionTests.java

-                                + "\"collate_match\":true}]"
-                            + "}]"
-                    + "}", xContent.utf8ToString());
+                // @formatter:off


We have lines like this in a lot of tests. Is there anything we can do to improve this without resorting to manually turning off the formatter?

There are options, depending on the code in question.

JSON snippets

For this example, we could include whitespace:

String suggestionString = "{" + "\"unknownType#suggestionName\": [" + " {" + " \"text\": \"entryText\"," + " \"offset\": 42," + " \"length\": 313," + " \"options\": [{" + " \"text\": \"someText\"," + " \"highlighted\": \"somethingHighlighted\"," + " \"score\": 1.3," + " \"collate_match\": true" + " }]" + " }]" + "}";

With Java 13, we could use text blocks:

String suggestionString = """ { "unknownType#suggestionName": [ { "text": "entryText", "offset": 42, "length": 313, "options": [{ "text": "someText", "highlighted": "somethingHighlighted", "score": 1.3, "collate_match": true }] }] } """;

We could also make these snippets more readable by using an ObjectMapper with some lenient settings, e.g:

public static String asJson(final String rawJson) throws IOException { ObjectMapper mapper = new ObjectMapper(); mapper.configure(JsonParser.Feature.ALLOW_UNQUOTED_FIELD_NAMES, true); mapper.configure(JsonParser.Feature.ALLOW_SINGLE_QUOTES, true); final JsonNode jsonNode = mapper.readTree(rawJson); return mapper.writeValueAsString(jsonNode); }

Then we can write the JSON like so:

String suggestionString = asJson("{" + "'unknownType#suggestionName': [" + " {" + " text: 'entryText'," + " offset: 42," + " length: 313," + " options: [{" + " text: 'someText'," + " highlighted: 'somethingHighlighted'," + " score: 1.3," + " collate_match: true" + " }]" + " }]" + "}");

Depend how wild we're feeling, we could define these snippets in YAML, and convert to JSON, e.g.:

String suggestionYaml = yamlToJson( "unknownType#suggestionName:\n" + " - text: entryText\n" + " offset: 42\n" + " length: 313\n" + " options:\n" + " - text: someText\n" + " highlighted: somethingHighlighted\n" + " score: 1.3\n" + " collate_match: true\n");

Though I expect that'll look better with text blocks, once they're available:

String suggestionYaml = yamlToJson(""" unknownType#suggestionName: - text: entryText offset: 42 length: 313 options: - text: someText highlighted: somethingHighlighted score: 1.3 collate_match: true """);

Aligned code

The changes to SslConfigurationLoader.java are a case where strings and comments are arranged in meaningful rows. I think this is a case where just exempting the code is the best approach.

On that subject, however, the Painless module is interesting because it has code that has vertical alignment. For example:

DefMath.java

MethodWriter.java

Operation.java

WriterConstants.java

I can't decide whether we should turn off the formatter for these. What do you think?

Also, Spotless doesn't appear to have a way to exempt files from formatting, so it formats all the source that ANTLR generates too. Can't do much about that.

@rjernst do you want me to do anything here? I could go looking for more code whose format we want to preserve?

Long term I think we will want to use text blocks, but it will be a long time before we can do that. I wouldn't change to yaml, since we are testing json. Ideally these would actually be converted to xcontentbuilder, since that has earlier error checking (eg invalid json would get caught when running the code, rather than in the request parsing the json). My concern here is starting this precedent of arbitrarily turning off the formatter. Using column aligned strings seems like the best option for now, but we should probably change those in separate PRs from any enabling auto formatting, so that the formatting PRs can remain completely automatic and mechanical in nature, thus simplifying reviewing.

For cases where the formatter was disabled around JSON snippets, embed more whitespace within the strings to as to format the JSON in a way that the formatter won't break. Also, change the formatter's enable/disables tags to that the documentation comments i.e "// tag::NAME" and "// end::NAME" still work. This means that such snippets won't be automatically formatted, but since the formatter can't be configured different for specific regions in a file, it's the best we can do.

Make a number of code changes so that when the codebase is automatically formatted, the code doens't end up looking mangled. Mostly this is: - Moving comments - Joining multiline strings that fit on one line - Formatting JSON so that indentation is inside the strings - This includes stripping out additional whitespace in some cases, where the test does an exact comparison with JSON from the API Also reformat XML documents in the SAML tests, using a formatter that can handle named parameters. This significantly improves legibility.

pugnascotia · 2019-10-23T08:34:41Z

@rjernst I've reworked the code where I'd disabled the formatter, and then carried on to audit the repository post-formatting. I've made a number of further changes so that some code doesn't appear mangled after formatting. Mostly this is:

Moving comments around
Joining multiline strings that fit on one line (I didn't do all these, there were a lot)
Formatting JSON so that indentation is inside the strings
- This includes stripping out additional whitespace in some cases,
  where the test does an exact comparison with JSON from the API. I added a method to XContentHelper for this.

I also reformatted the XML documents in the SAML tests, using a formatter that
can handle named parameters. This significantly improved legibility.

There are still some other files that could use some love, for example the geo parser tests. They're building up documents via XContent, and since the formatter sees these as a long sequence of method calls, that exactly how they're formatted. I think we're better off just using JSON here - it'd be significantly terser, and would retain indentation.

pugnascotia · 2019-10-23T08:35:06Z

@elasticmachine test this please

rjernst · 2019-10-23T21:31:07Z

Thanks for working through the formatter disabling issues.

Something I've realized upon looking back at this is the enormous size in conjunction with several changes being made to the format. Would it be possible to separate out each of the listed tweaks in the description to their own PRs? If a PR is 1k lines, it is generally difficult to adequately review, and this PR is almost 4k edits. I understand these tweaks will necessarily touch a lot of code and be large, but it is much easier to scroll through a large PR when there is a singular purpose, and thus one typical pattern to visually look for.

pugnascotia · 2019-10-24T09:11:28Z

@rjernst sure, the PR wasn't really supposed to be this size. I'll split up the changes and list them here for posterity.

Refine the auto-formatting settings (Refine the auto-formatting settings #48446)
Improve resiliency to auto-formatting in client ([7.x] Add populate_user_metadata in OIDC realm (#48357) #48438)
Improve resiliency to auto-formatting in libs, modules (Improve resiliency to auto-formatting in libs, modules #48448)
Improve resiliency to auto-formatting in server (Improve resiliency to auto-formatting in server #48450)
Improve resiliency to auto-formatting in x-pack monitoring (pending)
Improve resiliency to auto-formatting in x-pack (pending)
Improve SAML tests resiliency to auto-formatting (Improve SAML tests resiliency to auto-formatting #48452)

pugnascotia added 3 commits September 30, 2019 10:08

Change comment style to preserve ASCII art

7bf64a2

Proof some code against the formatter

b50783b

Refine code formatter settings

568e48d

pugnascotia added :Core/Infra/Core Core issues without another label v8.0.0 labels Sep 30, 2019

rjernst reviewed Sep 30, 2019

View reviewed changes

pugnascotia added 6 commits October 1, 2019 14:41

Note how to disable formatting in CONTRIBUTING.md

c34c973

Fix missing backtick in CONTRIBUTING.md

ac88244

Merge remote-tracking branch 'upstream/master' into formatting-tweaks

5d804b3

Merge remote-tracking branch 'upstream/master' into formatting-tweaks

1ae31b9

pugnascotia added 8 commits October 23, 2019 12:22

Test fix

a4c1ffd

Put NamedFormatter in a better location

97a28d6

Fix forbidden APIs

c05a87e

Checkstyle exception

5c32bca

Linting

d37d4e8

Test fixes

03ce841

More tests and checkstyle

faa2573

Lint

2e02711

pugnascotia closed this Oct 24, 2019

jakelandis added v8.0.0-alpha1 and removed v8.0.0 labels Jul 26, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Auto-formatting tweaks #47299

Auto-formatting tweaks #47299

pugnascotia commented Sep 30, 2019

elasticmachine commented Sep 30, 2019

rjernst left a comment

rjernst Sep 30, 2019

pugnascotia Oct 1, 2019

rjernst Sep 30, 2019

pugnascotia Oct 1, 2019

rjernst Sep 30, 2019

pugnascotia Oct 1, 2019

pugnascotia Oct 8, 2019

rjernst Oct 10, 2019 •

edited

Loading

pugnascotia commented Oct 23, 2019

pugnascotia commented Oct 23, 2019

rjernst commented Oct 23, 2019

pugnascotia commented Oct 24, 2019 •

edited

Loading

		@@ -68,6 +68,7 @@

		static final List<String> DEFAULT_PROTOCOLS = List.of("TLSv1.3", "TLSv1.2", "TLSv1.1");

		// @formatter:off

Auto-formatting tweaks #47299

Auto-formatting tweaks #47299

Conversation

pugnascotia commented Sep 30, 2019

elasticmachine commented Sep 30, 2019

rjernst left a comment

Choose a reason for hiding this comment

rjernst Sep 30, 2019

Choose a reason for hiding this comment

pugnascotia Oct 1, 2019

Choose a reason for hiding this comment

rjernst Sep 30, 2019

Choose a reason for hiding this comment

pugnascotia Oct 1, 2019

Choose a reason for hiding this comment

rjernst Sep 30, 2019

Choose a reason for hiding this comment

pugnascotia Oct 1, 2019

Choose a reason for hiding this comment

JSON snippets

Aligned code

pugnascotia Oct 8, 2019

Choose a reason for hiding this comment

rjernst Oct 10, 2019 • edited Loading

Choose a reason for hiding this comment

pugnascotia commented Oct 23, 2019

pugnascotia commented Oct 23, 2019

rjernst commented Oct 23, 2019

pugnascotia commented Oct 24, 2019 • edited Loading

rjernst Oct 10, 2019 •

edited

Loading

pugnascotia commented Oct 24, 2019 •

edited

Loading