Different formatting of block line comments with openjdk 23+37-2369 #1153

dweiss · 2024-08-29T18:41:20Z

Applies to: 1.23.0

I came across this oddity - this file from Apache Lucene:
SandboxFacetsExample.txt

doesn't need any changes with jdk17-jdk22:

> java -version
openjdk version "22.0.1" 2024-04-16
OpenJDK Runtime Environment Temurin-22.0.1+8 (build 22.0.1+8)
OpenJDK 64-Bit Server VM Temurin-22.0.1+8 (build 22.0.1+8, mixed mode, sharing)
> java -jar google-java-format-1.23.0-all-deps.jar -n SandboxFacetsExample.java

but will result in reformatting under jdk 23 (ea, 23+37-2369):

> java -version
openjdk version "23" 2024-09-17
OpenJDK Runtime Environment (build 23+37-2369)
OpenJDK 64-Bit Server VM (build 23+37-2369, mixed mode, sharing)

>java -jar google-java-format-1.23.0-all-deps.jar -n SandboxFacetsExample.java
SandboxFacetsExample.java

Fully reproducible on Windows and Linux. The diff is:

--- a/SandboxFacetsExample.java
+++ b/SandboxFacetsExample.java_
@@ -149,7 +149,8 @@ public class SandboxFacetsExample {
         new FacetFieldCollectorManager<>(defaultTaxoCutter, defaultRecorder);

     //// (2.1) if we need to collect data using multiple different collectors, e.g. taxonomy and
-    ////       ranges, or even two taxonomy facets that use different Category List Field, we can
+    ////       ranges, or even two taxonomy facets that use different Category List Field, we
+    // can
     ////       use MultiCollectorManager, e.g.:
     // TODO: add a demo for it.
     // TaxonomyFacetsCutter publishDateCutter = new

The text was updated successfully, but these errors were encountered:

dweiss · 2024-08-29T19:16:00Z

I toyed a bit with debugging this and there's a difference in how these line comments appear to JavaCommentsHelper.wrapLineComments. With JDK21, it receives that comment line-by-line. With JDK23, it receives a concatenation of all lines, with lines 2 and on having a whitespace prefix:

java21:

////       ranges, or even two taxonomy facets that use different Category List Field, we can

java23:

//// (2.1) if we need to collect data using multiple different collectors, e.g. taxonomy and
    ////       ranges, or even two taxonomy facets that use different Category List Field, we can
    ////       use MultiCollectorManager, e.g.:

this triggers the difference because the split condition now sees different line length for those subsequent lines.

while (line.length() + column0 > Formatter.MAX_LINE_LENGTH) {
...

Hope this helps somehow.

dweiss · 2024-08-29T19:21:00Z

For what it's worth, core tests pass when I do this:

diff --git a/core/src/main/java/com/google/googlejavaformat/java/JavaCommentsHelper.java b/core/src/main/java/com/google/googlejavaformat/java/JavaCommentsHelper.java
index d34ecc4..d54b231 100644
--- a/core/src/main/java/com/google/googlejavaformat/java/JavaCommentsHelper.java
+++ b/core/src/main/java/com/google/googlejavaformat/java/JavaCommentsHelper.java
@@ -49,7 +49,11 @@ public final class JavaCommentsHelper implements CommentsHelper {
     List<String> lines = new ArrayList<>();
     Iterator<String> it = Newlines.lineIterator(text);
     while (it.hasNext()) {
-      lines.add(CharMatcher.whitespace().trimTrailingFrom(it.next()));
+      if (tok.isSlashSlashComment()) {
+        lines.add(CharMatcher.whitespace().trimFrom(it.next()));
+      } else {
+        lines.add(CharMatcher.whitespace().trimTrailingFrom(it.next()));
+      }
     }
     if (tok.isSlashSlashComment()) {
       return indentLineComments(lines, column0);

but I'm not sure whether this is the right fix. Perhaps it'd be good to find out why the token text is different with jdk 23 (as it's the primary cause of the problem).

cushon · 2024-09-10T15:20:10Z

Thanks for the bug and the investigation! Presumably the difference in JDK 23 is due to the new support for markdown doc comments.

dweiss · 2024-09-10T19:10:44Z

I think you're right - that's a spot-on observation. Interestingly, this comment appears inline with the code, not as a javadoc of anything in particular [1]. Could be a regression in the comment parser worth reporting to openjdk.

[1] https://github.com/apache/lucene/blob/304d4e7855deb39b4650d954d027ce8697873056/lucene/demo/src/java/org/apache/lucene/demo/facet/SandboxFacetsExample.java#L151-L153

cushon · 2024-09-11T22:10:59Z

I think it's probably a deliberate change on the javac side, to be able to process the entire /// comment as a single token instead of multiple line comments. Your fix of removing the leading and trailing whitespace seems OK to me, the formatter will add any necessary leading whitespace back as part of indentLineComments, it seems reasonable to remove the leading whitespace to fix the line length computation.

Are you interested in sending a PR?

#1153 PiperOrigin-RevId: 673553329

#1153 PiperOrigin-RevId: 673574143

google#1153

dweiss · 2024-09-12T08:21:20Z

Thank you for adding the regression test. I've created a PR with the basic workaround I suggested, hope it helps.

Fixes #1153. Fixes #1161 FUTURE_COPYBARA_INTEGRATE_REVIEW=#1161 from dweiss:1153-block-line-comments-in-java23 e3ed83c PiperOrigin-RevId: 674301748

dweiss mentioned this issue Aug 29, 2024

Upgrade to gradle 8.10 apache/lucene#13698

Closed

dweiss changed the title ~~Different formatting with openjdk 23+37-2369~~ Different formatting of block line comments with openjdk 23+37-2369 Aug 29, 2024

dweiss added a commit to dweiss/lucene that referenced this issue Aug 29, 2024

Work around google/google-java-format#1153

25d1f44

copybara-service bot pushed a commit that referenced this issue Sep 11, 2024

Add a regression test for handling of /// comments

de08ce7

#1153 PiperOrigin-RevId: 673553329

copybara-service bot mentioned this issue Sep 11, 2024

Add a regression test for handling of /// comments #1160

Merged

copybara-service bot pushed a commit that referenced this issue Sep 11, 2024

Add a regression test for handling of /// comments

8a0e3b3

#1153 PiperOrigin-RevId: 673574143

dweiss added a commit to dweiss/google-java-format that referenced this issue Sep 12, 2024

Fix different formatting of block line comments with openjdk 23+37-2369

e3ed83c

google#1153

dweiss mentioned this issue Sep 12, 2024

Fix different formatting of block line comments with openjdk 23+ #1161

Closed

copybara-service bot mentioned this issue Sep 13, 2024

Fix different formatting of block line comments with openjdk 23+ #1162

Merged

copybara-service bot closed this as completed in 5e0d9e3 Sep 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Different formatting of block line comments with openjdk 23+37-2369 #1153

Different formatting of block line comments with openjdk 23+37-2369 #1153

dweiss commented Aug 29, 2024 •

edited

Loading

dweiss commented Aug 29, 2024

dweiss commented Aug 29, 2024

cushon commented Sep 10, 2024

dweiss commented Sep 10, 2024

cushon commented Sep 11, 2024

dweiss commented Sep 12, 2024

Different formatting of block line comments with openjdk 23+37-2369 #1153

Different formatting of block line comments with openjdk 23+37-2369 #1153

Comments

dweiss commented Aug 29, 2024 • edited Loading

dweiss commented Aug 29, 2024

dweiss commented Aug 29, 2024

cushon commented Sep 10, 2024

dweiss commented Sep 10, 2024

cushon commented Sep 11, 2024

dweiss commented Sep 12, 2024

dweiss commented Aug 29, 2024 •

edited

Loading