
📎 Prettier Compatibility Metric #2555

Closed
MichaReiser opened this issue May 7, 2022 · 12 comments
Labels: A-Formatter (Area: formatter), good first issue (Good for newcomers), S-Wishlist (Possible interesting features not on the current roadmap), task (A task, an action that needs to be performed)

Comments

@MichaReiser
Contributor

MichaReiser commented May 7, 2022

Description

Rome's goal is for our formatting to closely match Prettier's. However, it's currently difficult to know whether a PR improves compatibility or makes it worse.

Goal

Define a Prettier compatibility metric and provide a means to compute it for the current Rome version.

Proposal

Percentage of lines that match Prettier's formatting, similar to git's similarity index

compatibility_per_file = matching_lines / MAX(lines_file_1, lines_file_2)
compatibility = SUM(matching_lines) / SUM(MAX(lines_file_1, lines_file_2))

I'm not very proficient at math and the metric might be flawed. Please feel free to propose other metrics.

Code Pointers

Our test runner already has an option to generate a report by setting the REPORT_PRETTIER environment variable.

fn print(&self) {
    // Only create the report file if the REPORT_PRETTIER
    // environment variable is set to 1
    match env::var("REPORT_PRETTIER") {
        Ok(value) if value == "1" => {}
        _ => return,
    }

    let mut report = String::new();
    let mut state = self.state.lock();
    state.sort_by_key(|(name, ..)| *name);

    for (file_name, rome, prettier) in state.iter() {
        writeln!(report, "# {}", file_name).unwrap();
        writeln!(report, "```diff").unwrap();
        for (tag, line) in diff_lines(Algorithm::default(), prettier, rome) {
            let line = line.strip_suffix('\n').unwrap_or(line);
            writeln!(report, "{}{}", tag, line).unwrap();
        }
        writeln!(report, "```").unwrap();
    }

    write("report.md", report).unwrap();
}
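
For the metric itself, `matching_lines` can be derived from the same `diff_lines` output by counting the entries tagged as equal. A minimal sketch, assuming the `similar` crate the runner already uses; `prettier_similarity` is a hypothetical helper, not existing code:

```rust
use similar::{utils::diff_lines, Algorithm, ChangeTag};

/// Per-file compatibility: matching lines divided by the line count
/// of the longer of the two outputs.
fn prettier_similarity(prettier: &str, rome: &str) -> f64 {
    // Count the lines that are unchanged between the two outputs.
    let matching_lines = diff_lines(Algorithm::default(), prettier, rome)
        .into_iter()
        .filter(|(tag, _)| *tag == ChangeTag::Equal)
        .count();
    let max_lines = prettier.lines().count().max(rome.lines().count());
    if max_lines == 0 {
        1.0 // two empty outputs are trivially identical
    } else {
        matching_lines as f64 / max_lines as f64
    }
}
```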

It should be straightforward to:

  • Compute the metric for every file and include it in the report
  • Compute the metric across all files and print it at the top of the report.

Stretch

  • Create a CI job, similar to the parser conformance job, that computes the overall metric on the PR branch and on main, and comments with the two numbers together with the difference PR - main (the percentage by which the PR improved compatibility)
  • Include a link to the full report in the comment (it should be possible to commit the report as a gist)
  • Ultimate solution: compare the metric for every file and report the count of files for which the metric a) didn't change, b) increased, c) decreased. List the names of the files for which the metric increased or decreased (see the sketch below).
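
A rough sketch of that per-file comparison; the `Comparison` type, the `compare` function, and the idea of keying per-file similarities by file name are all hypothetical, not existing code:

```rust
use std::collections::HashMap;

/// Outcome of comparing per-file similarities between main and a PR branch.
struct Comparison {
    unchanged: usize,
    increased: Vec<String>,
    decreased: Vec<String>,
}

/// Buckets every file present on both branches by whether its
/// similarity increased, decreased, or stayed the same.
fn compare(main: &HashMap<String, f64>, pr: &HashMap<String, f64>) -> Comparison {
    let mut result = Comparison {
        unchanged: 0,
        increased: Vec::new(),
        decreased: Vec::new(),
    };
    for (file, &main_value) in main {
        if let Some(&pr_value) = pr.get(file) {
            if pr_value > main_value {
                result.increased.push(file.clone());
            } else if pr_value < main_value {
                result.decreased.push(file.clone());
            } else {
                result.unchanged += 1;
            }
        }
    }
    result
}
```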
@MichaReiser MichaReiser added the task A task, an action that needs to be performed label May 7, 2022
@MichaReiser MichaReiser changed the title 📎 Comment Prettier Compatibility on PRs 📎 Prettier Compatibility Metric May 7, 2022
@MichaReiser MichaReiser added A-Formatter Area: formatter S-Wishlist Possible interesting features not on the current roadmap good first issue Good for newcomers labels May 7, 2022
@yassere yassere added this to Rome 2022 May 8, 2022
@IWANABETHATGUY
Contributor

I'd like to give it a try.

@MichaReiser
Contributor Author

> I'd like to give it a try.

Awesome. I assigned you the issue. Ping me if something is unclear or if you need some pointers. I recommend approaching this problem step by step (PR by PR): build out the tools first and test them as a CLI before approaching the CI.

@IWANABETHATGUY
Contributor

I propose another formula to calculate the Prettier compatibility:

compatibility_per_file = matching_lines / MAX(lines_file_1, lines_file_2)
compatibility = SUM(compatibility_per_file) / number_of_files

Neither formula is clearly better in every case; it depends. For example, assume we have three files A, B, and C:


matching_lines:
A: 10000
B: 50
C: 60

MAX(lines_file_1, lines_file_2):
A: 10010
B: 100
C: 100


We use the same formula to calculate compatibility_per_file, so we get:

compatibility_a: 0.999
compatibility_b: 0.5
compatibility_c: 0.6

The result of the first (line-based) formula:
compatibility: 0.990

The result of my (file-based) formula:
compatibility: 0.700

If someone resolves all the compatibility issues in file B, the per-file values become:

compatibility_a: 0.999
compatibility_b: 1
compatibility_c: 0.6

The result of the first formula:
compatibility: 0.995

The result of my formula:
compatibility: 0.866

The compatibility diff under the first formula:
0.990 -> 0.995
The compatibility diff under the second formula:
0.700 -> 0.866

I'd call the formula that @MichaReiser proposed line based and the second formula file based.

@MichaReiser
Contributor Author

My understanding is that you're proposing an alternative metric for the overall compatibility while keeping the same metric for a single file.

The file-based metric calculates the average of the per-file compatibilities, whereas the line-based metric measures how many lines in total are similar. That's why I would call these Similarity (line based) and Avg similarity (file based).

In my view, both of these provide valuable signal and I would recommend implementing both to see which one works better to track our work. What do you think?

@IWANABETHATGUY
Contributor

Agreed.

@IWANABETHATGUY
Contributor

File Based Average Prettier Similarity:

compatibility_per_file = matching_lines / MAX(lines_file_1, lines_file_2)
file_based_average_prettier_similarity = SUM(compatibility_per_file) / number_of_files

Line Based Average Prettier Similarity:

line_based_average_prettier_similarity = SUM(matching_lines) / SUM(MAX(lines_file_1, lines_file_2))
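
A minimal sketch computing both aggregates from per-file counts; `aggregate_similarity` and its `(matching_lines, max_lines)` input shape are hypothetical, not code from the repository:

```rust
/// Returns (file_based, line_based) similarity for a non-empty set of
/// files, where each entry is (matching_lines, max_lines) for one file.
fn aggregate_similarity(files: &[(usize, usize)]) -> (f64, f64) {
    // File-based: the average of the per-file ratios.
    let file_based = files
        .iter()
        .map(|&(matching, max)| matching as f64 / max as f64)
        .sum::<f64>()
        / files.len() as f64;

    // Line-based: the ratio of the summed counts.
    let matching_total: usize = files.iter().map(|&(m, _)| m).sum();
    let max_total: usize = files.iter().map(|&(_, x)| x).sum();
    let line_based = matching_total as f64 / max_total as f64;

    (file_based, line_based)
}
```

With the three files from the example above, `aggregate_similarity(&[(10_000, 10_010), (50, 100), (60, 100)])` returns roughly `(0.700, 0.990)`.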

@MichaReiser MichaReiser moved this to In Progress in Rome 2022 May 13, 2022
@ematipico
Contributor

I saw this PR was merged: #2574

What's missing now?

@MichaReiser
Contributor Author

The metric is merged, but what would be nice to have is a CI job that comments with the current metric and compares it with main (ideally per file).

@IWANABETHATGUY
Contributor

I am still working on the CI.

@NicholasLYang
Contributor

I have some concerns about a numerical metric. Testing out Rome on some small projects, I've noticed that there are large diffs that occur from very small changes like trailing commas. On the flip side, there are some changes that are small in line diffs, but produce formatted output that, at least in my view, is harder to read and less appealing visually.

I don't want to discourage a numerical metric, but I do think we should take more stuff into consideration when thinking about compatibility.

@MichaReiser
Contributor Author

> I have some concerns about a numerical metric. Testing out Rome on some small projects, I've noticed that there are large diffs that occur from very small changes like trailing commas. On the flip side, there are some changes that are small in line diffs, but produce formatted output that, at least in my view, is harder to read and less appealing visually.
>
> I don't want to discourage a numerical metric, but I do think we should take more stuff into consideration when thinking about compatibility.

This metric is a tool that helps us approximate the Prettier compatibility. It isn't an exact representation. Nevertheless, it helps us measure whether we are moving in the right direction and gives us a rough understanding of how close we are. However, it doesn't mean that our ultimate goal is to reach 100% or that we should optimize for it at any cost. That would be a misuse of the metric.

Regarding trailing commas: we should make sure that we compare apples with apples, meaning we should apply the same formatting options to both formatters.

@github-actions

This issue is stale because it has been open 14 days with no activity.

Repository owner moved this from In Progress to Done in Rome 2022 Sep 30, 2022