Highlight declaration differences in overloaded symbol groups #928

QuietMisdreavus · 2024-05-23T21:59:33Z

Bug/issue #, if applicable: rdar://116409531

Summary

This PR introduces a new field to declaration tokens in Render JSON: highlight. This field indicates that the token is distinct among a set of overloads and should be indicated with a highlight or other style. It also performs a difference comparison of overloaded symbols' declarations so that the fields are populated for Swift-DocC-Render.

When combined with the related Swift-DocC-Render PR (linked below), the rendered page appears like this:

Because the differencing works directly on the declaration fragments received from the symbol graph, this PR also takes some care to break up .text tokens to prevent false-positive differences and create a cleaner output. For example, the below overloads correctly only highlight the <S> in the second overload, even though it originally contains a text token of >(. Also visible in this example is some proactive whitespace-trimming from highlighted sections; there is a unique text token containing a single space before the where clause in the second overload, but it's not highlighted to reduce clutter:

Performance

This PR uses the standard library's CollectionDifference API to generate the Longest Common Subsequence of the overload declarations. The diff algorithm can have a heavy time complexity, especially when run repeatedly like here. The standard library's implementation is well-optimized, but there is still a necessary performance hit to calculate the diffs. This following is a benchmark run on a large framework with many overloaded symbols:

┌──────────────────────────────────────────────────────────────────────────────────────────────────────────┐
│ Metric                                   │ Change          │ main                 │ current              │
├──────────────────────────────────────────────────────────────────────────────────────────────────────────┤
│ Duration for 'bundle-registration'       │ +1.913%¹,²      │ 10.351 sec           │ 10.549 sec           │
│ Duration for 'convert-total-time'        │ +3.857%³,⁴      │ 15.477 sec           │ 16.074 sec           │
│ Duration for 'documentation-processing'  │ +7.959%⁵,⁶      │ 5.003 sec            │ 5.401 sec            │
│ Duration for 'finalize-navigation-index' │ no change⁷      │ 0.046 sec            │ 0.046 sec            │
│ Peak memory footprint                    │ no change⁸      │ 811 MB               │ 835.4 MB             │
│ Data subdirectory size                   │ +2.028%⁹        │ 127.4 MB             │ 130 MB               │
│ Index subdirectory size                  │ -0.02829%¹⁰     │ 1.1 MB               │ 1.1 MB               │
│ Total DocC archive size                  │ +1.092%¹¹       │ 236.6 MB             │ 239.2 MB             │
│ Topic Anchor Checksum                    │ no change       │ 53074b40863239b7a00d │ 53074b40863239b7a00d │
│ Topic Graph Checksum                     │ change          │ 5bdfe323b77baec3d5cd │ f4af046ddef9eab49ba8 │
└──────────────────────────────────────────────────────────────────────────────────────────────────────────┘

Dependencies

swiftlang/swift-docc-render#847 is required to render the tokens with a highlight.

Testing

Steps:

Build the Swift-DocC-Render PR and link the resulting dist directory into the DOCC_HTML_DIR environment variable.
`swift run docc preview --enable-experimental-overloaded-symbol-presentation 'Tests/SwiftDocCTests/Test Bundles/OverloadedSymbols.docc'
Navigate to the OverloadedEnum/firstTestMemberName(_:) overload group and expand the declaration list.
Ensure that the declarations correctly highlight the parameter type in each overloaded declaration.

Checklist

Make sure you check off the following items. If they cannot be completed, provide a reason.

Added tests
Ran the ./bin/test script and it succeeded
[ n/a ] Updated documentation if necessary

QuietMisdreavus · 2024-05-23T21:59:42Z

@swift-ci Please test

Sources/SwiftDocC/Model/Rendering/RenderSectionTranslator/DeclarationsSectionTranslator.swift

Sources/SwiftDocC/Model/Rendering/Symbol/DeclarationsRenderSection.swift

patshaughnessy

Great work! Just some nitpicks about readability. And also I agree the recursive token data structure seems very counterintuitive.

Sources/SwiftDocC/Model/Rendering/RenderSectionTranslator/DeclarationsSectionTranslator.swift

patshaughnessy · 2024-05-29T22:51:54Z

Sources/SwiftDocC/Model/Rendering/RenderSectionTranslator/DeclarationsSectionTranslator.swift

+                    // Pre-process the declarations by splitting text fragments apart to increase legibility
+                    let mainDeclaration = declaration.declarationFragments.flatMap(preProcessFragment(_:))
+                    let processedOverloadDeclarations = overloadDeclarations.map({ ($0.declaration.flatMap(preProcessFragment(_:)), $0.reference) })
+                    let preProcessedDeclarations = [mainDeclaration] + processedOverloadDeclarations.map(\.0)


Maybe use names like mainFragments, processedOverloadFragments and preProcessedFragments since (I think) you have mapped from DeclarationFragments to Fragments just above.

I think the distinction is minimal. Even SymbolKit doesn't really distinguish between the concept of a "declaration" and "a list of declaration fragments"; the convenience Symbol.declarationFragments accessor directly returns the list instead of the wrapper object. It also avoids having to pluralize a plural, since processedOverloadDeclarations and preProcessedDeclarations are lists of lists. To make them reference "fragments" instead, it would have to be something like processedOverloadFragmentLists or the like, which i consider more awkward.

patshaughnessy · 2024-05-29T22:53:55Z

Sources/SwiftDocC/Model/Rendering/RenderSectionTranslator/DeclarationsSectionTranslator.swift

+            /// Translate a whole ``SymbolGraph`` declaration to a ``DeclarationRenderSection``
+            /// declaration and highlight any tokens that aren't shared with a sequence of common tokens.
+            func translateDeclaration(
+                _ declaration: [SymbolGraph.Symbol.DeclarationFragments.Fragment],


Similar, this could be func translateDeclaration(_ fragments:, commonFragments:)

Sources/SwiftDocC/Model/Rendering/RenderSectionTranslator/DeclarationsSectionTranslator.swift

Sources/SwiftDocC/Model/Rendering/Symbol/DeclarationsRenderSection.swift

Sources/SwiftDocC/Model/Rendering/RenderSectionTranslator/DeclarationsSectionTranslator.swift

mportiz08 · 2024-05-30T01:05:50Z

I'm also concerned about the possibility of other renderers that consume DocC Render JSON that might not anticipate the concept of recursive tokens.

FWIW, other renderers would already have to deal with recursion for basic existing things like formatting inline text (you can have any combination of bold/emphasis/codeVoice or nested lists, etc) with our existing tree-like concept of block/inline content which carries over from the markdown. I wouldn't actually anticipate that there would be any other renderer that would handle experimental features under feature flags like this in any case though.

d-ronnqvist

I strongly feel that we shouldn't redefine tokens to be a hierarchical structure. Not only because it breaks existing code in DocC and in clients but because it's a surprising and unconventional way to model an already well defined concept.

d-ronnqvist · 2024-05-31T10:24:24Z

I'm also concerned about the possibility of other renderers that consume DocC Render JSON that might not anticipate the concept of recursive tokens.

FWIW, other renderers would already have to deal with recursion for basic existing things like formatting inline text (you can have any combination of bold/emphasis/codeVoice or nested lists, etc) with our existing tree-like concept of block/inline content which carries over from the markdown. I wouldn't actually anticipate that there would be any other renderer that would handle experimental features under feature flags like this in any case though.

That is different because "Text" in the RenderNode spec isn't recursive, it's a simple object with "text" and "type" properties (very similar to "DeclarationToken"):

"Text": {
    "type": "object",
    "required": [
        "type",
        "text"
    ],
    "properties": {
        "type": {
            "type": "string",
            "enum": ["text"]
        },
        "text": {
            "type": "string"
        }
    }
},

Just like "DeclarationToken", I would be opposed to making "Text" in the RenderNode spec recursive.

Instead, the way that we have modeled emphasis, strong, codeVoice, etc. in the RenderNode spec is by defining many dedicated types—some of which are recursive and some of which are not—and wrapping them all in a "RenderInlineContent" type:

"RenderInlineContent": {
    "oneOf": [
        {
            "$ref": "#/components/schemas/Text"
        },
        {
            "$ref": "#/components/schemas/Emphasis"
        },
        {
            "$ref": "#/components/schemas/Strong"
        },
        {
            "$ref": "#/components/schemas/CodeVoice"
        },
        ...
        }
    ]
}

We could introduce this type of structure for declarations as well—like what @vera suggested with enum RenderToken {} (which I feel would be better named "DeclarationInlineContent" for consistency and to distance it from the "token" concept)—but to use that structure we would need to make breaking changes to the various "tokens" properties in the RenderNode spec:

 "tokens": {
    "type": "array",
    "items": {
-       "$ref": "#/components/schemas/DeclarationToken"
+       "$ref": "#/components/schemas/DeclarationInlineContent"
    }
}

As far as I know, we don't have a process yet for how to handle breaking change to the RenderNode spec.

If the Swift code won't mirror the Render Node spec, I don't feel like this structure makes sense for the Swift code.

mportiz08 · 2024-06-02T20:02:21Z

I've created a draft renderer PR for both proposed Render JSON schemas that have been discussed in this PR:

new token kind: Highlight unique components of overloaded declarations swift-docc-render#841
new flag for all tokens: Highlight unique components of overloaded declarations [alternate approach] swift-docc-render#847

Please let me know which approach ends up being decided for this PR, and I'll close the corresponding renderer PR that isn't needed and remove draft status from the other.

QuietMisdreavus · 2024-06-04T22:43:08Z

@swift-ci Please test

QuietMisdreavus · 2024-06-05T16:44:02Z

@swift-ci Please test

QuietMisdreavus · 2024-06-05T16:45:03Z

@d-ronnqvist @patshaughnessy I've updated the PR to remove the recursive token type and refactor the code to clean it up a bit. I've also added some tests, so i've removed the draft status from the PR.

patshaughnessy

Thanks a lot for moving the code around and extracting the methods. The new initializers make things much easier to read 👏

Sources/SwiftDocC/Model/Rendering/RenderSectionTranslator/DeclarationsSectionTranslator.swift

Sources/SwiftDocC/Model/Rendering/Symbol/DeclarationsRenderSection.swift

QuietMisdreavus · 2024-06-07T15:43:40Z

@swift-ci Please test

d-ronnqvist · 2024-06-07T17:36:39Z

Tests/SwiftDocCTests/Rendering/DeclarationsRenderSectionTests.swift

@@ -151,4 +152,114 @@ class DeclarationsRenderSectionTests: XCTestCase {
        XCTAssertEqual(declarationsSection.declarations.count, 2)
        XCTAssert(declarationsSection.declarations.allSatisfy({ $0.platforms == [.iOS, .macOS] }))
    }
+
+    func testHighlightDiff() throws {


I think it would be good to add more—and more complicated—tests for this. The code works really well but it'd be good to have more comprehensive testing so that we can be confident that we don't break it in the future.

This is just one test with two overloads func myFunc(param: Int) and func myFunc<S>(param: S) where S : StringProtocol but there's a lot more possible syntax that would be worth testing, especially for text tokens since there's logic to split those for better diffident.

Here are the variations that I could think of for a single parameter type. We definitely don't need to add tests for all of them but I think we should have something that's an array, something with a dictionary, something with a tuple, something with an optional, something with a closure type, something with a variadic, and something with a generic argument.

public func doSomething(with: Int) {} public func doSomething(with: Int?) {} public func doSomething(with: [Int]?) {} public func doSomething(with: [Int?]) {} public func doSomething(with: (Int, Int)) {} public func doSomething(with: (Int?, Int)) {} public func doSomething(with: (Int, Int?)) {} public func doSomething(with: (Int, Int)?) {} public func doSomething(with: [Int: Int]) {} public func doSomething(with: [Int: Int]?) {} public func doSomething(with: [Int?: Int]) {} public func doSomething(with: [Int: Int?]) {} public func doSomething(with: Int...) {} public func doSomething(with: Int?...) {} public func doSomething(with: Set<Int>) {} public func doSomething(with: Set<Int?>) {} public func doSomething(with: Set<Int>?) {} public func doSomething(with: (Int) -> ()) {} public func doSomething(with: (Int) -> Int) {} public func doSomething(with: (Int?) -> Int) {} public func doSomething(with: (Int) -> Int?) {} public func doSomething(with: ((Int) -> Int)?) {}

(by the way, the diff in this test is not that "fancy". I can come think with much more complex differences between overloads)

The existing code seems to trip up with tuples and closures, because the differencing eagerly gloms onto the first closing parenthesis it finds and treats that as a common token with the closing parenthesis for the argument list in other overloads...

It also makes the whitespace trimming fall apart for where clauses, since now the argument list parenthesis is considered a different token:

I can write a test with these symbols, but the highlighting here is a bit unfortunate. However, to fix it "properly" would require introducing the complete symbol information into the differencing algorithm somehow, so that the entire argument type could be considered a distinct "token" and the correct parenthesis (and comma, in case of multiple arguments) could be counted as a common token. If we decide that we want to improve this, i'd like to defer that to after this PR lands so that we can get a "good-enough" implementation landed that we can iterate on.

The highlighting is great. I don't think the few cases where it could be slightly better should hold back this PR.

If anything we could add a comment in the tests for the highlights that could be slightly better as extra information for anyone who wants to iterate on this in the future.

In other words: the implementation looks great and we don't need to add many new tests but I think we should add a handful (maybe one with a tuple, one with a closure, and one with an array/dictionary and then fit an optional and a generic value in one of those 3 cases). How does that sound?

I've updated my tests to test the following overload groups:

public func overload1(param: Int) {} public func overload1(param: Int?) {} public func overload1(param: [Int]) {} public func overload1(param: [Int]?) {} public func overload1(param: Set<Int>) {} public func overload1(param: [Int: Int]) {} public func overload2(p1: Int, p2: Int) {} public func overload2(p1: (Int, Int), p2: Int) {} public func overload2(p1: Int, p2: (Int, Int)) {} public func overload2(p1: (Int) -> (), p2: Int) {} public func overload2(p1: (Int) -> Int, p2: Int) {} public func overload2(p1: (Int) -> Int?, p2: Int) {} public func overload2(p1: ((Int) -> Int)?, p2: Int) {} public func overload3(_ p: [Int: Int]) {} public func overload3<T: Hashable>(_ p: [T: T]) {} public func overload3<K: Hashable, V>(_ p: [K: V]) {}

I feel like this both tests the features we want (type decorators getting highlighted, whitespace trimmed off highlighted tokens, splitting >( tokens) and also adds the known edge case issues (parentheses and commas throwing off the diff).

I also added a convenience wrapper in the test code so that i could more concisely test that certain spans of declarations were being highlighted as expected. I'm not 100% sure that this is completely useful, but it helped when rewriting the tests to work with six or seven overloads at a time. 😅

After some out-of-band discussion, i've rewritten the test wrapper to render plain-text comparison strings instead of using the string fragments i originally used.

Sources/SwiftDocC/Model/Rendering/RenderSectionTranslator/DeclarationsSectionTranslator.swift

This reverts commit ed38756. This reverts commit 31ee7a2.

…sabled

QuietMisdreavus · 2024-06-20T22:36:53Z

@swift-ci Please test

QuietMisdreavus · 2024-06-25T17:58:12Z

@swift-ci Please test

d-ronnqvist

Looks great.

QuietMisdreavus · 2024-06-27T22:59:09Z

@swift-ci Please test

…ang#928) rdar://116409531

QuietMisdreavus requested review from patshaughnessy, d-ronnqvist and mayaepps May 23, 2024 21:59

d-ronnqvist reviewed May 24, 2024

View reviewed changes

Sources/SwiftDocC/Model/Rendering/RenderSectionTranslator/DeclarationsSectionTranslator.swift Outdated Show resolved Hide resolved

mayaepps reviewed May 24, 2024

View reviewed changes

Sources/SwiftDocC/Model/Rendering/RenderSectionTranslator/DeclarationsSectionTranslator.swift Outdated Show resolved Hide resolved

mayaepps reviewed May 24, 2024

View reviewed changes

Sources/SwiftDocC/Model/Rendering/RenderSectionTranslator/DeclarationsSectionTranslator.swift Outdated Show resolved Hide resolved

mportiz08 mentioned this pull request May 29, 2024

Highlight unique components of overloaded declarations swiftlang/swift-docc-render#841

Merged

3 tasks

d-ronnqvist reviewed May 29, 2024

View reviewed changes

Sources/SwiftDocC/Model/Rendering/Symbol/DeclarationsRenderSection.swift Outdated Show resolved Hide resolved

patshaughnessy requested changes May 30, 2024

View reviewed changes

d-ronnqvist requested changes May 31, 2024

View reviewed changes

QuietMisdreavus force-pushed the diff-overloads branch from f949693 to dbf0f89 Compare June 3, 2024 22:45

QuietMisdreavus force-pushed the diff-overloads branch from a6d1768 to a493f69 Compare June 5, 2024 16:12

QuietMisdreavus requested review from d-ronnqvist and patshaughnessy June 5, 2024 16:43

QuietMisdreavus marked this pull request as ready for review June 5, 2024 16:43

QuietMisdreavus requested a review from daniel-grumberg June 5, 2024 16:51

patshaughnessy approved these changes Jun 7, 2024

View reviewed changes

d-ronnqvist reviewed Jun 7, 2024

View reviewed changes

Sources/SwiftDocC/Model/Rendering/RenderSectionTranslator/DeclarationsSectionTranslator.swift Outdated Show resolved Hide resolved

d-ronnqvist reviewed Jun 7, 2024

View reviewed changes

Sources/SwiftDocC/Model/Rendering/RenderSectionTranslator/DeclarationsSectionTranslator.swift Outdated Show resolved Hide resolved

d-ronnqvist reviewed Jun 7, 2024

View reviewed changes

Sources/SwiftDocC/Model/Rendering/RenderSectionTranslator/DeclarationsSectionTranslator.swift Outdated Show resolved Hide resolved

d-ronnqvist self-requested a review June 10, 2024 08:46

QuietMisdreavus added 16 commits June 20, 2024 16:34

add test for overload diff

67c94a3

revert recursive tokens implementation

e4e756d

This reverts commit ed38756. This reverts commit 31ee7a2.

review: represent highlights as an enum property

b035fe1

refactor and reformat the token processing code

cc722b8

add test to ensure that highlights don't happen when overloads are di…

d66e752

…sabled

review: keep partitions as substrings

c02ed90

review: make the overload declaration check a precondition

4d17161

review: remove cloning initializers in favor of mutating fields directly

2130296

review: use OverloadDeclaration explicitly

5103f6d

review: add isHighlighted convenience property

7f071f8

review: extract the overloadDeclarations assignment into a function

1121b63

review: precalculate platform names and languages for alternate decls

de125de

review: allow nil platforms in alternate declarations

daa2a70

review: refactor postProcessTokens loop to remove an optional

2be50f3

refactor: write comparison declarations in a more concise way

a942e9c

review: add a wider variety of tests for overload diff highlighting

2a47840

QuietMisdreavus force-pushed the diff-overloads branch from 7f45f2c to 2a47840 Compare June 20, 2024 22:36

review: simplify the testing harness to use a plain-text comparator

4f96243

d-ronnqvist approved these changes Jun 27, 2024

View reviewed changes

Merge branch 'main' into diff-overloads

7adfbdc

QuietMisdreavus merged commit f019ab8 into swiftlang:main Jun 27, 2024
2 checks passed

QuietMisdreavus added a commit to QuietMisdreavus/swift-docc that referenced this pull request Jun 28, 2024

Highlight declaration differences in overloaded symbol groups (swiftl…

0f9c538

…ang#928) rdar://116409531

This was referenced Jun 28, 2024

Highlight unique components of overloaded declarations [alternate approach] swiftlang/swift-docc-render#847

Closed

Highlight declaration differences in overloaded symbol groups #967

Merged

QuietMisdreavus deleted the diff-overloads branch June 28, 2024 15:14

hqhhuang mentioned this pull request Jun 28, 2024

[6.0] Highlight unique components of overloaded declarations [alternate approach] swiftlang/swift-docc-render#876

Merged

QuietMisdreavus mentioned this pull request Nov 6, 2024

Suggest the minimal type disambiguation when an overload doesn't have any unique types #1087

Merged

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Highlight declaration differences in overloaded symbol groups #928

Highlight declaration differences in overloaded symbol groups #928

QuietMisdreavus commented May 23, 2024 •

edited

Loading

QuietMisdreavus commented May 23, 2024

patshaughnessy left a comment

patshaughnessy May 29, 2024

QuietMisdreavus Jun 5, 2024

patshaughnessy May 29, 2024

mportiz08 commented May 30, 2024

d-ronnqvist left a comment

d-ronnqvist commented May 31, 2024

mportiz08 commented Jun 2, 2024

QuietMisdreavus commented Jun 4, 2024

QuietMisdreavus commented Jun 5, 2024

QuietMisdreavus commented Jun 5, 2024

patshaughnessy left a comment

QuietMisdreavus commented Jun 7, 2024

d-ronnqvist Jun 7, 2024

d-ronnqvist Jun 7, 2024

QuietMisdreavus Jun 18, 2024

d-ronnqvist Jun 19, 2024

QuietMisdreavus Jun 20, 2024

QuietMisdreavus Jun 25, 2024

QuietMisdreavus commented Jun 20, 2024

QuietMisdreavus commented Jun 25, 2024

d-ronnqvist left a comment

QuietMisdreavus commented Jun 27, 2024

Highlight declaration differences in overloaded symbol groups #928

Highlight declaration differences in overloaded symbol groups #928

Conversation

QuietMisdreavus commented May 23, 2024 • edited Loading

Summary

Performance

Dependencies

Testing

Checklist

QuietMisdreavus commented May 23, 2024

patshaughnessy left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mportiz08 commented May 30, 2024

d-ronnqvist left a comment

Choose a reason for hiding this comment

d-ronnqvist commented May 31, 2024

mportiz08 commented Jun 2, 2024

QuietMisdreavus commented Jun 4, 2024

QuietMisdreavus commented Jun 5, 2024

QuietMisdreavus commented Jun 5, 2024

patshaughnessy left a comment

Choose a reason for hiding this comment

QuietMisdreavus commented Jun 7, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

QuietMisdreavus commented Jun 20, 2024

QuietMisdreavus commented Jun 25, 2024

d-ronnqvist left a comment

Choose a reason for hiding this comment

QuietMisdreavus commented Jun 27, 2024

QuietMisdreavus commented May 23, 2024 •

edited

Loading