
Clean up NNUE template parameters #5584

Closed
MinetaS wants to merge 1 commit

Conversation

@MinetaS (Contributor) commented Sep 10, 2024

Having multiple template parameters might suggest that disallowed combinations are possible, such as:

FeatureTransformer<TransformedFeatureDimensionsSmall, &StateInfo::accumulatorBig>

By grouping the parameters into a single struct, the code becomes clearer, more comprehensible, and easier to maintain.

No functional change
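
For illustration, here is a minimal, self-contained sketch of the grouping idea. The stand-in types and the dimension values are hypothetical; only the shape of the change mirrors the actual code:

#include <cstdint>

// Hypothetical stand-ins so the sketch compiles on its own; the real
// definitions live in the NNUE code.
using IndexType = std::uint32_t;

template<IndexType Size>
struct Accumulator {
    std::int16_t values[Size];
};

constexpr IndexType TransformedFeatureDimensionsBig   = 3072;  // placeholder value
constexpr IndexType TransformedFeatureDimensionsSmall = 128;   // placeholder value

struct StateInfo {
    Accumulator<TransformedFeatureDimensionsBig>   accumulatorBig;
    Accumulator<TransformedFeatureDimensionsSmall> accumulatorSmall;
};

// Before: two template parameters that the caller must keep in sync; a
// reader might think mixing them freely is a meaningful option.
//
// After: one parameter struct fixes the valid pairing at its definition:
struct BigFeatureTransformerParams {
    static constexpr IndexType TransformedFeatureDimensions = TransformedFeatureDimensionsBig;
    static constexpr auto      accPtr = &StateInfo::accumulatorBig;
};

template<typename Params>
class FeatureTransformer {
    static constexpr IndexType Dimensions = Params::TransformedFeatureDimensions;
    // Accumulator access goes through Params::accPtr, e.g. (state.*Params::accPtr).
};

// Only the predefined, valid pairings can be named:
using BigTransformer = FeatureTransformer<BigFeatureTransformerParams>;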

@MinetaS mentioned this pull request Sep 10, 2024
@MinetaS force-pushed the 2680c9c7-2-PR branch 2 times, most recently from 3215928 to 132d421 on September 10, 2024 18:07

@vondele (Member) commented Sep 10, 2024

I think that's the right direction; I would still suggest running a non-regression test for these kinds of cleanups.

@Sopel97 (Member) commented Sep 11, 2024

I don't like how the FeatureTransformer is getting information about the whole rest of the network architecture. The code has always looked like it's something special, but it's just a normal layer in the network that happens to do some extra caching. Other than that, it looks good.

@MinetaS (Contributor, Author) commented Sep 11, 2024

I don't like how the FeatureTransformer is getting information about the whole rest of the network architecture. The code has always looked like it's something special, but it's just a normal layer in the network that happens to do some extra caching. Other than that, it looks good.

I agree, but that also suggests that the current code leads to some confusion. The accumulator arrays are tied to the FeatureTransformer and are defined per net type. From my perspective, having the FT take a pointer-to-member does not make it a generic layer, since it relies on specific members of the StateInfo struct.

Let me know if this looks right to you:

template<IndexType _L1, int _L2, int _L3, Accumulator<_L1> StateInfo::*_accPtr>
struct NetworkType {
    static constexpr IndexType L1 = _L1;
    static constexpr int       L2 = _L2;
    static constexpr int       L3 = _L3;

    struct FeatureTransformerType {
        static constexpr IndexType TransformedFeatureDimensions = L1;
        static constexpr Accumulator<TransformedFeatureDimensions> StateInfo::*accPtr = _accPtr;
    };
};

and

    using Transformer = FeatureTransformer<typename Type::FeatureTransformerType>;
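
For concreteness, a hypothetical instantiation under this scheme (15 and 32 below are placeholder L2/L3 values, not the actual network sizes):

// Hypothetical usage of the NetworkType proposal above.
using BigNetworkType =
    NetworkType<TransformedFeatureDimensionsBig, 15, 32, &StateInfo::accumulatorBig>;

// The FeatureTransformer then sees only the nested parameter struct:
using Transformer = FeatureTransformer<BigNetworkType::FeatureTransformerType>;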

@Sopel97 (Member) commented Sep 11, 2024

I was thinking simply

template<IndexType _L1, int _L2, int _L3, Accumulator<_L1> StateInfo::*_accPtr>
struct NetworkType {
    static constexpr IndexType L1 = _L1;
    static constexpr int       L2 = _L2;
    static constexpr int       L3 = _L3;

    static constexpr Accumulator<L1> StateInfo::*accPtr = _accPtr;
};

...
using Transformer = FeatureTransformer<Type::L1, Type::accPtr>;

@MinetaS (Contributor, Author) commented Sep 11, 2024

I was thinking simply

template<IndexType _L1, int _L2, int _L3, Accumulator<_L1> StateInfo::*_accPtr>
struct NetworkType {
    static constexpr IndexType L1 = _L1;
    static constexpr int       L2 = _L2;
    static constexpr int       L3 = _L3;

    static constexpr Accumulator<L1> StateInfo::*accPtr = _accPtr;
};

...
using Transformer = FeatureTransformer<Type::L1, Type::accPtr>;

As I mentioned in the commit message, the two template parameters are grouped because they are not independent. It's never allowed to have FeatureTransformer<TransformedFeatureDimensionsBig, &StateInfo::accumulatorSmall> or vice versa (and there can be more disallowed combinations, e.g. linrock's triple-NNUE experiments). I believe going back to this implementation defeats the whole purpose of this PR.
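
To illustrate (reusing the hypothetical definitions sketched earlier): with separate parameters, the interface invites combinations that are only rejected because the pointer's type happens to depend on the dimension; with grouped parameters, the invalid pairing cannot even be named.

// With separate parameters, the API shape suggests free mixing; the
// mismatch below is a type error only because the pointer type depends
// on the first parameter:
//
//   FeatureTransformer<TransformedFeatureDimensionsBig,
//                      &StateInfo::accumulatorSmall>;  // rejected, but invited
//
// With grouped parameters, only the predefined pairings exist to name:
using ValidTransformer = FeatureTransformer<BigFeatureTransformerParams>;  // OK
// There is simply no params struct pairing big dimensions with accumulatorSmall.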

@Sopel97 (Member) commented Sep 11, 2024

You can't exclude all buggy code at compile time. Even then, nothing would prevent creating two networks that point to the same accumulator.

I don't think this one case of keeping the parameters inseparable warrants tightly coupling the FeatureTransformer with the whole network. You should be able to create the FeatureTransformer as a standalone layer without having to specify anything about the other layers. And I think the PR has a lot of merit outside of this small change.
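
For example (again using the hypothetical NetworkType above, with placeholder L2/L3 values):

// Both aliases compile, yet the two networks would share, and clobber,
// the same cached accumulator state:
using NetA = NetworkType<TransformedFeatureDimensionsBig, 15, 32, &StateInfo::accumulatorBig>;
using NetB = NetworkType<TransformedFeatureDimensionsBig, 15, 32, &StateInfo::accumulatorBig>;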

@MinetaS (Contributor, Author) commented Sep 11, 2024

OK, now I understand. Although revamping the FT template parameters was the main purpose of this PR, it seems to bring other kinds of benefits as well.

@MinetaS (Contributor, Author) commented Sep 12, 2024

The revised code failed to pass the regression test.
https://tests.stockfishchess.org/tests/view/66e1e33686d5ee47d953a5a1

The initial version's test is likely to pass:
https://tests.stockfishchess.org/tests/view/66e0989486d5ee47d953a45d

I believe this is a false negative, but let me know how to proceed.

@Disservin (Member) commented

I'd say give the revised one a rerun

@MinetaS (Contributor, Author) commented Sep 13, 2024

Both tests have failed, so I'm closing this for now. Something seems to be affecting the compiler optimizations.

@MinetaS closed this Sep 13, 2024