i#5036 A64 scatter/gather, part 1: Expand scalar+vector loads (#6267)
Creates an AArch64 version of drx_expand_scatter_gather() and tests for it. So far only SVE scalar+vector loads are supported. Support and tests for more instructions will follow in future commits. State restore is also not yet supported and will be implemented when i#5365 is complete.

Issue: #5036
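
For readers unfamiliar with what a scalar+vector gather load does, the following is a minimal sketch of the architectural behaviour of one such form, `LD1H/LD1SH { Zt.S }, Pg/Z, [Xn, Zm.S, <extend> {#1}]`, written as a plain C loop of per-element scalar loads. It is not the code added by this commit and does not use any DynamoRIO API: the function name, the enum, and the parameter layout are all hypothetical, and the predicate is modelled as one `bool` per element rather than as a real SVE predicate register.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical names for illustration only; not part of DynamoRIO. */
typedef enum { EXTEND_UXTW, EXTEND_SXTW } offset_extend_t;

/* Models a 32-bit-element halfword gather load with zeroing predication.
 * zt and zm may alias (the "Zt==Zm" case): each offset is read before the
 * corresponding destination element is written.
 */
static void
emulate_gather_ld1h_32(uint32_t *zt, const bool *pg, const uint8_t *base,
                       const uint32_t *zm, size_t num_elements,
                       offset_extend_t extend, bool scaled, bool sign_extend_value)
{
    assert(zt != NULL && pg != NULL && base != NULL && zm != NULL);
    for (size_t i = 0; i < num_elements; i++) {
        if (!pg[i]) {
            zt[i] = 0; /* Inactive elements are zeroed (Pg/Z). */
            continue;
        }
        /* Extend the 32-bit vector offset to 64 bits: sxtw or uxtw. */
        int64_t offset = (extend == EXTEND_SXTW) ? (int64_t)(int32_t)zm[i]
                                                 : (int64_t)zm[i];
        /* A scaled offset is multiplied by the size of the memory element. */
        if (scaled)
            offset *= (int64_t)sizeof(uint16_t);
        uint16_t value;
        memcpy(&value, base + offset, sizeof(value));
        /* ld1sh sign-extends the loaded halfword; ld1h zero-extends it. */
        zt[i] = sign_extend_value ? (uint32_t)(int32_t)(int16_t)value
                                  : (uint32_t)value;
    }
}
```

The same function covers the `uxtw`/`sxtw` and scaled/unscaled variants that appear in the test names below by switching the `extend` and `scaled` arguments. Passing the same array for `zt` and `zm` corresponds to the `Zt==Zm` tests.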
1 parent ad5e838, commit 1a14566
Showing 23 changed files with 2,443 additions and 160 deletions.
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,86 @@ | ||
#ifdef __ARM_FEATURE_SVE | ||
ld1b 32bit unscaled offset uxtw: PASS | ||
ld1b 32bit unscaled offset sxtw: PASS | ||
ld1b 32bit unpacked unscaled offset uxtw: PASS | ||
ld1b 32bit unpacked unscaled offset sxtw: PASS | ||
ld1b 64bit unscaled offset: PASS | ||
ld1b 64bit unscaled offset Zt==Zm: PASS | ||
ld1sb 32bit unscaled offset uxtw: PASS | ||
ld1sb 32bit unscaled offset sxtw: PASS | ||
ld1sb 32bit unpacked unscaled offset uxtw: PASS | ||
ld1sb 32bit unpacked unscaled offset sxtw: PASS | ||
ld1sb 64bit unscaled offset: PASS | ||
ld1sb 64bit unscaled offset: PASS | ||
ld1h 32bit scaled offset uxtw: PASS | ||
ld1h 32bit scaled offset sxtw: PASS | ||
ld1h 32bit unpacked scaled offset uxtw: PASS | ||
ld1h 32bit unpacked scaled offset sxtw: PASS | ||
ld1h 32bit unpacked unscaled offset uxtw: PASS | ||
ld1h 32bit unpacked unscaled offset sxtw: PASS | ||
ld1h 32bit unscaled offset uxtw: PASS | ||
ld1h 32bit unscaled offset sxtw: PASS | ||
ld1h 64bit scaled offset: PASS | ||
ld1h 64bit unscaled offset: PASS | ||
ld1h 64bit unscaled offset Zt==Zm: PASS | ||
ld1sh 32bit scaled offset uxtw: PASS | ||
ld1sh 32bit scaled offset sxtw: PASS | ||
ld1sh 32bit unpacked scaled offset uxtw: PASS | ||
ld1sh 32bit unpacked scaled offset sxtw: PASS | ||
ld1sh 32bit unpacked unscaled offset uxtw: PASS | ||
ld1sh 32bit unpacked unscaled offset sxtw: PASS | ||
ld1sh 32bit unscaled offset uxtw: PASS | ||
ld1sh 32bit unscaled offset sxtw: PASS | ||
ld1sh 64bit scaled offset: PASS | ||
ld1sh 64bit unscaled offset: PASS | ||
ld1sh 64bit unscaled offset Zt==Zm: PASS | ||
ld1w 32bit scaled offset uxtw: PASS | ||
ld1w 32bit scaled offset sxtw: PASS | ||
ld1w 32bit unpacked scaled offset uxtw: PASS | ||
ld1w 32bit unpacked scaled offset sxtw: PASS | ||
ld1w 32bit unpacked unscaled offset uxtw: PASS | ||
ld1w 32bit unpacked unscaled offset sxtw: PASS | ||
ld1w 32bit unscaled offset uxtw: PASS | ||
ld1w 32bit unscaled offset sxtw: PASS | ||
ld1w 64bit scaled offset: PASS | ||
ld1w 64bit unscaled offset: PASS | ||
ld1w 64bit unscaled offset Zt==Zm: PASS | ||
ld1sw 32bit unpacked scaled offset uxtw: PASS | ||
ld1sw 32bit unpacked scaled offset sxtw: PASS | ||
ld1sw 32bit unpacked unscaled offset uxtw: PASS | ||
ld1sw 32bit unpacked unscaled offset sxtw: PASS | ||
ld1sw 64bit scaled offset: PASS | ||
ld1sw 64bit unscaled offset: PASS | ||
ld1sw 64bit unscaled offset Zt==Zm: PASS | ||
ld1d 32bit unpacked scaled offset uxtw: PASS | ||
ld1d 32bit unpacked scaled offset sxtw: PASS | ||
ld1d 32bit unpacked unscaled offset uxtw: PASS | ||
ld1d 32bit unpacked unscaled offset sxtw: PASS | ||
ld1d 64bit scaled offset: PASS | ||
ld1d 64bit unscaled offset: PASS | ||
ld1d 64bit unscaled offset Zt==Zm: PASS | ||
#endif /* __ARM_FEATURE_SVE */ | ||
---- <application exited with code 0> ---- | ||
Basic counts tool results: | ||
Total counts: | ||
.* total \(fetched\) instructions | ||
.* total unique \(fetched\) instructions | ||
.* total non-fetched instructions | ||
.* total prefetches | ||
.* total data loads | ||
.* total data stores | ||
.* total icache flushes | ||
.* total dcache flushes | ||
1 total threads | ||
.* total scheduling markers | ||
.* | ||
Thread .* counts: | ||
.* \(fetched\) instructions | ||
.* unique \(fetched\) instructions | ||
.* non-fetched instructions | ||
.* prefetches | ||
.* data loads | ||
.* data stores | ||
.* icache flushes | ||
.* dcache flushes | ||
.* scheduling markers | ||
.* |
File renamed without changes.