-
Notifications
You must be signed in to change notification settings - Fork 304
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
aurora/2.6 test branch #14539
base: release/2.6
Are you sure you want to change the base?
aurora/2.6 test branch #14539
Conversation
Errors are component not formatted correctly,Ticket number prefix incorrect,PR title is malformatted. See https://daosio.atlassian.net/wiki/spaces/DC/pages/11133911069/Commit+Comments,Unable to load ticket data |
54e974f
to
da88cb8
Compare
da88cb8
to
781680c
Compare
9160091
to
99b7a5a
Compare
Test stage Build on EL 8 completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-14539/8/execution/node/387/log |
Test stage Build RPM on EL 9 completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-14539/8/execution/node/384/log |
Test stage Build RPM on EL 8 completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-14539/8/execution/node/380/log |
Test stage Build RPM on Leap 15.5 completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-14539/8/execution/node/379/log |
Test stage Build DEB on Ubuntu 20.04 completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-14539/8/execution/node/318/log |
fa6571a
to
20cf312
Compare
Test stage Functional on EL 8.8 completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-14539/12/execution/node/824/log |
20cf312
to
8e72091
Compare
Test stage Functional on EL 8.8 completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-14539/13/execution/node/823/log |
8e72091
to
f21c6e5
Compare
Test stage Functional on EL 8.8 completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-14539/14/execution/node/804/log |
f21c6e5
to
258b9ac
Compare
Test stage Functional on EL 8.8 completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-14539/15/execution/node/798/log |
src/tests/ftest/harness/coverage.py
Outdated
@@ -6,6 +6,7 @@ | |||
|
|||
from apricot import TestWithServers | |||
from dfuse_utils import get_dfuse, start_dfuse | |||
from general_utils import get_journalctl, journalctl_time, get_journalctl_command |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
need to remove
@@ -923,7 +928,7 @@ def create_directory(hosts, directory, timeout=15, verbose=True, | |||
|
|||
""" | |||
mkdir_command = "/usr/bin/mkdir -p {}".format(directory) | |||
command = get_clush_command(hosts, args="-S -v", command=mkdir_command, command_sudo=sudo) | |||
command = get_clush_command(hosts, args="-S -B -v", command=mkdir_command, command_sudo=sudo) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
While scaling for 2KECBs I encountered an issue while agent startup when it was unable to create dirs to be able to copy on all ECB in time. I tried increasing the timeout and that did not help either. But adding fanout to clush command helped instead.
command = get_clush_command(hosts, args="-S -B -v", command=mkdir_command, command_sudo=sudo) | |
command = get_clush_command(hosts, args="-S -B -v -f 1024", command=mkdir_command, command_sudo=sudo) |
Or something like that.
Let me know what you think of adding this here in the build.
IMO if there is a way to add some higher fanout values at a single place to all clush commands in our infrastructure for Aurora, it will he helpful.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Really these direct clush commands should be replaced with run_remote, then run_remote can increase the fanout for all clush commands.
0ae5b72
to
2d3aee5
Compare
Test stage Python Bandit check completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-14539/20/execution/node/145/log |
f7b0319
to
c27ea4d
Compare
4d8cb8b
to
2f1d59f
Compare
Required-githooks: true Signed-off-by: Dalton Bohning <[email protected]>
Skip-build: true Required-githooks: true Signed-off-by: Dalton Bohning <[email protected]>
Skip-test: true Skip-build: true Required-githooks: true Signed-off-by: Dalton Bohning <[email protected]>
Skip-test: true Skip-build: true Required-githooks: true Signed-off-by: Dalton Bohning <[email protected]>
Skip-test: true Skip-build: true Required-githooks: true Signed-off-by: Dalton Bohning <[email protected]>
Skip-build: true Summary: Agent failure test changes to support ECBs. Required-githooks: true Signed-off-by: Padmanabhan <[email protected]>
Skip-build: true Required-githooks: true Signed-off-by: Maureen Jean <[email protected]>
Required-githooks: true Signed-off-by: Maureen Jean <[email protected]>
PR-repos: libfabric@PR-77:lastBuild mercury@PR-122:lastBuild Required-githooks: true Signed-off-by: Jerome Soumagne <[email protected]>
SKip-build: true Required-githooks: true Signed-off-by: Dalton Bohning <[email protected]>
Skip-build: true Required-githooks: true Signed-off-by: Maureen Jean <[email protected]> Signed-off-by: Dalton Bohning <[email protected]>
2f1d59f
to
d7eb3dd
Compare
Skip-build: true The patch contains the following improvements: 1. When VOS level logic returns -DER_TX_RESATRT, the object level RPC handler should set 'RESEND' flag then restart the transaction with newer epoch. Because dtx_abort() logic cannot guarantee all former prepared DTX entries (on all related participants) can be aborted, especially if the former one failed for some network trouble, that may cause restarted transaction hit -DER_TX_ID_REUSED unexpectedly. 2. Compare the epoch for DTX entries with the same transaction ID for distinguishing potential reused TX ID more accurately. 3. Add DTX entry into DTX CoS cache if cannot commit it synchronously. Then subsequent batched commit logic can handle it. 4. If server complains suspected TX ID reusing, then reports -EIO to related application instead of assertion on client. 5. Control DTX related warning message frequency to avoid log flood. 6. Collect more information when generate some error/warning message. Allow-unstable-test: true Required-githooks: true Signed-off-by: Fan Yong <[email protected]>
2.6 test branch for Aurora
Before requesting gatekeeper:
Features:
(orTest-tag*
) commit pragma was used or there is a reason documented that there are no appropriate tags for this PR.Gatekeeper: