-
Notifications
You must be signed in to change notification settings - Fork 176
DIRAC 9.0
The concept of "Setup" is in the process of disappearing. The concept (which is explained in https://dirac.readthedocs.io/en/latest/AdministratorGuide/Introduction/diraccomponents.html) has a long history and effectively enabled the possibility to use one single machine as a server for multiple setups/installations (e.g. production and testing setups on the same node). This possibility is recognized now as being not useful. Therefore, this functionality is being removed.
-
1024-bit proxies no longer work. ProxyDB should be cleared of 1024 bit proxies before updating to minimise the problems.
-
PR https://github.com/DIRACGrid/DIRAC/pull/6466 removes environment variable DIRAC_USE_JSON_DECODE and flips the default of DIRAC_USE_JSON_ENCODE. Most probably nothing has to be done, unless you've overwritten these values in the past.
-
PR https://github.com/DIRACGrid/DIRAC/pull/6561 effectively makes centralized logging obligatory by swapping the default of the EnableSecurityLogging flag. Make sure to follow instructions
-
PR https://github.com/DIRACGrid/DIRAC/pull/6600 simplifies InstalledComponentDB. Apply the following:
use InstalledComponentsDB
ALTER TABLE `HostLogging` DROP COLUMN `Setup`;
-
PR https://github.com/DIRACGrid/DIRAC/pull/6605 removes Framework/CAUpdateAgent. So, remove it using the
dirac-admin-sysadmin-cli
if you have it installed -
PR https://github.com/DIRACGrid/DIRAC/pull/7442 removes the NotificationDB. Very likely you never used its tables, but please check if you have something stored in it. Then:
DROP DATABASE NotificationDB;
PR https://github.com/DIRACGrid/DIRAC/pull/7671 removes the concepts of Setup and System instances. The code of the components is updated not to use it any more. However, the Configuration should be also updated in order to drop unnecessary information and simplify the schema of the Systems section. The update of the configuration is performed with the convert_nosetup.py script. After the upgrade of the DIRAC installation to 9.0, run this script with the dirac_admin privileges and restart all the services.
Other script to remove the Setup
from Operations
: https://github.com/DIRACGrid/DIRAC/pull/7218
PR https://github.com/DIRACGrid/DIRAC/pull/6565/ makes several simplifications to the Accounting system, and removes the concept of "Setup". This concept has always been reflected in the MySQL database table names, which need to be updated. In order to do so:
- Stop inserting of records, e.g. by stopping the Accounting/DataStore service(s)
- You will have several tables for which the name has to be altered. The following selection will print out the SQL command that you will need to issue to have things done (beware to replace "DIRAC-Certification" with the name of your setup).
SET group_concat_max_len=5000;SELECT group_concat(v.name separator '; ')
FROM (
SELECT concat('RENAME TABLE `', t.table_name, '` TO `', replace(t.table_name, '_DIRAC-Certification_', '_'), '`') name
FROM information_schema.tables t
WHERE table_name like '%_DIRAC-Certification_%'
) v;
(you might need to run the above more than once).
And then:
DELETE FROM `ac_catalog_Types` where name LIKE 'DIRAC-Certification%'
(again, replace 'DIRAC-Certification%'
with the name of your setup).
- Restart the DataStore service(s)
Remove of DIRACSetup and OwnerDN in WMS and Transformations, plus several other simplifications and changes
Following changes in PRs including:
- https://github.com/DIRACGrid/DIRAC/pull/6164
- https://github.com/DIRACGrid/DIRAC/pull/6566
- https://github.com/DIRACGrid/DIRAC/pull/7157
- https://github.com/DIRACGrid/DIRAC/pull/7124
- https://github.com/DIRACGrid/DIRAC/pull/7110
- Stop pilots submission, e.g. by stopping the SiteDirector agent(s)
- Stop job and PilotManager registration and jobs matching, e.g. by stopping the JobManager, PilotManager, Matcher service(s)
- Stop transformations submission, e.g. by stopping the TransformationManager service(s)
- Alter the following tables:
use JobDB;
ALTER TABLE `Jobs` ADD COLUMN `VO` VARCHAR(64);
use PilotAgentsDB;
ALTER TABLE `PilotAgents` ADD COLUMN `VO` VARCHAR(64);
use TaskQueueDB;
ALTER TABLE `tq_TaskQueues` ADD COLUMN `Owner` VARCHAR(255) NOT NULL;
ALTER TABLE `tq_TaskQueues` ADD COLUMN VO VARCHAR(64);
use SandboxMetadataDB;
ALTER TABLE `sb_Owners` ADD COLUMN `VO` VARCHAR(64);
use TransformationDB;
ALTER TABLE `Transformations` ADD COLUMN `Author` VARCHAR(255) NOT NULL;
ALTER TABLE `Transformations` MODIFY COLUMN `AuthorDN` VARCHAR(255) DEFAULT NULL;
- Save the following script in a (whatever, e.g. in
/opt/dirac
) directory of a DIRAC server machine:
Then run it with python script_name.py -o /DIRAC/Security/UseServerCertificate=yes
-
Restart the JobManager, PilotManager and Matcher service(s)
-
Restart the TransformationManager service(s)
-
Restart the SiteDirector(s)
-
Further DB changes can take place now:
use JobDB;
ALTER TABLE `Jobs` DROP COLUMN `DIRACSetup`;
ALTER TABLE `Jobs` DROP COLUMN `OwnerDN`;
ALTER TABLE `Jobs` DROP COLUMN `JobSplitType`;
ALTER TABLE `Jobs` DROP COLUMN `MasterJobID`;
ALTER TABLE `Jobs` DROP COLUMN `ApplicationNumStatus`;
ALTER TABLE `Jobs` DROP COLUMN `SystemPriority`;
ALTER TABLE `Jobs` DROP COLUMN `DeletedFlag`;
ALTER TABLE `Jobs` DROP COLUMN `KilledFlag`;
ALTER TABLE `Jobs` DROP COLUMN `FailedFlag`;
ALTER TABLE `Jobs` DROP COLUMN `ISandboxReadyFlag`;
ALTER TABLE `Jobs` DROP COLUMN `OSandboxReadyFlag`;
ALTER TABLE `Jobs` DROP COLUMN `RetrievedFlag`;
DROP TABLE `SiteMask`
DROP TABLE `SiteMaskLogging`
use PilotAgentsDB;
ALTER TABLE `PilotAgents` DROP COLUMN `GridRequirements`;
ALTER TABLE `PilotAgents` DROP COLUMN `OwnerDN`;
ALTER TABLE `PilotAgents` DROP COLUMN `ParentID`;
ALTER TABLE `PilotAgents` DROP COLUMN `TaskQueueID`;
ALTER TABLE `PilotAgents` DROP COLUMN `OwnerGroup`;
ALTER TABLE `PilotAgents` DROP COLUMN `Broker`;
ALTER TABLE `PilotAgents` DROP COLUMN `OutputReady`;
use TaskQueueDB;
ALTER TABLE `tq_TaskQueues` DROP COLUMN `Setup`;
ALTER TABLE `tq_TaskQueues` DROP COLUMN `OwnerDN`;
DROP TABLE `tq_TQToSubmitPools`;
use SandboxMetadataDB;
ALTER TABLE `sb_EntityMapping` DROP COLUMN `EntitySetup`;
ALTER TABLE `sb_Owners` DROP COLUMN `OwnerDN`;
use ProxyDB;
DROP TABLE ProxyDB_Proxies;
DROP TABLE ProxyDB_Tokens;
PR https://github.com/DIRACGrid/DIRAC/pull/7439 removes the possibility to use external StorageElements as SandboxStore. Most probably you have nothing to do.
You can add the command RegisterPilot
in the list of commands run by the Pilot (this is already the default, and it only applies if you were previously running with a non-default list of commands)
With the changes to the permission enforcement for the Transformation one has to make sure that the Hosts running TransformationSystem Agents have the ProductionManagement property. The same is true for a shifterProxy and groups that are used for Transformations
Following https://github.com/DIRACGrid/DIRAC/issues/7335
ALTER TABLE `Transformations` ADD INDEX `status_index` (`Status`), ADD INDEX `type_index` (`Type`), ALGORITHM=INPLACE;
VMDIRAC has been removed from the release.
- Stop agent(s) WorkloadManagement/CloudDirector and service(s) WorkloadManagement/VirtualMachineManager, either directly from the machine where they are running:
runsvctrl d /opt/dirac/startup/WorkloadManagement_CloudDirector
runsvctrl d /opt/dirac/startup/WorkloadManagement_WorkloadManagement
or better using the dirac-admin-sysadmin-cli
CLI.
- Remove VirtualMachineDB
DROP DATABASE VirtualMachineDB;
- uninstall VirtualMachineManager and CloudDirector. Use the
dirac-admin-sysadmin-cli
CLI. - remove any credentials you might have used from the server
PR https://github.com/DIRACGrid/DIRAC/pull/6208 introduces a possibility to store pilot log files to remote storage. It is foreseen to use a plugin for this purpose. The PR contains a FileCacheLoggingPlugin
which sends the logs to a SE.
- Install
WorkloadManagement/TornadoPilotLoggingHandler
service - Install
WorkloadManagement/PilotLoggingAgent
agent
Configuration:
Configuration is done in a VO by VO basis, in a VO-specific Pilot section in Operations
. Defaults
section can be used as usual to set up initial settings for all VOs.
- Enable remote login in the Pilot section of a VO:
RemoteLogging = True
- Set the service URL:
RemoteLoggerURL = https://dirac.host.name:8444/WorkloadManagement/TornadoPilotLogging
- Set the upload SE, e.g.:
UploadSE = UKI-LT2-IC-HEP-disk
- Uploading is done by a Shifter called
DataManager
, so a shifter of this name should be defined in a shifter section of the VO - Set the upload path a VO can write to, e.g.:
UploadPath = /gridpp/pilotlogs
The TornadoPilotLoggingHandler
service requires a plugin name to be specified under Services/TornadoPilotLogging
:
LoggingPlugin = FileCacheLoggingPlugin
PR https://github.com/DIRACGrid/DIRAC/pull/7710 removes the old OptimizerModule. Chances are you have nothing to do, but in case your extension use it, move its corresponding optimizers to the Executor framework.
Following changes in PR https://github.com/DIRACGrid/DIRAC/pull/7162
- Stop submission of productions, e.g. by stopping the ProductionManager service(s)
- Alter the following table:
use ProductionDB;
ALTER TABLE `Productions` ADD COLUMN `Author` VARCHAR(255) NOT NULL;
ALTER TABLE `Productions` MODIFY COLUMN `AuthorDN` VARCHAR(255) DEFAULT NULL;
- Restart the ProductionManager service(s)
Following changes in PR https://github.com/DIRACGrid/DIRAC/pull/6419 and https://github.com/DIRACGrid/DIRAC/pull/6653
- Stop submission of requests, e.g. by stopping the ReqManager service(s)
- Alter the following table:
use ReqDB;
ALTER TABLE `Request` DROP COLUMN `DIRACSetup`;
ALTER TABLE `Request` ADD COLUMN `Owner` VARCHAR(255);
- Restart the ReqManager service(s)
Following changes in PR https://github.com/DIRACGrid/DIRAC/pull/7762, RSS is now mandatory. If RSS is not yet active for you, BEFORE installing Dirac v9, you need to activate it by following instructions at https://dirac.readthedocs.io/en/latest/AdministratorGuide/Systems/ResourceStatus/install.html
ARC
and ARC6
have been removed and should be replaced with AREX
.
- From a DIRAC client, execute the following command to get the port used by the CE to provide the AREX interface:
$ arcinfo -c <ce name>
Computing service: arc (production)
Information endpoint: ldap://<ce name>:2135/Mds-Vo-Name=local,o=grid (org.nordugrid.ldapng)
Information endpoint: ldap://<ce name>:2135/o=glue (org.nordugrid.ldapglue2)
Information endpoint: https://<ce name>:443/arex (org.nordugrid.arcrest) <------------ get the port used here
Information endpoint: https://<ce name>:443/arex (org.ogf.glue.emies.resourceinfo)
Submission endpoint: https://<ce name>:443/arex (status: ok, interface: org.nordugrid.arcrest)
Submission endpoint: https://<ce name>:443/arex (status: ok, interface: org.ogf.glue.emies.activitycreation)
Submission endpoint: gsiftp://<ce name>:2811/jobs (status: ok, interface: org.nordugrid.gridftpjob)
- Simply replace
CEType=ARC
orARC6
withCEType=AREX
in the configuration - If the port is not
443
(the default one), then you also need to add the following option to your configuration:
Port = <AREX port>