Data migrations

What they are and when to use them

Data migrations are generally used for code we want to run at or just after deploy time, and which does at least one of the following:

- requires interacting with another system, such as publishing-api or the search index
- depends on specific data being present in the database (for example, a third party running our code won't have access to a copy of our production data)
- is a long-running data change that you want the flexibility to run during quiet times

It is useful to keep them separate because developers and CI need to be able to run normal migrations without needing specific data in their database or other services to be running.

The alternative is to write rake tasks, which might be single-use and might need their arguments documented separately from the related code changes.

They are implemented by reusing Rails' migration code, but they have their own rake task and database table for tracking which ones have been run.
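
The tracking idea can be sketched in plain Ruby. This is a simplified illustration, not the application's actual implementation: the `DataMigrationTracker` class, the in-memory set standing in for the dedicated database table, and the file names are all hypothetical.

```ruby
require "set"

# Hypothetical sketch: records which data migrations have been run,
# separately from schema migrations. A Set stands in for the database table.
class DataMigrationTracker
  def initialize(recorded_versions = [])
    @recorded_versions = Set.new(recorded_versions)
  end

  # Extracts the leading timestamp from a file name such as
  # "20140402115507_my_data_migration_name.rb".
  def version_of(filename)
    File.basename(filename)[/\A\d+/]
  end

  # Returns the files whose version has not been recorded yet, oldest first.
  def pending(filenames)
    filenames
      .reject { |f| @recorded_versions.include?(version_of(f)) }
      .sort_by { |f| version_of(f).to_i }
  end

  # Records a file as run so that it is skipped next time.
  def mark_as_run(filename)
    @recorded_versions << version_of(filename)
  end
end

files = [
  "20140402115507_my_data_migration_name.rb",
  "20140301090000_an_earlier_change.rb",
]

# Pretend the earlier migration has already been recorded as run.
tracker = DataMigrationTracker.new(["20140301090000"])
pending_files = tracker.pending(files)
puts pending_files

tracker.mark_as_run(pending_files.first)
puts tracker.pending(files).empty?
```

Because the record of what has run is kept apart from the schema migration record, running normal migrations never triggers a data migration, and vice versa.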

How to add one

As with a normal migration, there is a Rails generator command:

  bundle exec rails g migration MyDataMigrationName

How to run them

Data migrations don't run automatically; they have to be run manually in every environment.

Development

Rake task:

  bundle exec rake db:data:migrate

or up to a specific version:

  bundle exec rake db:data:migrate VERSION=20140402115507
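
The effect of `VERSION` can be sketched as follows, assuming it behaves the way it does for standard Rails migrations when migrating upwards, i.e. every pending migration with a version at or below the target is run. The `versions_to_run` helper and the sample versions are hypothetical and only illustrate the selection rule.

```ruby
# Hypothetical helper: given the pending versions and an optional target,
# returns the versions that would be run, oldest first.
# Assumption: the target version itself is included, as with Rails migrations.
def versions_to_run(pending_versions, target_version = nil)
  sorted = pending_versions.map(&:to_i).sort
  return sorted if target_version.nil?

  sorted.select { |version| version <= target_version.to_i }
end

pending = %w[20140402115507 20140301090000 20140515120000]

# No target: everything pending is selected.
p versions_to_run(pending)

# With a target: later migrations are left for another run.
p versions_to_run(pending, "20140402115507")
```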

At deploy time

For convenience, there is a Jenkins job that saves you having to ssh onto a box to run the rake task. On deploy.integration the job is Run_Whitehall_Data_Migrations.
