-
Notifications
You must be signed in to change notification settings - Fork 7
/
Copy pathM-SEATS-decomposition_old.qmd
541 lines (329 loc) · 50.7 KB
/
M-SEATS-decomposition_old.qmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
# SEATS decomposition {#m-seats .unnumbered}
left to be modified
- references to a-seats chapter (and vice versa)
- small tweaks
- references
to be checked: can be done later
SEATS is an Arima Model Based (AMB) decomposition which is the second part of Tramo-SEATS seasonal adjustment algorithm, originally developed by the Bank of Spain.
SEATS is a program for estimating unobserved components in a time series. It follows the ARIMA-model-based (AMB) method, developed from the work of CLEVELAND, W.P., and TIAO, G.C. (1976), BURMAN, J.P. (1980), HILLMER, S.C., and TIAO, G.C. (1982), BELL, W.R., and HILLMER, S.C. (1984) and MARAVALL, A., and PIERCE, D.A. (1987).
(up coming here compact reference)
In recent years, SEATS implementation in JDemetra+ v 3.x has been extended to infra-monthly data (weekly, daily, hourly...). Handling this kind of data gave way to a tailored AMB decomposition whose peculiarities are described in [this chapter](#m-sa-hf).
## In this chapter {#m-seats-intro}
This chapter provides details on [seats algorithm steps](#m-seats-steps) - series\](#m-seats-stages-output) - [quality measures](#m-seats-qm) - [filter length choices](#m-seats-filters-length-choice) - [extreme values correction](#seats-I-correc)
The practical implementation as well as all the options, using the graphical user interface or R packages are described in [this chapter](#a-sa-seats)
## SEATS steps
SEATS decomposes the linearized series into trend, seasonal, transitory and irregular components, provides forecasts for these components, together with the associated standard errors, and finally assign the deterministic effects to each component yielding the final components.
### Input from Tramo
In JDemetra+ the input for the model based signal extraction procedure is always provided by TRAMO and includes the original series $y_{t}$, the linearized series $x_{t}$ (i.e. the original series $y_{t}$ with the deterministic effects removed), the ARIMA model for the stochastic (linearized) time series $x_{t}$ and the deterministic effects (calendar effects, outliers and other regression variable effects).
### ARIMA modelling of the input series
One of the fundamental assumptions made by SEATS is that the linearized time series $x_{t}$ follows the ARIMA model
Equation 1
$$
\phi(B)\delta(B)x_{t} = \theta(B)a_{t}
$$
where:
- $B$ -- the backshift operator $(Bx_{t} = x_{t-1})$;
- $\delta(B)$ -- a non-stationary autoregressive (AR) polynomial in $B$ (unit roots);
- $\theta(B)$ -- an invertible moving average (MA) polynomial in $B$ and in $B^{S}$, which can be expressed in the multiplicative form $(1+\vartheta_{1}B + \ldots{+ \ \vartheta}{q}B^{q})(1+\Theta{1}B^{s} + \ldots{+ \ \Theta}_{Q}B^{\text{sQ}})$ ;
- $phi(B)$ -- a stationary autoregressive (AR) polynomial in $B$ and in $B^{S}$ containing regular and seasonal unit roots, with *s* representing the number of observations per year;
- $a_{t}$ -- a white-noise variable with the variance$V(a)$.
It should be noted that the stochastic time series can be predicted using its past observations and making an error. The variable $a_{t}$, which is assumed to be white noise, is the fundamental *innovation* to the series at time *t*, that is the part that cannot be predicted based on the past history of the series.
Denoting $\varphi(B) = \phi(B)\delta(B)$,can be written in a more concise form as
Equation 2
$$
\varphi(B)x_{t} = \theta(B)a_{t}
$$
where $\varphi(B)$ contains both the stationary and the nonstationary roots.
### Derivation of the models for the components
Let us consider the additive decomposition model
Equation 3
$$
x_{t} = \sum_{i = 1}^{k}x_{\text{it}}
$$
where *i* refers to the orthogonal components: trend, seasonal, transitory or irregular. Apart from the irregular component, supposed to be a white noise, it is assumed that each component follows the ARIMA model which can be represented, using the notation of equation 2 as:
Equation 4
$$
\varphi_{i}(B)\ x_{\text{it}} = \theta_{i}(B)a_{\text{it}}
$$
where
- $\varphi_{i}(B) = \phi_{i}(B)\delta_{i}(B),...x_{\text{it}}$ is the *i*-*th* unobserved component,
- $\varphi_{i}(B)$ and $\theta_{i}(B)$ are finite polynomials of order $p_{i}$ and $q_{i}$, respectively,
- $a_{\text{it}}$, the disturbance associated with such component, is a white noise process with zero mean and constant variance $V(a_{i})$ and $a_{\text{it}}$ and $a_{\text{jt}}$ are not correlated for $i \neq j$ and for any $t$..
These disturbances are functions of the innovations in the series and are called "pseudo-innovations" in the literature concerning the AMB decomposition as they refer to the components that are never observed In the JDemetra+ documentation the term "innovations" is used to refer to "pseudo-innovations".(GÓMEZ, V., and MARAVALL, A. (2001a).
The following assumptions hold for Equation 4. For each $\text{i}$ the polynomials $\phi_{i}(B)$, $\delta_{i}(B)$ and $\theta_{i}(B)$ are prime and of finite order. The roots of $\delta_{i}(B)$ lies on the unit circle; those of $\phi_{i}(B)$ lie outside, while all the roots of $\theta_{i}(B)$ are on or outside the unit circle. This means that non stationary and non invertible components are allowed. Since different roots of the AR polynomial induce peaks in the spectrum of the series at different frequencies, and given that different components are associated with the spectral peaks for different frequencies, it is assumed that for $i \neq j$ the polynomials$\ \phi_{i}(B)$ and $\phi_{j}(B)$ do not share any common root (they are coprime). Finally, it is assumed that the polynomials $\theta_{i}(B),\ i = 1,\ldots,k$ are prime share no unit root in common, guaranteeing the invertibility of the overall series. In fact, since the unit root of $\theta_{i}(B)$ induce a spectral zero, when the polynomials $\theta_{i}(B),\ i = 1,\ldots,k$ share no unit root in common, there is no frequency for which all component spectra become zero.
For description of the spectrum see chapter on [Spectral Analysis](#m-spectrum).(MARAVALL, A. (1995).
Since aggregation of ARIMA models yields ARIMA models, the series $x_{t}$ will also follow an ARIMA model, as in Equation 2, and consequently the following identity can be derived:
Equation 5:
$$
\frac{\theta(B)}{\varphi(B)}a_{t} = \sum_{i = 1}^{k}{\frac{\theta_{i}(B)}{\varphi_{i}(B)}a_{\text{it}}}
$$
In the ARIMA model based approach implemented in SEATS, the ARIMA model identified and estimated for the observed series $x_{t}$ is decomposed to derive the models for the components. In particular, the AR polynomials for the components, $\varphi_{i}(B)$, are easily derived through the factorization of the AR polynomial $\varphi(B)$:
Equation 6
$$
\varphi(B) = \prod_{i = 1}^{k}{\varphi_{i}(B)}
$$
while the MA polynomials for the components, together with the innovation variances $V(a_{i})$, cannot simply be obtained through the relationship:
Equation 7:
$$
\theta(B)a_{t} = \sum_{i= 1}^{k}{\varphi_{\text{ni}}(B)}\theta_{i}(B)a_{\text{it}}
$$
where $\varphi_{\text{ni}}(B)$ is the product of all $\varphi_{j}(B),\ j = 1,\ldots,k$, except from $\varphi_{i}(B)$. Further assumptions are therefore needed to cope with the under identification problem: i) $p_{i} \geq q_{i}$ and ii) the canonical decomposition, i.e. the decomposition that allocate all additive white noise to the irregular component (yielding non invertible components except the irregular).
To understand how SEATS factorizes the AR polynomials, first a concept of a root will be explored.(Description based on KAISER, R., and MARAVALL, A. (2000) and MARAVALL, A. (2008c).)
Equation 2 can be expressed as:
Equation 8:
$$
\psi^{- 1}(B)x_{t} = a_{t}(1+\varphi_{1}B + \ldots\varphi_{p}B^{p})x_{t} =(1+\theta_{1}B + \ldots\theta_{q}B^{q})a_{t}
$$
Let us now consider Equation 2 in the inverted form:
Equation 9:
$$
\theta(B)y_{t} = \varphi(B)a_{t}
$$
If both sides of Equation 8 multiplied by $x_{t-k}$ with $k > q$, and expectations are taken, the right hand side of the equation vanishes and the left hand side becomes:
Equation 10
$$
\varphi(B)\gamma_{k} = \gamma_{k} + \varphi_{1}\gamma_{k-1} + \ldots\varphi_{p}\gamma_{k-p} = 0
$$
where $B$ operates on the subindex $k$.
The autocorrelation function $\gamma_{k}$ is a solution of Equation 10 with the characteristic equation:
Equation 11:
$$
z^{p} +\varphi_{1}z^{p-1} + \ldots\varphi_{p-1}z + \varphi_{p} = 0
$$
If $z_{1}$,...,$\ z_{p}$ are the roots of Equation 11, the solutions of Equation 10 can be expressed as:
Equation 12:
$$
\gamma_{k} = \sum_{i = 1}^{p}z_{i}^{k}
$$
and will converge to zero as $karrow \infty$ when $|r_{i}| < 1,\ i = 1,\ldots,p$. From Equation 10 and Equation 12 it can be noticed that $z_{1} = B_{i}^{-1}$, meaning that $z_{1}$,...,$\ z_{p}$ are the inverses of the roots $B_{1},\ldots,B_{p}$ of the polynomial $\varphi(B)$. The convergence of $\gamma_{k}$ implies that the roots of the $\varphi(B)$ are larger than 1 in modulus (lie outside the unit circle). Therefore, from the equation 13.
Equation 13
$$
{\varphi(B)}^{- 1} = \frac{1}{(1-z_{1})\ldots(1-z_{1})}
$$
it can be derived that ${\varphi(B)}^{- 1}$ is convergent and all its inverse roots are less than 1 in modulus.
Equation 11 has real and complex roots (solutions). Complex number $x = a + bi$, with $a$ and $\text{b}$ both real numbers, can be represented as $x = r(cos(\omega) + i\ sin(\omega))$, where $i$ is the imaginary unit ${i}^{2}=-1)$, $r$ is the modulus of $x$, that is $r=|x| = \sqrt{a^{2} + b^{2}}$ and $\omega$ is the argument (frequency). When roots are complex, they are always in pairs of complex conjugates. The representation of the complex number $x = a + bi$ has a geometric interpretation in the complex plane established by the real axis and the orthogonal imaginary axis.
![**Geometric representation of a complex number and of its conjugate**](All_images/UG_A_image3.png)
Representing the roots of the characteristic equation 11 in the complex plane enhances understanding how they are allocated to the components. When the modulus $r$ of the roots in $\text{z}$ are greater than 1 (i.e. modulus of the roots in $\varphi(B)< 1$), the solution of the characteristic equation has a systematic explosive process, which means that the impact of the given impulse on the time series is more and more pronounced in time. This behaviour is not in line with the developments that can be identified in actual economic series. Therefore, the models estimated by TRAMO-SEATS (and X-13ARIMA-SEATS) have never inverse roots in $B$ with modulus greater than 1.
The characteristic equations associated with the regular and the seasonal differences have roots in $\varphi(B)$ with modulus $r = 1$. They are called non-stationary roots and can be represented on the unit circle. Let us consider the seasonal differencing operator applied to a quarterly time series $(1-B^{4})$. Its characteristic equation is ${(z}^{4}-1) = 0$ with solutions given by$z = \sqrt[4]{1}$, i.e. $z_{1,2} = \pm 1$ and $z_{3,4} = \pm i1$. The first two solutions are real and the last two are complex conjugates. They are represented by the black points on the unit circle on the figure below.
![**Unit roots on the unit circle**](All_images/UG_A_image4.png)
For the seasonal differencing operator $(1-B^{12})$ applied to the monthly time series the characteristic equation ${(z}^{12}-1) = 0$ has twelve non-stationary solutions given by $z = \sqrt[12]{1}:$ two real and ten complex conjugates, represented by the white circles in unit roots figure above.
The complex conjugates roots generate the periodic movements of the type:
Equation 14
$$
z_{t} = A^{t}\cos(\omega t + W)
$$
where:
- $A$ -- amplitude;
- $\omega$ -- angular frequency (in radians);
- $W$ -- phase (angle at $t = 0)$.
The frequency $f$, i.e. the number of cycles per unit time, is $\frac{\omega}{2\pi}$. If it is multiplied by *s*, the number of observations per year, the number of cycles completed in one year is derived. The period of function in Equation 14, denoted by $\tau$, is the number of units of time (months/quarters) it takes for a full circle to be completed.
For quarterly series the seasonal movements are produced by complex conjugates roots with angular frequencies at $\frac{\pi}{2}$ (one cycle per year) and $\pi$ (two cycles per year). The corresponding number of cycles per year and the length of the movements are presented in the table below.
**Seasonal frequencies for a quarterly time series**
| **Angular frequency (**$\omega$) | **Frequency (cycles per unit time) (**$f$) | **Cycles per year** | **Length of the movement measured in quarters (**$\tau$) |
|------------------|------------------|------------------|--------------------|
| $\frac{\pi}{2}$ | 0.25 | 1 | 4 |
| $\pi$ | 0.5 | 2 | 2 |
For monthly time series the seasonal movements are produced by complex conjugates roots at the angular frequencies: $\frac{\pi}{6},\frac{\pi}{3}, \frac{\pi}{2}, \frac{2\pi}{3}, \frac{5\pi}{6}$ and $\pi$. The corresponding number of cycles per year and the length of the movements are presented in the table below:
**Seasonal frequencies for a monthly time series**
<!-- {: .table .table-style} -->
| **Angular frequency (**$\omega$) | **Frequency (cycles per time unit) (**$f$) | **Cycles per year** | **Length of the movement measured in months (**$\tau$) |
|----------------|--------------------|------------------|--------------------|
| $\frac{\pi}{6}$ | 0.083 | 1 | 12 |
| $\frac{\pi}{3}$ | 0.167 | 2 | 6 |
| $\frac{\pi}{2}$ | 0.250 | 3 | 4 |
| $\frac{2\pi}{3}$ | 0.333 | 4 | 3 |
| $\frac{5\pi}{6}$ | 0.417 | 5 | 2.4 |
| $\pi$ | 0.500 | 6 | 2 |
In JDemetra+ SEATS assigns the roots of the AR full polynomial to the components according to their associated modulus and frequency (For details see MARAVALL, A., CAPORELLO, G., PÉREZ, D., and LÓPEZ, R. (2014))
- Roots of $(1-B)^{d}$ are assigned to trend component.
- Roots of $(1-B^{s})^{d_{s}} = {((1-B)(1+B + \ldots + B^{s-1}))}^{d_{s}}$ are assigned to the trend component (root of ${(1-B)}^{d_{s}}$) and to the seasonal component (roots of ${(1+B + \ldots + B^{s-1})}^{d_{s}}$).
- When the modulus of the inverse of a real positive root of $\varphi(B)$ is greater than $k$ or equal to $k$, where $k$ is the threshold value controlled by the *Trend boundary* parameter, then the root is assigned to the trend component. Otherwise it is assigned to the transitory component.
- Real negative inverse roots of $\text{ϕ}_{p}(B)$ associated with the seasonal two-period cycle are assigned to the seasonal component if their modulus is greater than *k*, where $k$ is the threshold value controlled by the *Seasonal boundary* and the *Seas. boundary (unique)* parameters. Otherwise they are assigned to the transitory component.
- Complex roots, for which the argument (angular frequency) is close enough to the seasonal frequency are assigned to the seasonal component. Closeness is controlled by the *Seasonal tolerance* and *Seasonal tolerance (unique)* parameters. Otherwise they are assigned to the transitory component.
- If $d_{s}$ (seasonal differencing order) is present and $\text{Bphi} < 0$ ($\text{Bphi}$ is the estimate of the seasonal autoregressive parameter), the real positive inverse root is assigned to the trend component and the other ($s-1$) inverse roots are assigned to the seasonal component. When $d_{s} = 0$, the root is assigned to the seasonal when $\text{Bphi} < - 0.2$ and/or the overall test for seasonality indicates presence of seasonality. Otherwise it goes to the transitory component. Also, when $\text{Bphi} > 0$, roots are assigned to the transitory component.
It should be highlighted that when $Q > P$, where $Q$ and $P$ denote the orders of the polynomials $\varphi(B)$ and $\theta(B)$, the SEATS decomposition yields a pure MA $(Q - P)$ component (hence transitory). In this case the transitory component will appear even when there is no AR factor allocated to it.
Once these rules are applied, the factorization of the AR polynomial presented by Equation 2 yields to the identification of the AR polynomials for the components which contain, respectively, the AR roots associated with the trend component, the seasonal component and the transitory component.
The AR roots close to or at the trading day frequency generates a stochastic trading day component. A stochastic trading day component is always modelled as a stationary ARMA(2,2), where the AR part contains the roots close to the TD frequency, and the MA(2) is obtained from the model decomposition (MARAVALL, A., and PÉREZ, D. (2011)). This component, estimated by SEATS, is not implemented by the current version of JDemetra+.
Then with the partial fraction expansion the spectrum of the final components are obtained.
For example, the Airline model for a monthly time series:
Equation 15
$$
(1-B)(1-B^{12})x_{t} = (1+\theta_{1}B)(1+\Theta_{1}B^{12})\ a_{t}
$$
is decomposed by SEATS into the model for the trend component:
Equation 16
$$
(1-B)(1-B)c_{t} =(1+\theta_{c,1}B + \theta_{c,2}B^{2})a_{c,t}
$$
and the model for the seasonal component:
Equation 17
$$
(1+B + \ldots + B^{11})s_{t} = (1+\theta_{s,1}B + \ldots + {\theta_{s,11}B}^{11})a_{s,t},
$$
As a result, the Airline model is decomposed as follows:
Equation 18
$$
\small
\frac{(1+\theta_{1}B)(1+\Theta_{1}B^{12})}{(1-B)(1-B)}a_{t} = \frac{(1+\theta_{s,1}B + \ldots + {\theta_{s,11}B}^{11})}{(1+B + \ldots + B^{11})}a_{s,t} + \frac{(1+\theta_{c,1}B + \theta_{c,2}B^{2})}{(1-B)(1-B)}a_{c,t} + u_{t}
$$
The transitory component is not present in this case and the irregular component is the white noise.
The partial fractions decomposition is performed in a frequency domain. In essence, it consists in portioning of the pseudo-spectrum of $x_{t}$ into additive spectra of the components. When the AMB decomposition of the ARIMA model results in the non-negative spectra for all components, the decomposition is called admissible. In such case an infinite number of admissible decompositions exists, i.e. decompositions that yield the non-negative spectra of all components.
Therefore, the MA polynomials and the innovation variances cannot be yet identified from the model of $x_{t}$. As sketched above, to solve this under identification problem and identify a unique decomposition, it is assumed that for each component the order of the MA polynomial is no greater than the order of the AR polynomial and the canonical solution of S.C. Hillmer and G.C. Tiao is applied, i.e. all additive white noise is added to the irregular component As a consequence all components derived from the canonical decomposition, except from the irregular, have a spectral minimum of zero and are thus non invertible.
Given the stochastic features of the series, it can be shown by that the canonical decomposition produces as stable as possible trend and seasonal components since it maximizes the variance of the irregular and minimizes the variance of the other components. However, there is a price to be paid as canonical components can produce larger revisions in the preliminary estimators of the component than any other admissible decomposition.
The term pseudo-spectrum is used for a non-stationary time series, while the term spectrum is used for a stationary time series.
If the ARIMA model estimated in TRAMO does not accept an admissible decomposition, SEATS replaces it with a decomposable approximation. The modified model is therefore used to decompose the series. There are also other rare situations when the ARIMA model chosen by TRAMO is changed by SEATS. It happens when, for example, the ARIMA models generate unstable seasonality or produce a senseless decomposition. Such examples are discussed by MARAVALL, A. (2009).
HILLMER, S.C., and TIAO, G.C. (1982).
GÓMEZ, V., and MARAVALL, A. (2001a).
HILLMER, S.C., and TIAO, G.C. (1982).
MARAVALL, A. (1986).
The figure below represents the pseudo-spectrum for the canonical trend and an admissible trend.
![**A comparison of canonical trend and admissible trend**](All_images/UG_A_image5.png)
A pseudo-spectrum is denoted by$g_{i}(\omega)$, where $\omega$ represents the angular frequency. The pseudo-spectrum of $x_{\text{it}}$ is defined as the Fourier transform of ACGF of $x_{t}$ which is expressed as:
Equation 19
$$
\frac{\psi_{i}(B)\psi_{i}(F)}{\delta_{i}(B)\delta_{i}(F)}V(a_{i})
$$
where:
- $\psi_{i}(F) = \frac{\theta_{i}(F)}{\phi_{i}(F)}$
- $\psi_{i}(B) = \frac{\theta_{i}(B)}{\phi_{i}(B)}$
A pseudo-spectrum for a monthly time series $x_{t}$ is presented in the figure below: The pseudo-spectrum for a monthly series. The frequency $\omega = 0$ is associated with the trend, frequencies in the range $[0 + \epsilon_{1},\ \frac{\pi}{6} - \epsilon_{2}]$ with $[0 + \epsilon_{1},\ \frac{\pi}{6} - \epsilon_{2}]$ $\epsilon_{1},\ \epsilon_{2} > 0$ and $\epsilon_{1} < \ \frac{\pi}{6} - \epsilon_{2}$ are usually associated with the business-cycle and correspond to a period longer than a year and bounded.
The frequencies in the range $[\frac{\pi}{6},\pi]$ are associated with the short term movements, whose cycle is completed in less than a year. If a series contains an important periodic component, its spectrum reveals a peak around the corresponding frequency and in the ARIMA model it is captured by an AR root. In the example below spectral peaks occur at the frequency $\omega = 0$ and at the seasonal frequencies ($\frac{\pi}{6}$, $\frac{2\pi}{6},\ \frac{3\pi}{6},\ \frac{4\pi}{6},\frac{5\pi}{6},\pi$).
![**The pseudo-spectrum for a monthly series**](All_images/UG_A_image6.png)
In the decomposition procedure, the pseudo-spectrum of the time series $x_{t}$ is divided into the spectra of its components (in the example figure below, four components were obtained).
![**The pseudo-spectra for the components**](All_images/UG_A_image7.png)
### Estimation of the components
The various components are estimated using Wiener-Kolmogorow (WK) filters. JDemetra+ includes three options to estimate the WK filter, namely *Burman*, *KalmanSmoother* and *MCElroyMatrix*[^m-seats-decomposition_old-12]. Here the first of above mentioned options, proposed by BURMAN, J.P. (1980) will be explained.
The estimation procedure and the properties of the WK filter are easier to explain with a two-component model. Let the seasonally adjusted series ($s_{t}$) be the signal of interest and the seasonal component ($n_{t}$) be the remainder, "the noise". The series is given by the model in Equation 2 and from Equation 4 the models for theoretical components are:
Equation 20
$$
\varphi_{s}(B)s_{t} = \theta_{s}(B)a_{\text{st}}
$$
and
Equation 21
$$
\varphi_{n}(B)n_{t} = \theta_{n}(B)a_{\text{nt}}
$$
From Equation 6 and Equation 7 it is clear that $\varphi(B) = \varphi_{s}(B)\varphi_{n}(B)$ and $\theta(B)a_{t} = \theta_{s}(B)a_{\text{st}}+\theta_{n}(B)a_{\text{nt}}$.
As the time series components are never observed, their estimators have to be used. Let us note $X_{T}$ an infinite realization of the time series $x_{t}$. SEATS computes the Minimum Mean Square Error (MMSE) estimator of $s_{t}$, e.g. the estimator $\widehat{s}_{t}$ that minimizes $E\lbrack({s_{t}-{\widehat{s}}_{t})}^{2}|X_{T})\rbrack$. Under the normality assumption ${\widehat{s}}_{t|T}$ is also equal to the conditional expectation $E(s_{t}|X_{T}\right)$, so it can be presented as a linear function of the elements in $X_{T}$.WHITTLE (1963) shows that the MMSE estimator of ${\widehat{s}}_{t}$ is:
Equation 22
$$
{\widehat{s}}_{t} = k_{s}\frac{\psi_{s}(B)\psi_{s}(F)}{\psi(B)\psi(F)}x_{t}
$$
where
- $\psi(B)= \frac{\theta(B)}{\phi(B)}$,
- $k_{s}=\frac{V(a_{s})}{V(a)}$,
$V(a_{s})$ is the variance of $a_{st}$ and $V(a)$ is the variance of $a_{t}$.
Expressing the $\psi(B\right)$ polynomials as functions of the AR and MA polynomials, after cancellation of roots, the estimator of $s_{t}$ can be expressed as:
Equation 23
$$
\widehat{s}_{t} =
k_{s}\frac{\theta_{s}(B)\theta_{s}(F)\varphi_{n}(B)\delta_{n}(B)\varphi_{n}(F)\delta_{n}(F)}
{\theta(B)\theta(F)}x_{t}
$$
where:
Equation 24
$$
\nu_{s}(B,F) = k_{s}\frac{\theta_{s}(B)\theta_{s}(F)\varphi_{n}(B)\delta_{n}(B)\varphi_{n}(F)\delta_{n}(F)}{\theta(B)\theta(F)}
$$
is a WK filter.
Equation 24 shows that the WK filter is two-sided (uses observations both from the past and from the future), centred (the number of points in the past is the same as in the future) and symmetric (for any $k$ the weight applied to $x_{t-k}$ and $x_{t+k}$ is the same), which allows the phase effect to be avoided. Due to invertibility of $\theta(B)$ (and $\theta(F)$) the filter is convergent in the past and in the future.
The estimator can be presented as
Equation 25
$$
\widehat{s}_{t} = \nu_{i}(B,F)x_{t}
$$
where $\nu_{i}(B,F)=\nu_{0}+ \sum_{j = 1}^{\infty}\nu_{ij}(B^{j}+F^{j})$ is the WK filter.
The example of the WK filters obtained for the pseudo-spectra of the series illustrated above is shown on the figure below: WK filters for components.
![**WK filters for components**](All_images/UG_A_image8.png)
The WK filter from Equation 24 can also be expressed as a ratio of two pseudo-autocovariance generating functions (p-ACGF). The p-ACGF function summarizes the sequence of absolutely summable autocovariances of a stationary process $x_{t}$ (see [Spectral Analysis](#m-spectrum)).
The ACGF function of an ARIMA process is expressed as:
Equation 26
$$
acgf(B) = \frac{\theta(B)\theta(F)}{\phi(B)\delta(B)\phi(F)\delta(F)}V(a)
$$
And, the WK filter can be rewritten as:
Equation 27
$$
\nu_{s}(B,F) = \frac{\gamma_{s}(B,F)}{\gamma(B,F)}
$$
where:
- $\gamma_{s}(B,F) = \frac{\theta_{s}(B)\theta_{s}(F)}{\phi_{s}(B)\delta_{s}(B)\phi_{s}(F)\delta_{s}(F)}V(a_{s})$ is the p-ACGF of $s_{t}$;
- $\gamma(B,F) = \frac{\theta(B)\theta(F)}{\phi(B)\delta(B)\phi(F)\delta(F)}V(a)$ is the p-ACGF of $x_{t}$.
From Equation 24 it can be seen that the WK filter depends on both the component and the series models. Consequently, the estimator of the component and the WK filter reflect the characteristic of data and by construction, the WK filter adapts itself to the series under consideration. Therefore, the ARIMA model is of particular importance for the SEATS method. Its misspecification results in an incorrect decomposition.
This adaptability, if the model has been correctly determined, avoids the dangers of under and overestimation with an ad-hoc filtering. For example, for the series with a highly stochastic seasonal component the filter adapts to the width of the seasonal peaks and the seasonally adjusted series does not display any spurious seasonality. Examples of WK filters for stochastic and stable seasonal components are presented on the figure below. (MARAVALL, A. (1995)).
![**WK filters for stable and stochastic seasonal components**](All_images/UG_A_image9.png)
The derivation of the components requires an infinite realization of $x_{t}$ in the direction of the past and of the future. However, the convergence of the WK filter guarantees that, in practice, it could be approximated by a truncated (finite) filter and, in most applications, for large $k$ the estimator for the central periods of the series can be safely seen as generated by the WK filter (MARAVALL, A., and PLANAS, C. (1999)).
Equation 28
$$
\widehat{s}_{t}=\nu_{k}x_{t-k} + \ldots + \nu_{0}x_{t} + \ldots + \nu_{k}x_{t+k}
$$
When $T > 2L + 1$, where $T$ is the last observed period, and $L$ is an a priori number that typically expands between 3 and 5 years, the estimator expressed by Equation 23 can be assumed as the final (historical) estimator for the central observations of the series[^m-seats-decomposition_old-16]. In practice, the Wiener-Kolmogorov filter is applied to $x_{t}$ extended with forecasts and backcasts from the ARIMA model. The final or historical estimator of ${\widehat{s}}_{t}$, is obtained with a doubly infinite filter, and therefore contains an error $e_{st}$ called final estimation error, which is equal $e_{st}=s_{t}-sociated with the regular and the seasonal differences have roots in $\varphi(B)$ with modulus $r = 1$. They are called non-stationary roots and can be represented on the unit circle. Let us consider the seasonal differencing operator applied to a quarterly time series $(1-B^{4})$. Its characteristic equation is ${(z}^{4}-1) = 0$ with solutions given by$z = \sqrt[4]{1}$, i.e. $z_{1,2} = \pm 1$ and $z_{3,4} = \pm i1$. The first two solutions are real and the last two are complex conjugates. They are represented by the black points on the unit circle on the figure below. ![**Unit roots on the unit circle**](All_images/UG_A_image4.png) For the seasonal differencing operator $(1-B^{12})$ applied to the monthly time series the characteristic equation ${(z}^{12}-1) = 0$ has twelve non-stationary solutions given by $z = \sqrt[12]{1}:$ two real and ten complex conjugates, represented by the white circles in unit roots figure above. The complex conjugates roots generate the periodic movements of the type: Equation 14 $$ z_{t} = A^{t}\cos(\omega t + W) $$ where: - $A$ -- amplitude; - $\omega$ -- angular frequency (in radians); - $W$ -- phase (angle at $t = 0)$. The frequency $f$, i.e. the number of cycles per unit time, is $\frac{\omega}{2\pi}$. If it is multiplied by *s*, the number of observations per year, the number of cycles completed in one year is derived. The period of function in Equation 14, denoted by $\tau$, is the number of units of time (months/quarters) it takes for a full circle to be completed. For quarterly series the seasonal movements are produced by complex conjugates roots with angular frequencies at $\frac{\pi}{2}$ (one cycle per year) and $\pi$ (two cycles per year). The corresponding number of cycles per year and the length of the movements are presented in the table below. **Seasonal frequencies for a quarterly time series** | **Angular frequency (**$\omega$) | **Frequency (cycles per unit time) (**$f$) | **Cycles per year** | **Length of the movement measured in quarters (**$\tau$) | |------------------|------------------|------------------|--------------------| | $\frac{\pi}{2}$ | 0.25 | 1 | 4 | | $\pi$ | 0.5 | 2 | 2 | For monthly time series the seasonal movements are produced by complex conjugates roots at the angular frequencies: $\frac{\pi}{6},\frac{\pi}{3}, \frac{\pi}{2}, \frac{2\pi}{3}, \frac{5\pi}{6}$ and $\pi$. The corresponding number of cycles per year and the length of the movements are presented in the table below: **Seasonal frequencies for a monthly time series** <!-- {: .table .table-style} --> | **Angular frequency (**$\omega$) | **Frequency (cycles per time unit) (**$f$) | **Cycles per year** | **Length of the movement measured in months (**$\tau$) | |----------------|--------------------|------------------|--------------------| | $\frac{\pi}{6}$ | 0.083 | 1 | 12 | | $\frac{\pi}{3}$ | 0.167 | 2 | 6 | | $\frac{\pi}{2}$ | 0.250 | 3 | 4 | | $\frac{2\pi}{3}$ | 0.333 | 4 | 3 | | $\frac{5\pi}{6}$ | 0.417 | 5 | 2.4 | | $\pi$ | 0.500 | 6 | 2 | In JDemetra+ SEATS assigns the roots of the AR full polynomial to the components according to their associated modulus and frequency (For details see MARAVALL, A., CAPORELLO, G., PÉREZ, D., and LÓPEZ, R. (2014)) - Roots of $(1-B)^{d}$ are assigned to trend component. - Roots of $(1-B^{s})^{d_{s}} = {((1-B)(1+B + \ldots + B^{s-1}))}^{d_{s}}$ are assigned to the trend component (root of ${(1-B)}^{d_{s}}$) and to the seasonal component (roots of ${(1+B + \ldots + B^{s-1})}^{d_{s}}$). - When the modulus of the inverse of a real positive root of $\varphi(B)$ is greater than $k$ or equal to $k$, where $k$ is the threshold value controlled by the *Trend boundary* parameter, then the root is assigned to the trend component. Otherwise it is assigned to the transitory component. - Real negative inverse roots of $\text{ϕ}_{p}(B)$ associated with the seasonal two-period cycle are assigned to the seasonal component if their modulus is greater than *k*, where $k$ is the threshold value controlled by the *Seasonal boundary* and the *Seas. boundary (unique)* parameters. Otherwise they are assigned to the transitory component. - Complex roots, for which the argument (angular frequency) is close enough to the seasonal frequency are assigned to the seasonal component. Closeness is controlled by the *Seasonal tolerance* and *Seasonal tolerance (unique)* parameters. Otherwise they are assigned to the transitory component. - If $d_{s}$ (seasonal differencing order) is present and $\text{Bphi} < 0$ ($\text{Bphi}$ is the estimate of the seasonal autoregressive parameter), the real positive inverse root is assigned to the trend component and the other ($s-1$) inverse roots are assigned to the seasonal component. When $d_{s} = 0$, the root is assigned to the seasonal when $\text{Bphi} < - 0.2$ and/or the overall test for seasonality indicates presence of seasonality. Otherwise it goes to the transitory component. Also, when $\text{Bphi} > 0$, roots are assigned to the transitory component. It should be highlighted that when $Q > P$, where $Q$ and $P$ denote the orders of the polynomials $\varphi(B)$ and $\theta(B)$, the SEATS decomposition yields a pure MA $(Q - P)$ component (hence transitory). In this case the transitory component will appear even when there is no AR factor allocated to it. Once these rules are applied, the factorization of the AR polynomial presented by Equation 2 yields to the identification of the AR polynomials for the components which contain, respectively, the AR roots associated with the trend component, the seasonal component and the transitory component. The AR roots close to or at the trading day frequency generates a stochastic trading day component. A stochastic trading day component is always modelled as a stationary ARMA(2,2), where the AR part contains the roots close to the TD frequency, and the MA(2) is obtained from the model decomposition (MARAVALL, A., and PÉREZ, D. (2011)). This component, estimated by SEATS, is not implemented by the current version of JDemetra+. Then with the partial fraction expansion the spectrum of the final components are obtained. For example, the Airline model for a monthly time series: Equation 15 $$ (1-B)(1-B^{12})x_{t} = (1+\theta_{1}B)(1+\Theta_{1}B^{12})\ a_{t} $$ is decomposed by SEATS into the model for the trend component: Equation 16 $$ (1-B)(1-B)c_{t} =(1+\theta_{c,1}B + \theta_{c,2}B^{2})a_{c,t} $$ and the model for the seasonal component: Equation 17 $$ (1+B + \ldots + B^{11})s_{t} = (1+\theta_{s,1}B + \ldots + {\theta_{s,11}B}^{11})a_{s,t}, $$ As a result, the Airline model is decomposed as follows: Equation 18 $$ \small \frac{(1+\theta_{1}B)(1+\Theta_{1}B^{12})}{(1-B)(1-B)}a_{t} = \frac{(1+\theta_{s,1}B + \ldots + {\theta_{s,11}B}^{11})}{(1+B + \ldots + B^{11})}a_{s,t} + \frac{(1+\theta_{c,1}B + \theta_{c,2}B^{2})}{(1-B)(1-B)}a_{c,t} + u_{t} $$ The transitory component is not present in this case and the irregular component is the white noise. The partial fractions decomposition is performed in a frequency domain. In essence, it consists in portioning of the pseudo-spectrum of $x_{t}$ into additive spectra of the components. When the AMB decomposition of the ARIMA model results in the non-negative spectra for all components, the decomposition is called admissible. In such case an infinite number of admissible decompositions exists, i.e. decompositions that yield the non-negative spectra of all components. Therefore, the MA polynomials and the innovation variances cannot be yet identified from the model of $x_{t}$. As sketched above, to solve this under identification problem and identify a unique decomposition, it is assumed that for each component the order of the MA polynomial is no greater than the order of the AR polynomial and the canonical solution of S.C. Hillmer and G.C. Tiao is applied, i.e. all additive white noise is added to the irregular component As a consequence all components derived from the canonical decomposition, except from the irregular, have a spectral minimum of zero and are thus non invertible. Given the stochastic features of the series, it can be shown by that the canonical decomposition produces as stable as possible trend and seasonal components since it maximizes the variance of the irregular and minimizes the variance of the other components. However, there is a price to be paid as canonical components can produce larger revisions in the preliminary estimators of the component than any other admissible decomposition. The term pseudo-spectrum is used for a non-stationary time series, while the term spectrum is used for a stationary time series. If the ARIMA model estimated in TRAMO does not accept an admissible decomposition, SEATS replaces it with a decomposable approximation. The modified model is therefore used to decompose the series. There are also other rare situations when the ARIMA model chosen by TRAMO is changed by SEATS. It happens when, for example, the ARIMA models generate unstable seasonality or produce a senseless decomposition. Such examples are discussed by MARAVALL, A. (2009). HILLMER, S.C., and TIAO, G.C. (1982). GÓMEZ, V., and MARAVALL, A. (2001a). HILLMER, S.C., and TIAO, G.C. (1982). MARAVALL, A. (1986). The figure below represents the pseudo-spectrum for the canonical trend and an admissible trend. ![**A comparison of canonical trend and admissible trend**](All_images/UG_A_image5.png) A pseudo-spectrum is denoted by$g_{i}(\omega)$, where $\omega$ represents the angular frequency. The pseudo-spectrum of $x_{\text{it}}$ is defined as the Fourier transform of ACGF of $x_{t}$ which is expressed as: Equation 19 $$ \frac{\psi_{i}(B)\psi_{i}(F)}{\delta_{i}(B)\delta_{i}(F)}V(a_{i}) $$ where: - $\psi_{i}(F) = \frac{\theta_{i}(F)}{\phi_{i}(F)}$ - $\psi_{i}(B) = \frac{\theta_{i}(B)}{\phi_{i}(B)}$ A pseudo-spectrum for a monthly time series $x_{t}$ is presented in the figure below: The pseudo-spectrum for a monthly series. The frequency $\omega = 0$ is associated with the trend, frequencies in the range $[0 + \epsilon_{1},\ \frac{\pi}{6} - \epsilon_{2}]$ with $[0 + \epsilon_{1},\ \frac{\pi}{6} - \epsilon_{2}]$ $\epsilon_{1},\ \epsilon_{2} > 0$ and $\epsilon_{1} < \ \frac{\pi}{6} - \epsilon_{2}$ are usually associated with the business-cycle and correspond to a period longer than a year and bounded. The frequencies in the range $[\frac{\pi}{6},\pi]$ are associated with the short term movements, whose cycle is completed in less than a year. If a series contains an important periodic component, its spectrum reveals a peak around the corresponding frequency and in the ARIMA model it is captured by an AR root. In the example below spectral peaks occur at the frequency $\omega = 0$ and at the seasonal frequencies ($\frac{\pi}{6}$, $\frac{2\pi}{6},\ \frac{3\pi}{6},\ \frac{4\pi}{6},\frac{5\pi}{6},\pi$). ![**The pseudo-spectrum for a monthly series**](All_images/UG_A_image6.png) In the decomposition procedure, the pseudo-spectrum of the time series $x_{t}$ is divided into the spectra of its components (in the example figure below, four components were obtained). ![**_{t}$.
[^m-seats-decomposition_old-16]: MARAVALL, A. (1998).
In the frequency domain, the Wiener-Kolmogorov filter$\ \nu(B,F)$ that provides the final estimator of $s_{t}$ is expressed as the ratio of the $s_{t}$ and $x_{t}$ pseudo-spectra:
$$
\widetilde{\nu}(\omega) = \frac{g_{s}(\omega)}{g_{x}(\omega)}
$$ {#eq-29} <!---\[7.53\]-->
The function $\widetilde{\nu}(\omega)$ is also referred as the gain of the filter.[^m-seats-decomposition_old-17] GÓMEZ, V., and MARAVALL, A. (2001a) show that when for some frequency the signal (the seasonally adjusted series) dominates the noise (seasonal fluctuations) the gain $\widetilde{\nu}(\omega)$ approaches 1. On the contrary, when for some frequency the noise dominates the gain $\widetilde{\nu}(\omega)$ approaches 0.
[^m-seats-decomposition_old-17]: GÓMEZ, V., and MARAVALL, A. (2001a).
The spectrum of the estimator of the seasonal component is expressed as:
$$
g_{\widehat{s}}(\omega) = \lbrack \frac{g_{s}(\omega)}{g_{x}(\omega)}\rbrack^{2}g_{x}(\omega)
$$ {#eq-30} <!---\[7.54\]-->
where
- $\ \lbrack \widetilde{\nu}(\omega)\rbrack^{2} = \lbrack \frac{g_{s}(\omega)}{g_{x}(\omega)}\rbrack^{2} = \lbrack \frac{g_{s}(\omega)}{g_{s}(\omega) + g_{n}(\omega)}\rbrack^{2} = \lbrack \frac{1}{1+\frac{1}{r(\omega)}}\rbrack^{2}$ is the squared gain of the filter ;
- $r(\omega) = \frac{g_{s}(\omega)}{g_{n}(\omega)}$ represents the signal-to-noise ratio.
For each $\omega$, the MMSE estimation gives the signal-to-noise ratio. If this ratio is high, then the contribution of that frequency to the estimation of the signal will be also high. Assume that the trend is a signal that needs to be extracted from a seasonal time series. Then $R(0) = 1$ and the frequency $\omega = 0$ will only be used for trend estimations. For seasonal frequencies $R(\omega) = 0$, so that these frequencies are ignored in computing the trend resulting in spectral zeros in $g_{\widehat{s}}(\omega)$. For this reason, unlike the spectrum of the component, the component spectrum contains dips as it can be seen on the figure below: Component spectrum and estimator spectrum for trend.
![**Component spectrum and estimator spectrum for trend**](All_images/UG_A_image10.png)
From the equation @eq-29 <!---\[7.53\]--> it is clear that the squared gain of the filter determines how the variance of the series contributes to the variance of the seasonal component for the different frequencies. When $\widetilde{\nu}(\omega) = 1$, the full variation of $x_{t}$ for that frequency is passed to ${\widehat{s}}_{t}$, while if $\widetilde{\nu}(\omega\right) = 0$ the variation of $x_{t}$ for that frequency is fully ignored in the computation of ${\widehat{s}}_{t}$. These two cases are well illustrated by the figure below that shows the square gain of the WK filter for two series already analysed in the figure above (Figure: WK filters for stable and stochastic seasonal components).
![**The squared gain of the WK filter for stable and stochastic seasonal components.**](All_images/UG_A_image11.png)
Since $r(\omega) \geq 0$, then $\widetilde{\nu}(\omega) \leq 1$ and from @eq-29 <!---\[7.53\]--> it can be derived that $g_{\widehat{s}}(\omega) = \widetilde{\nu}(\omega)g_{s}(\omega)$. As a result, the estimator will always underestimate the component, i.e. it will be always more stable that the component.[^m-seats-decomposition_old-18]
[^m-seats-decomposition_old-18]: Ibid.
Since $g_{\widehat{n}}(\omega) < g_{n}(\omega)$ and$\ g_{\widehat{s}}(\omega) < g_{s}(\omega)$ the expression: $g_{x}(\omega) - \lbrack g_{\widehat{n}}(\omega) + g_{\widehat{s}}(\omega)\rbrack \geq 0$ is the cross-spectrum. As it is positive, the MMSE yields correlated estimators. This effect emerges since variance of estimator is smaller than the variance of component. Nevertheless, if at least one non-stationary component exists, cross-correlations estimated by TRAMO-SEATS will tend to zero as cross-covariances between estimators of the components are finite. In practice, the inconvenience caused by this property will likely be of little relevance.
**Preliminary estimators for the components**
GÓMEZ, V., and MARAVALL, A. (2001a) point out that *the properties of the estimators have been derived for the final (or historical) estimators. For a finite (long enough) realization, they can be assumed to characterize the estimators for the central observations of the series, but for periods close to the beginning of the end the filter cannot be completed and some preliminary estimator has to be used*. Indeed, the historical estimator shown in @eq-28<!---\[7.52\]--> is obtained for the central periods of the series. However, when $t$ approaches $T$ (last observation), the WK filter requires observations, which are not available yet. For this reason a preliminary estimator needs to be used.
To introduce preliminary estimators let us consider a semi-finite realization $\lbrack x_{- \infty}$,...$\ x_{T}$\], where $T$ is the last observed period. The preliminary estimator of $x_{\text{it}}$ obtained at $T$, $(T - t = k \geq 0)$ can be expressed as
$$
{\widehat{x}}_{it|t + k}=\nu_{i}(B,F\right)x_{t|T}^{e}
$$ {#eq-31} <!---\[7.55\]-->
where
- $\nu_{i}(B,F)$ is the WK filter ;
- $x_{t|T}^{e}$ is the extended series, such that $x_{t|T}^{e} = x_{t}$ for $t \leq T$ and $x_{t|T}^{e}={\widehat{x}}_{t|T}$ for $t>T$, where ${\widehat{x}}_{t|T}$ denotes the forecast of $x_{t}$ obtained at period $T$.
The future $k$ values necessary to apply the filter are not yet available and are replaced by their optimal forecasts from the ARIMA model on $x_{t}$. When $k=0$ the preliminary estimator becomes the concurrent estimator. As the forecasts are linear functions of present and past observations of $x_{t}$, the preliminary estimator ${\widehat{x}}_{it}$ will be a truncated asymmetric filter applied to $x_{t}$ that generates a phase effect[^m-seats-decomposition_old-19].
[^m-seats-decomposition_old-19]: KAISER, R., and MARAVALL, A. (2000).
When a new observation $x_{T + 1}$ becomes available the forecast ${\widehat{x}}_{T + 1|T}$ is replaced by the observation and the forecast ${\widehat{x}}_{iT + j|T}$, $j > 1$ are updated to $x_{T + j|T + 1}$ resulting in the revision error[^m-seats-decomposition_old-20]. The total error in the preliminary estimator $d_{it|t + k}$ is expressed as a sum of the final estimation error ($e_{it}$) and the revision error ($r_{it|t + k}$), i.e.:
[^m-seats-decomposition_old-20]: MARAVALL, A. (1995).
$$
d_{it|t + k} = x_{it}-{\widehat{x}}_{it|t + k} = (x_{it} - {\widehat{x}}_{it}\right) + ( {\widehat{x}}_{it} - {\widehat{x}}_{it|t + k}) = e_{it} + r_{it|t + k}
$$ {#eq-32} <!---\[7.56\]-->
where:
- $x_{it}-i^{th}$ component;
- ${\widehat{x}}_{it|t + k}$- the estimator of $x_{it}$ when the last observation is $x_{t + k}$.
Therefore the preliminary estimator is subject not only to the final error but also to a revision error, which are orthogonal to each other[^m-seats-decomposition_old-21]. The revision error decreases as $k$ increases, until it can be assumed equal to 0 for large enough $k$.
[^m-seats-decomposition_old-21]: MARAVALL, A. (2009).
It's worth remembering that SEATS estimates the unobservable components of the time series so the "true" components are never observed. Therefore, MARAVALL, A. (2009) stresses that *the error in the historical estimator is more of academic rather than practical interest. In practice, interest centres on revisions. (...) the revision standard deviation will be an indicator of how far we can expect to be from the optimal estimator that will be eventually attained, and the speed of convergence of* ${\theta(B)\ }^{- 1}$ *will dictate the speed of convergence of the preliminary estimator to the historical one.* The analysis of an error is therefore useful for making decision concerning the revision policy, including the policy for revisions and horizon of revisions.
### PsiE-weights
The estimator of the component is calculated as ${\widehat{x}}_{it} = \nu_{s}(B,F\right)x_{t}$. By replacing $x_{it}=\frac{\theta(B)}{\gamma(B)\delta(B)}a_{t}$, the component estimator can be expressed as[^m-seats-decomposition_old-22]:
[^m-seats-decomposition_old-22]: The section is based on KAISER, R., and MARAVALL, A. (2000).
$$
{\widehat{x}}_{it} = \xi_{s}(B,F\right)a_{t}
$$ {#eq-33} <!---\[7.57\]-->
where $\xi_{s}(B,F) = \ldots + \xi_{j}B^{j} + \ldots + \xi_{1}B + \xi_{0} + \xi_{- 1}F\ldots\xi_{- j}F^{j} + \ldots$.
This representation shows the estimator as a filter applied to the innovation $a_{t}$, rather than on the series $x_{t}$[^m-seats-decomposition_old-23]. Hence, the filter from @eq-32 <!---\[7.56\]--> can be divided into two components: the first one, i.e. $\ldots + \xi_{j}B^{j}+ \ldots+ \xi_{1}B + \xi_{0}$, applies to prior and concurrent innovations, the second one, i.e. $\xi_{- 1}F + \ldots + \xi_{- j}F^{j}$ applies to future (i.e. posterior to $t$) innovations. Consequently, $\xi_{j}$ determines the contribution of $a_{t - j}$ to ${\widehat{s}}_{t}$ while $\xi_{- j}$ determines the contribution of $a_{t + j}$ to ${\widehat{s}}_{t}$. Finally, the estimator of the component can be expressed as:
[^m-seats-decomposition_old-23]: See section PsiE-weights. For further details see MARAVALL, A. (2008).
$$
{\widehat{x}}_{it} =\xi_{i}(B)^{-}a_{t} + \xi_{i}(F)^{+}a_{t + 1}
$$ {#eq-34} <!---\[7.58\]-->
where:
- $\xi_{i}{(B)}^{-}a_{t}$ is an effect of starting conditions, present and past innovations in series;
- $\xi_{i}{(F)}^{+}a_{t + 1}$ is an effect of future innovations.
For the two cases already presented in figure *WK filters for stable and stochastic seasonal components* and figure *The squared gain of the WK filter for stable and stochastic seasonal components* above, the psi-weights are shown in the figure below.
![**WK filters and squared gain of the WK filter**](All_images/UG_A_image12.png)
It can be shown that ${\xi}_{- 1},\ldots,\xi_{- j}$ are convergent and $\xi_{j},\ldots, {\xi}_{1},\xi_{0}$ are divergent. From @eq-33 <!---\[7.57\]-->, the concurrent estimator is equal to
$$
{\widehat{x}}_{it|t} = E_{t}x_{it}=E_{t}{\widehat{x}}_{it} = {\xi}_{i}(B)^{-}a_{t}
$$ {#eq-35} <!---\[7.59\]-->
so that the revision
$$
r_{it} = {\widehat{x}}_{it} - {\widehat{x}}_{it|t} = \xi_{i}(F)^{+}a_{t + 1}
$$ {#eq-36} <!---\[7.60\]-->
is a zero-mean stationary MA process. As a result, historical and preliminary estimators are cointegrated. From expression @eq-25 <!---\[7.49\]--> the relative size of the full revision and the speed of convergence can be obtained.