Hareketli ortalama modeli hata terimleri

18

Bu Box-Jenkins MA modellerinde temel bir sorudur. Anladığım kadarıyla, bir MA modeli temelde zaman serisi değerleri $Y$ önceki hata terimleri karşı doğrusal bir regresyonudur $e_t,..., e_{t-n}$ . Yani, gözlemi $Y$ ilk olarak önceki değerlerine karşı gerilemektedir $Y_{t-1}, ..., Y_{t-n}$ ve daha sonra bir veya daha fazla $Y - \hat{Y}$ değerleri, MA modeli için hata terimi olarak kullanılır.

Ancak ARIMA (0, 0, 2) modelinde hata terimleri nasıl hesaplanır? MA modeli otoregresif bir parça olmadan kullanılır ve bu nedenle tahmini bir değer yoksa, muhtemelen bir hata terimi alabilir miyim?

— Robert Kubrick
kaynak

1

Hayır, sanırım MA (n) modelinin tanımını karıştırıyorsunuz, burada regresyon sadece

açısından , tahminiyle

e_{t - i}

$e_{t-i}$

e_{t - i}

$e_{t-i}$ verilerden tahmin edildiği tahminiyle karıştırıyorsunuz .

— Xi'an

1

Sorunuzdaki temel sorun, MA modelinin temelde doğrusal bir regresyon olduğunu söylemenizdir. Hata terimlerini gözlemlemediğimizden bu doğru değildir.

— mpiktas

Ben hata terimi düşünüyorum olduğunu aslında

nerede,

ise

veya basitçe

. Bu nedenle, bir MA modeli parametre tahmini,

kısmi otokorelasyon fonksiyonunda tekrar eden bir kalıptan , yani artıkların davranışından türetilir . Bunun yerine AR parametresi tahmini, acf (Y) 'nin yinelenen bir modelini temel alır.

Y_{t} - \hat{Y_{t}}

$Y_t - \hat{Y_t}$

\hat{Y}

$\hat{Y}$

E (Y | Y_{t, . . ., t - n})

$E(Y|Y_{t,...,t-n})$

Y_{t} - Y_{t - 1}

$Y_t - Y_{t-1}$

Y

$Y$

— Robert Kubrick

20

MA Model Tahmini:

Şimdi 100 zaman noktası olan bir seri alalım ve bunun kesişmesiz MA (1) modeli ile karakterize edildiğini varsayalım. Sonra model tarafından verilir

y_{t} = ε_{t} - θ ε_{t - 1}, t = 1, 2, \dots, 100 (1)

$y_t=\varepsilon_t-\theta\varepsilon_{t-1},\quad t=1,2,\cdots,100\quad (1)$

Buradaki hata terimi gözlenmez. Bunu elde etmek için Box ve ark. Zaman Serisi Analizi: Tahmin ve Kontrol (3. Baskı) , sayfa 228 , hata teriminin,

ε_{t} = y_{t} + θ ε_{t - 1}

$\varepsilon_t=y_t+\theta\varepsilon_{t-1}$

Yani için hata terimi , Şimdi değerini bilmeden bunu hesaplayamayız . Bu nedenle, bunu elde etmek için, modelin İlk veya Ön tahminini hesaplamamız gerekir, Box ve ark. söz konusu kitabın, Bölüm 6.3.2, sayfa 202 , $t=1$

ε_{1} = y_{1} + θ ε_{0}

$\varepsilon_{1}=y_{1}+\theta\varepsilon_{0}$

θ

$\theta$

MA ( ) işleminin ilk otokorelasyonunun sıfır olmadığı ve model parametreleri açısından olarak yazılabileceği gösterilmiştir. $q$ $q$ yukarıda ifade açısından , malzeme denklemler bilinmeyen. Ön tahminleri s tahminleri değiştirilmesi ile elde edilebilir için yukarıdaki denklemde
$ρ_{k} = \frac{- θ_{k} + θ_{1} θ_{k + 1} + θ_{2} θ_{k + 2} + \dots + θ_{q - k} θ_{q}}{1 + θ_{1}^{2} + θ_{2}^{2} + \dots + θ_{q}^{2}} k = 1, 2, \dots, q$ $\rho_k=\displaystyle\frac{-\theta_{k}+\theta_1\theta_{k+1}+\theta_2\theta_{k+2}+\cdots+\theta_{q-k}\theta_q}{1+\theta_1^2+\theta_2^2+\cdots+\theta_q^2}\quad k=1,2,\cdots, q$ $\rho_1,\rho_2\cdots,\rho_q$ $\theta_1,\theta_2,\cdots,\theta_q$ $q$ $q$ $\theta$ $r_k$ $\rho_k$

$r_k$ $\theta=0.5$

ε_{1} = y_{1} + 0.5 ε_{0}

$\varepsilon_{1}=y_{1}+0.5\varepsilon_{0}$

ε_{0}

$\varepsilon_0$

t

$t$

ε_{1}

$\varepsilon_1$

Koşullu Olasılık
Koşulsuz Olasılık

Box ve ark. Bölüm 7.1.3 sayfa 227 , değerleri $\varepsilon_0$ olarak yaklaşık olarak sıfır ile değiştirilebilir $n$ orta veya büyükse, bu yöntem Koşullu Olasılıktır. Aksi takdirde, Koşulsuz Olasılık kullanılır; $\varepsilon_0$ "Havalandırma" terimi, geri tahmin yoluyla elde edilir, Box vd. bu yöntemi önerin. Geri tahmin hakkında daha fazla bilgiyi Bölüm 7.1.4 sayfa 231'de bulabilirsiniz .

İlk tahminleri ve değerini aldıktan sonra $\varepsilon_0$ , son olarak, hata teriminin özyinelemeli hesaplamasına devam edebiliriz. Sonra son aşama modelin parametresini tahmin etmektir $(1)$ , bunun artık ön tahmin olmadığını unutmayın.

Parametreyi tahmin ederken $\theta$ , I use Nonlinear Estimation procedure, particularly the Levenberg-Marquardt algorithm, since MA models are nonlinear on its parameter.

Overall, I would highly recommend you to read Box et al. Time Series Analysis: Forecasting and Control (3rd Edition).

— Al-Ahmadgaid Asaad
kaynak

Can you explain what is

r_{k}

$r_k$ ?

— Piyush Divyanakar

4

A Gaussian MA(q) model is defined (not only by Box and Jenkins!) as

Y_{t} = - \sum_{i = 1}^{q} ϑ_{i} e_{t - i} + σ e_{t}, e_{t} \overset{iid}{\sim} N (0, 1)

$Y_t = -\sum_{i=1}^q \vartheta_i e_{t-i} + \sigma e_t,\quad e_t\stackrel{\text{iid}}{\sim} \mathcal{N}(0,1)$ so the MA(q) model is a "pure" error model, the degree

q

$q$ defining how far the correlation goes back.

— Xi'an
kaynak

1

I'm still not clear on where

e_{t}

$e_t$ comes from. Is

e_{t}

$e_t$ a random variable? I don't think so, otherwise why to bother looking for

q

$q$ correlations?

— Robert Kubrick

1

Why is there a minus in your formula? Usually the minus is for AR models. Mathematically is not an issue, I'm just curious, since I've never seen minus in MA models.

— mpiktas

3

@RobertKubrick, are you aware of Wold decomposition theorem? Each stationary process has its corresponding innovation process, that is from where terms

e_{t}

$e_t$ come.

— mpiktas

1

@mpiktas Thanks, that gives some background on the error term, but I am still not clear on where the innovation process comes from, for an innovation to exist there's got to be a forecast somewhere (en.wikipedia.org/wiki/Innovation_(signal_processing)). Is the optimal

Y

$Y$ forecast simply

E (Y)

$E(Y)$ , that is the mean of the series?

— Robert Kubrick

1

You say "the observation $Y$ is first regressed against its previous values $Y_{t−1},...,Y_{t−n}$ and then one or more $Y−\hat{Y}$ values are used as the error terms for the MA model." What I say is that $Y$ is regressed against two predictor series $e_{t-1}$ and $e_{t−2}$ yielding an error process $e_t$ which will be uncorrelated for all i=3,4,,,,t .We then have two regression coefficients: $\theta_1$ representing the impact of $e_{t-1}$ and $\theta_2$ representing the impact of $e_{t-2}$ . Thus $e_t$ is a white noise random series containing n-2 values. Since we have n-2 estimable relationships we start with the assumption that e1 and e2 are equal to 0.0 . Now for any pair of $\theta_1$ and $\theta_2$ we can estimate the t-2 residual values. The combination that yields the smallest error sum of squares would then be the best estimates of $\theta_1$ and $\theta_2$ .

— IrishStat
kaynak

What are the 2 other predictor series? I am asking because when I look at the literature I have it's never clearly specified. Are these 2 other series unrelated to

Y

$Y$ ? I had the impression that all ARIMA formulation is limited to the

Y

$Y$ series.

— Robert Kubrick

1

The 2 predictors are the lags of the error terms. Since these are not known a priori since we do not know the error terms before we begin is why this has to be treated by non-linear estimation.The confusion you are having is that a model that is finite in the past ( i.e. an AR MODEL ) is potentially infinite in the errors AND a model that is finite in the errors ( i.e. an MA MODEL) is potentially infinite in the past of Y.The reason one selects an AR MODEL versus an MA MODEL is for parsimony. Sometimes we construct an ARMA MODEL which blends both the history of Y and the history of the errors.

— IrishStat

1

As I commented in the other answer, what I am still missing is what's the optimal forecast for

Y

$Y$ , which is used to calculate the innovation

e_{t - n}

$e_{t-n}$ .

— Robert Kubrick

1

See my post here for an explanation of how to understand the disturbance terms in a MA series.

You need different estimation techniques to estimate them. This is because you cannot first get the residuals of a linear regression and then include the lagged residual values as explanatory variables because the MA process uses the residuals of the current regression. In your example you are making two regression equations and using residuals from one into the other. This is not what an MA process is. It cannot be estimated with OLS.

— JoeDanger
kaynak