Two-Factor Full Factorial Design Without Replications

[Jain, Ch. 21]

We now study experiment designs with two factors, each of which taking an arbitrary number of levels.

Model

The model is:

displaymath369

Compared to the one-factor experiment, we add term tex2html_wrap_inline371 .

Question:

How does above equation with a=b=2 differ from the tex2html_wrap_inline399 model, for k=2:

displaymath395

Answer:

The Ch. 21 model treats interactions as errors. Also the Ch. 21 equation gives individual values for each factor level, whereas the tex2html_wrap_inline399 model gives one value (i.e., tex2html_wrap_inline405 ) for both factor levels.

Computation of Effects

Click here for a pdf image of Table 21.2 to see an example table for analysis of observations.

The column and row effects are tex2html_wrap_inline377 's and tex2html_wrap_inline383 's, respectively.

Derivation of effects tex2html_wrap_inline377 and tex2html_wrap_inline421 :

In the last equation, tex2html_wrap_inline425 is the column mean. So in the table computation we add each column, then divide by b to obtain tex2html_wrap_inline425 . Similarly, we obtain tex2html_wrap_inline383 for all i.

Results:

Allocation of Variation

How much of the variation is due to:

As in earlier designs, square both sides of model equation:

displaymath433

Crossterms are zero. Thus:

displaymath434

We desire SSA/SST, SSB/SST, and SSE/SST. We compute SST and SSE without computing the tex2html_wrap_inline389 's as follows:

displaymath435

and

displaymath436

For our example,

table104

Thus number of caches is an important in processor design.

Analysis of Variance

To statistically test factor significance, compute mean squared values of Y, 0, A, B, and E.

Recall:

displaymath459

and similarly for MS0, MSA, MSB, and MSE. Degrees of freedom are:

displaymath460

SSE has (a-1)(b-1) degrees of freedom because the rows and columns of the table must sum to zero.

Thus:

displaymath461

Finally, computed F statistics are:

displaymath462

Table Arrangement for Computation

See Table 21.4

See Table 21.5

Thus number of caches has significant effect on performance, but choice of workload does not.

Visual Validation of ANOVA Assumptions

Check that

  1. errors are normally distributed (Construct QQ plot.)

    See Fig. 21.1

  2. errors are statistically independent (Plot error versus response; look for trends.)

    See Fig. 21.2

  3. standard deviation of errors is constant (Check plot of error versus response for constant spread.)

    See Fig. 21.2

(3) is violated because error variance is higher at higher values of the response.

Confidence Intervals for Effects

CI formula for a quantify tex2html_wrap_inline475 is the mean plus/minus product of t distribution and standard deviation of x:

displaymath467

where variance tex2html_wrap_inline479 is

table168

and, as usual, tex2html_wrap_inline495 .

For our example, 90% CI for mean is:

displaymath468

where

displaymath469

Thus CI for tex2html_wrap_inline373 is

displaymath470

All CI's are summarized below:

  table189
Table 1: From Jain Table 21.7. Asterisk denotes ``not significant.''

Missing Observations

Sometimes a few values of tex2html_wrap_inline501 are missing.

This may be due to:

We next consider analyzing two factor designs with missing observations.

General Method

Certain cells in the table of tex2html_wrap_inline501 's will be missing.

Jain lists alternative (and controversial) analysis methods.

Example Problem

Example: comparison of six processors on 11 workloads:

See Table 21.20

Should we use an additive model?

Should We Use Additive Model

Evidence against additive model (A+B):

Deriving Effects Via Tabular Analysis of Example

We take the log of all observations to use a multiplicative model:

See Table 21.21

The grand mean is computed with 60 as the denominator, because six observations are missing from the table of tex2html_wrap_inline515 cells.

Computing Confidence Intervals

Computing Standard Deviation of Error

We will compute a confidence interval. The easiest way is just to compute SSE (not SST, SSA, etc.):

See Table 21.22

displaymath517

The degrees of freedom (DF) of errors is:

displaymath518

The term 60 reflects six missing observations!

displaymath519

displaymath520

Computing Confidence Intervals

Modify formulas expressing standard deviation of effects in terms of tex2html_wrap_inline527 to account for number of observations at each factor level.

Let

tex2html_wrap_inline529
denoted number observations of factor A

tex2html_wrap_inline531
denoted number observations of factor B

Thus

displaymath525

and similarly for tex2html_wrap_inline533 .

Confidence interval, using earlier formula ( tex2html_wrap_inline535 ), is:

See Figure 21.6

The significantly different pairs are RISC & Z8002 as well as 68000 with any other processor.

Visual Validation of Assumptions

Plot of residuals versus response shows that largest positive error is on same order of magnitude as predicted response. These points correspond to 68000.

See Fig. 21.7

A normal quantile-quantile plot of residuals shows that errors are not normally distributed. The 68000 causes deviation from normal.

See Fig. 21.8

Problem: The 68000 has too many missing observations!

Plots Without 68000

Without 68000, both visual tests pass:

See Fig. 21.9

See Fig. 21.10

Moral: If a row or column has many missing observations, exclude it!



cs5014@ei.cs.vt.edu