Lies,
damned lies, and statistics!
"Nothing in progression can rest on its original plan."Thomas Monson
"There are no facts about the future!" This is a favorite saying of noted risk authority Dr. David T. Hulett that sums it up pretty well for project managers.
[1]
Uncertainty is present in every project, every project being a unique assembly of resources and scope. Every project has its ending some time hence. Time displaces project
[1]
The author has had a
Everyone is familiar with the coin toss — one side is heads, the other side is tails. Flip it often enough and there will be all but an imperceptible difference in the number of times heads comes up as compared to
As an experiment, toss a coin 100 times. Let heads represent one estimate of the duration of a project task, say writing a specification, of 10 days, and let tails represent an estimate of duration of 15 days for the same task. Both estimates seem reasonable. Let us flip a coin to choose. If the coin were "fair" (that is, not
But what of the 101
st
toss? What will it be? Can we predict the outcome of toss 101 or know with any
The
However, projects rarely if ever repeat. Nonrepetitiveness is at the
Most people would say that the probability of an outcome of heads or tails in a fair coin toss is 50%; some might say it is half, 0.5, or one chance in two. They all would be correct. There is no chance (that is, 0 probability) that
What if the outcome was the value showing on a die from a pair of dice that was rolled or tossed? Again most people would say that the probability is 1/6 or one chance in six that a single number like a "5" would come up. If two dice were tossed, what is the chance that the sum of the showing faces will equal "7"? This is a little harder problem, but craps players know the answer. We reason this way: There are 36 combinations of faces that could come up with the repetitive roll of the die, like a "1" on both faces (1,1) or a "1" on one face and a "3" on the other (1,3). There are six combinations that total "7" (1,6; 6,1; 2,5; 5,2; 3,4; 4,3) out of a possible 36, so the
The probability that no combination of any numbers will show up on a roll of the dice is 0; the probability that any combination of
|
Combination Number |
Face Value #1 |
Face Value #2 |
Sum of Face Values |
|---|---|---|---|
|
1 |
1 |
1 |
2 |
|
2 |
2 |
1 |
3 |
|
3 |
3 |
1 |
4 |
|
4 |
4 |
1 |
5 |
|
5 |
5 |
1 |
6 |
|
6 |
6 |
1 |
7 |
|
7 |
1 |
2 |
3 |
|
8 |
2 |
2 |
4 |
|
9 |
3 |
2 |
5 |
|
10 |
4 |
2 |
6 |
|
11 |
5 |
2 |
7 |
|
12 |
6 |
2 |
8 |
|
13 |
1 |
3 |
4 |
|
14 |
2 |
3 |
5 |
|
15 |
3 |
3 |
6 |
|
16 |
4 |
3 |
7 |
|
17 |
5 |
3 |
8 |
|
18 |
6 |
3 |
9 |
|
.... |
.... |
.... |
.... |
|
36 |
6 |
6 |
12 |
|
Notes: The number "7" is the most frequent of the "Sum of Face Values," repeating once for each pattern of 6 values of the first die, for a total of 6
|
|||
|
Although not apparent from the abridged list of values, the number "5" appearing in the fourth-down position in the first pattern moves up one position each pattern repetition and thus has only 4 total appearances in 36 combinations. |
|||
The exercise of flipping
The relative frequency interpretation of probability leads to the following equation as the general definition of probability:
p( A ) = N( A )/N( M )
where p(
A
) is probability of outcome (or event)
A
, N(
A
) = number of times
A
occurs in the population, and N(
M
) = number of
Let's begin with OR. Using the relative frequency mathematics already developed, what is the probability of the event "5"? There are only four outcomes, so the event "5" probability is 4/36 or 1/9.
To make it more interesting, let's say that event "5" is the desired result of one of two study
Now let's expand the event to a "5" or a "7". Event "7" is our second study team, with
It is axiomatic in the mathematics of probability that if two events are
p( A OR B ) = p( A ) + p( B )
Now let's discuss AND. Consider the probability of the schedule event "rolling a 6 on one face AND rolling a 1 on the other face." The probability of rolling a "6" on one die is 1/6, as is the probability of rolling a "1" on the other. For the event to be true (that is, both a "6" and a "1" occur), both outcomes have to come up. Rolling the first die gives only a one-in-six chance of a "6"; there is another one-in-six chance that the second die will come up "1", altogether a one-in-six chance times a one-in-six chance, or 1/36. Of course 1/36 is very much less than 1/6. If the outcomes above were about our study teams finishing on time, one observing for a "6" on one die and the other observing for a "1" on the other die, then the probability of both finishing on time is the product of their individual probabilities. Requiring them both to finish on time
Another
p( A AND B ) = p( A * B ) = p( A ) * p( B )
We can now go one step further and consider the situation where events
A
and
B
are not mutually exclusive (that is,
A
and
B
might occur together sometimes or perhaps overlap in some way). Figure 2-1 illustrates the case. As an example, let's continue to observe pairs of dice, but the experiment will be to toss three die at once. All die
p( A OR B ) = p( A ) + p( B ) - p( A * B )
Figure 2-1:
Overlapping Events.
Notice that if A and B are mutually exclusive, then p( A * B ) = 0, which gives a result consistent with the earlier equation given for the OR situation.
Looking at the above equation, the savvy project manager recognizes that risk has increased for achieving a successful outcome of either
A
or
B
since the possibility of their joint occurrence, overlap, or collision of
A
and
B
takes away from the sum of p(
A
) + p(
B
). Such a situation could come up in the case of two resources providing inputs to the project, but if they are provided together, then they are not useful. Such collisions or race conditions (a
When A and B are not independent, then one becomes a condition on the outcome of the other. For example, the question might be: What is the probability of A given the condition that B has occurred? [3]
Consider the situation where there are 12 marbles in a jar, 4 black and 8 white.
The probability of a black marble on each of the first two draws is the AND of the probabilities of the first and second draw. We write the equation:
p( B and A ) = p( B ) * p( A B )
where B = event "black on the first draw," A = event "black on second draw, given black on first draw," and the notation "" means "given." Filling in the numbers, we have:
p( B and A ) = (4/12) * (3/11) = 1/11
Consider the project situation given in Figure 2-2. There we see two
p( A 10 * B ) = p( B A 10 ) * p( A 10 ) = 0.4 * 0.45 = 0.18
Figure 2-2:
Conditions in Task Schedules.
where A 10 = event of Task 1 finishing "on time + 10 days" and B = event of Task 2 finishing on time if Task 1 finishes "on time + 10 days." Conclusion: there is less than one chance in five that both events A 10 and B will happen. The project manager's risk management focus is on avoiding the outcome of B finishing late by managing Task 1 to less than "on time + 10 days."
There is a lot more to know about conditional probabilities because they are very useful to the project manager in decision making and planning. We will hold further discussion until Chapter 4, where conditional probabilities will be used in decision trees and tables.
To this point, we have been using a number of conventions adopted for probability analysis:
All quantitative probabilities lie in the range between the numbers 0 (absolute certainty that an outcome will not occur) and 1 (absolute certainty that an outcome will occur).
The lower case "p" is the notation for probability; it stands for a number between 0 and 1 inclusively. Typically, "p" is
If "p" is the probability that project outcome A will happen, then "1-p" is the probability that project outcome A will not occur. We then have the following equation: p + (1-p) = 1 at all times. More rigorously, we write: p( A ) + [1-p( A )] = 1, where the notation p( A ) means "probability that A will occur."
"1-p" is the probability that something else, say project outcome
B
, will happen instead of
A
. After all, there cannot be a
Sometimes it is said: B is in the (1-p) space of A , or the project manager might ask his or her team: "What is in the (1-p) space of A ?"
Project managers must always account for all the scope and be aware of all the outcomes. Thus: p( A ) + p( B ) = 1. A common mistake made by the project team is to not define or identify B , focusing exclusively on A .
Of course, there may be more possibilities than just B in the (1-p) space of A . Instead of just B , there might be B , C , D , E , and so forth. In that case, the project manager or risk analyst must be very careful to obey the following equation:
[1-p( A )] = p( B ) + p( C ) + p( D ) + p( E ) + ...
The error that often occurs is that the right side sums up too large. That is, we have the condition:
p( A ) + p(1- A ) > 1
On the right side, there are either too many possibilities identified, their respective probabilities are too large, or on the left side, the probabilities of A are misjudged. It is left to the project team to recognize the error of the situation and take corrective measures.
What about statements like "there is a 20% chance of rain or snow today in the higher
[2]
In the real world, events "5" and "7" could be the
[3] A might be one approach in a project and B might be another approach. Such a situation comes up often in projects in the form of "if, then, else" conditional branching.
[4]
A finish-to-start precedence is a notation from precedence diagramming methodology. It denotes the situation that the finish of the first task is required before the start of the second (successor) task can ensue. It is often referred to as a "waterfall" relationship because, when drawn on paper, it gives the appearance of a cascade from one task to the other. More on critical
[4] A Guide to the Project Management Body of Knowledge (PMBOK Guide) — 2000 Edition, Project Management Institute, Newtown Square, PA, chap. 6, p. 69.