Can a simple probability experiment provide a credible basis for more complex decisions?
Question: Will the number 3 come up when a fair die is thrown?
Answer: There is a (1/6) probability of the number 3 coming up. How do we know? We arrive at this number for the probability based on a significant amount of background information available to us. This can be summarized as:
- The die has 6 faces in total
- One of the 6 faces must come up when the die is thrown
- The number 3 appears on one of these faces and hence, is 1 of the 6 possibilities
- Since the die is fair, each face is equally likely to come up
But what happens if we must answer the question with no background information available to us? Since the question is in “Will…?” form, we can legally respond with a Yes or No and not explain our answer any further. The probability for each answer can be notionally taken as (1/2) in the absence of any further information regarding the possible events.
Another instance: “Will Spain beat Germany in the football match today?” A football fan (or a researcher with access to historical information) may be able to give a prediction based on how good each team is and how they have played against each other in the past. But for someone who doesn’t follow football at all, a Yes or a No are both equally acceptable answer. The only knowledge he/she needs is that football is a game played between two teams and that one team will win. Neglecting the chance of a draw, for simplicity, both teams have equal probability of winning. This is the same probability as Heads or Tails appearing in a coin toss. It is not uncommon to see people base important decisions on the result of a coin toss, even when the probability of each outcome of those decision is much more complex.
For example, a Bollywood/Hollywood villain may use a coin toss to decide whether to kill the hero or not. Even though there is a 50% chance of the coin telling him to kill the hero or spare him, the actual decision he needs to take is much more complex. Can we retain the probabilistic simplicity of the coin toss, while applying the principle to more complex statistical problems? Consider the following example.
Example: A real life situation
A student is appearing for an examination, where each question offers five choices of answer - A, B, C, D and E. He is unprepared, has no knowledge of the subject and must rely entirely on guesswork to answer the questions. He finds a (fair) coin in his pocket and decides to use it as a tool to get some help from fate. Since he has 5 choices to deal with and the coin has only 2 sides, he devises a simple algorithm.
- Is the answer A? Heads: Yes. Stop tossing | Tails: No. Proceed to next step
- Is the answer B? Heads: Yes. Stop tossing | Tails: No. Proceed to next step
- Is the answer C? Heads: Yes. Stop tossing | Tails: No. Proceed to next step
- Is the answer D? Heads: Yes. Stop tossing | Tails: No. Proceed to next step
- Is the answer E? If not A, B, C nor D, it must be E
- Number of choices per answer = 5
- Number of tosses = 5 – 1 = 4 (according to the algorithm)
Probability analysis
Outcome
of coin toss
|
Result
|
Random Variable (C)
|
Probability
|
|||
1
|
2
|
3
|
4
|
|||
H
|
-
|
-
|
-
|
A
|
1
|
½
|
T
|
H
|
-
|
-
|
B
|
2
|
¼
|
T
|
T
|
H
|
-
|
C
|
3
|
1/8
|
T
|
T
|
T
|
H
|
D
|
4
|
1/16
|
T
|
T
|
T
|
T
|
E
|
5
|
1/16
|
- Expected value of C = E(C) = {(1/2) x 1} + {(1/4) x 2} + {(1/8) x 3} + {(1/16) x 4} + {(1/16) x 5} = 1.9375 à 2
- Approximating this as C = 2, this tends to choice B in the sample space
- Variance of C = V(C) = E(C2) – {E(C)}2 = S2(C) = 1.4355
- Standard deviation of C = S(C) = 1.1981
Reliability of the coin toss as an estimator
Intuitively, the above expectation indicates that if the same method is used a large number of times, the student is most likely to get answer B to most questions. Unless a test-creator consciously takes the decision to make each option equally likely, the inherent bias in each human being can be expected to result in an uneven distribution in the solution.
If a student is very familiar with a professor and his/her past examinations, it is possible to work out a pattern and guess which answer is most likely. Let’s say that a particular professor tends to have 15 As, 20 Bs, 50 Cs, 10 Ds and 5 Es in his solution. In that case, the student may choose to adjust the algorithm to include this incomplete information he has. The simplest way to do this is to re-assign the random variables this: C:1, B:2, A:3, D:4 and E:5.
Therefore, the probability of getting a C answer using his algorithm is very high.
No comments:
Post a Comment