1.3.3 Finding Probabilities
Suppose that we are given a random experiment with a sample space $S$. To find the probability of an event, there are usually two steps: first, we use the specific information that we have about the random experiment. Second, we use the probability axioms. Let's look at an example. Although this is a simple example and you might be tempted to write the answer without following the steps, we encourage you to follow the steps.
Example
You roll a fair die. What is the probability of $E=\{1,5\}$?
 Solution

Let's first use the specific information that we have about the random experiment. The problem states that the die is fair, which means that all six possible outcomes are equally likely, i.e., $$P(\{1\})=P(\{2\})=\cdots=P(\{6\}).$$ Now we can use the axioms of probability. In particular, since the events $\{1\}, \{2\}, \cdots, \{6\}$ are disjoint we can write
$1$ $=P(S)$ $ = P\bigg(\{1\} \cup \{2\} \cup\cdots\cup \{6\}\bigg)$ $=P(\{1\})+P(\{2\})+\cdots+P(\{6\})$ $=6P(\{1\})$.
Thus, $$P(\{1\})=P(\{2\})=\cdots=P(\{6\})=\frac{1}{6}.$$ Again since $\{1\}$ and $\{5\}$ are disjoint, we have $$P(E)=P(\{1,5\})=P(\{1\})+P(\{5\})=\frac{2}{6}=\frac{1}{3}.$$

It is worth noting that we often write $P(1)$ instead of $P(\{1\})$ to simplify the notation, but we should emphasize that probability is defined for sets (events) not for individual outcomes. Thus, when we write $P(2)=\frac{1}{6}$, what we really mean is that $P(\{2\})=\frac{1}{6}$.
We will see that the two steps explained above can be used to find probabilities for much more complicated events and random experiments. Let us now practice using the axioms by proving some useful facts.
Example
Using the axioms of probability, prove the following:
 For any event $A$, $P(A^c)=1P(A)$.
 The probability of the empty set is zero, i.e., $P(\emptyset)=0$.
 For any event $A$, $P(A) \leq 1$.
 $P(AB)=P(A)P(A \cap B)$.
 $P(A \cup B)=P(A)+P(B)P(A \cap B)$, (inclusionexclusion principle for $n=2$).
 If $A \subset B$ then $P(A) \leq P(B)$.
 Solution

 This states that the probability that $A$ does not occur is $1P(A)$.
To prove it using the axioms, we can write
$1$ $ = P(S)$ $\textrm{(axiom 2)}$ $=P(A \cup A^c)$ $\textrm{(definition of complement)}$ $=P(A)+P(A^c) $ $\textrm{(since $A$ and $A^c$ are disjoint)}$  Since $\emptyset=S^c$, we can use part (a) to see that $P(\emptyset)=1P(S)=0$. Note that this makes sense as by definition: an event happens if the outcome of the random experiment belongs to that event. Since the empty set does not have any element, the outcome of the experiment never belongs to the empty set.
 From part (a), $P(A)=1P(A^c)$ and since $P(A^c) \geq 0$ (the first axiom), we have $P(A) \leq 1$.
 We show that $P(A)=P(A \cap B)+P(AB)$. Note that the two sets $A \cap B$ and $AB$
are disjoint and their union is $A$ (Figure 1.17). Thus, by the
third axiom of probability
\begin{align} P(A)&=P\big((A \cap B) \cup (AB)\big) &(\textrm{ since }A=(A \cap B) \cup (AB))\\ &=P(A \cap B)+P(AB) &\textrm{ (since $A \cap B$ and $AB$ are disjoint)}. \end{align} Note that since $AB=A \cap B^c$, we have shown $$P(A)=P(A \cap B)+P(A \cap B^c).$$ Note also that the two sets $B$ and $B^c$ form a partition of the sample space (since they are disjoint and their union is the whole sample space). This is a simple form of law of total probability that we will discuss shortly and is a very useful rule in finding probability of some events.  Note that $A$ and $BA$ are disjoint sets and their union is $A \cup B$. Thus,
$P(A \cup B)$ $ =P(A \cup (BA))$ $\textrm{($A \cup B=A \cup (BA$))}$ $=P(A)+P(BA)$ $\textrm{(since $A$ and $BA$ are disjoint)}$ $=P(A)+P(B)P(A \cap B) \hspace{20pt}$ $\textrm{(by part (d))}$  Note that $A \subset B$ means that whenever $A$ occurs $B$ occurs, too. Thus intuitively we
expect that $P(A) \leq P(B)$. Again the proof is similar as before. If $A \subset B$, then $A \cap B=A$.
Thus,
$P(B)$ $ =P(A \cap B)+P(BA)$ $\hspace{40pt}$ $\textrm{(by part (d))}$ $=P(A)+P(BA)$ $\textrm{(since $A=A \cap B$)}$ $\geq P(A)$ $\textrm{(by axiom 1)}$
 This states that the probability that $A$ does not occur is $1P(A)$.
To prove it using the axioms, we can write

Example
Suppose we have the following information:
 There is a $60$ percent chance that it will rain today.
 There is a $50$ percent chance that it will rain tomorrow.
 There is a $30$ percent chance that it does not rain either day.
 The probability that it will rain today or tomorrow.
 The probability that it will rain today and tomorrow.
 The probability that it will rain today but not tomorrow.
 The probability that it either will rain today or tomorrow, but not both.
 Solution

An important step in solving problems like this is to correctly convert them to probability language. This is especially useful when the problems become complex. For this problem, let's define $A$ as the event that it will rain today, and $B$ as the event that it will rain tomorrow. Then, let's summarize the available information:
 $P(A)=0.6$,
 $P(B)=0.5$,
 $P(A^c \cap B^c)=0.3$
 The probability that it will rain today or tomorrow: this is $P(A \cup B)$. To find this
we notice that
$P(A \cup B)$ $=1P\bigg((A \cup B)^c \bigg) \hspace{40pt}$ $\textrm{by Example 1.10}$ $=1P(A^c \cap B^c)$ $\textrm{by De Morgan's Law}$ $=10.3$ $=0.7$
 The probability that it will rain today and tomorrow: this is $P(A \cap B)$. To find
this we note that
$P(A \cap B)$ $=P(A)+P(B)P(A \cup B) \hspace{30pt}$ $\textrm{by Example 1.10}$ $=0.6+0.50.7$ $=0.4$
 The probability that it will rain today but not tomorrow: this is $P(A \cap B^c)$.
$P(A \cap B^c)$ $ = P(AB)$ $=P(A)P(A \cap B) \hspace{60pt}$ $\textrm{by Example 1.10}$ $=0.60.4$ $=0.2$
 The probability that it either will rain today or tomorrow but not both: this is
$P(AB)+P(BA)$. We have already found $P(AB)=.2$. Similarly, we can find $P(BA)$:
$P(BA)$ $=P(B)P(B \cap A) \hspace{60pt}$ $\textrm{by Example 1.10}$ $=0.50.4$ $=0.1$
Thus, $$P(AB)+P(BA)=0.2+0.1=0.3 \hspace{40pt}$$

In this problem, it is stated that there is a $50$ percent chance that it will rain tomorrow. You might have heard this information from news on the TV. A more interesting question is how the number $50$ is obtained. This is an example of a reallife problem in which tools from probability and statistics are used. As you read more chapters from the book, you will learn many of these tools that are frequently used in practice.
InclusionExclusion Principle:The formula $P(A \cup B)=P(A)+P(B)P(A \cap B)$ that we proved in Example 1.10 is a simple form of the inclusionexclusion principle. We can extend it to the union of three or more sets.
Inclusionexclusion principle:
 $P(A \cup B )= P(A)+P(B)P(A \cap B)$,
 $P(A \cup B \cup C) = P(A) + P(B) + P(C)$
$  P(A \cap B)  P(A \cap C)  P(B \cap C) + P(A \cap B \cap C)$
Generally for $n$ events $A_1, A_2,\cdots,A_n$, we have
$P\biggl(\bigcup_{i=1}^n A_i\biggr) =\sum_{i=1}^n P(A_i)\sum_{i < j}P(A_i\cap A_j) $ $ \hspace{32pt} +\sum_{i < j < k}P(A_i\cap A_j\cap A_k)\ \cdots\ +(1)^{n1}\, P\biggl(\bigcap_{i=1}^n A_i\biggr)$