1y ago

63 Views

53 Downloads

537.59 KB

25 Pages

Transcription

Review for Final

Chapter 2: Data quantifiers: sample mean, sample variance, sample standard deviation Quartiles, percentiles, median, interquartile range Dot diagrams Histogram Boxplots

Chapter 3: Set theory, set operations, union, intersection, complement, Venn diagram Counting principle, addition principle and multiplication principle Permutation and combination Conditional probability, Independent events, Bayes theorem False positive and total probability theorem

Chapter 4: Random variables, probability on random variables, accumulative propability Bernoulli distribution Binomial distribution Hypergeometrical distribution Negative binomial distribution Poisson distribution Mean, variance (standard deviation), moments Chebyshev theorem

Chapter 5: Continuous random variables Uniform distribution Exponential distribution Normal distribution, standard normal distribution π and πΌ, the use of Table 3

Chapter 6: Sample, sample mean, sample variance Law of large numbers Central limit theorem Computing probability of sample mean When population variance is not known, π‘-distribution and sample variance

Chapter 7: Inference statistics Point estimation Interval estimation

Chapter 8: Test of hypothesis Null hypothesis and alternate hypothesis Type-I and Type-II errors.

Example: A company owns 400 laptops. Each laptop has an 8% probability of not working. You randomly select 20 laptops for your salespeople. (a) What is the likelihood that 5 will be broken? (b) What is the likelihood that they will all work? (c) What is the likelihood that they will all be broken? Analysis: working and not working for one computer is a Bernoulli random variable Not working: π 0.08 Working: π 1 π 0.92 With 20 laptops, it is a binomial distribution with π 20 20 0.085 0.9215 5 20 (b). π π 0 π 0; 20,0.08 0.9220 0.9220 0 20 (c). π π 20 π 20; 20,0.08 0.0820 0.0820 20 (a). π π 5 π 5; 20,0.08

Example: An audio amplifier contains six transistors. It has been ascertained that three of the transistors are faulty but it is not known which three. Amy removes three transistors at random, and inspects them. What is the probability that two of them are faulty? Analysis: this a hypergeometric distribution problem, the pool has two different subsets, total in the pool is π 6, 3 faulty and 3 non-faulty, pick up π 3, find the probability of having two (π 2) faulty ones. The probability function for hypergeometric distribution is: π π π π₯ π π₯ π π π₯ π π Determine variable and parameters: π 6, π 3, π 3, π₯ 2 3 π π 2 2 3 3! 3! 4 1 2! 1! 2! 1! 3 6 6! 6! 3! 3! 3

Example: An oil company conducts a geological study that indicates that an exploratory oil well should have a 20% chance of striking oil. What is the probability that the first strike comes on the third well drilled? Analysis: each drill is a Bernoulli distribution (success or fail) with π 0.2. The probability of first success in π trials is the geometric distribution problem. The probability of first π success in π trials is the negative binomial distribution problem. For negative binomial distribution: π₯ 1 π π₯ π π π π 1 Where π 1, it becomes geometric distribution. π π π₯ In this problem, π₯ 3, π 1, π 0.2 (π 0.8). π π 3 0.2 0.82

Example: A new superman MasterCard has been issued to 2000 customers. Of these customers, 1500 hold a Visa card, 500 hold an American Express card and 40 hold a Visa card and an American Express card. Find the probability that a customer chosen at random holds a Visa card, given that the customer holds an American Express card. Analysis: this is a conditional probability problem (find probability of Holding American Express given the person is holding Visa). π(π΄ π΅) π π΄π΅ π(π΅) 1500 500 40 π π΄ ,π π΅ ,π π΄ π΅ 2000 2000 2000 π π΄π΅ 40/2000 40 2 500/2000 500 25

Example: Hazel thinks she may be allergic to eating peanuts, and takes a test that gives the following results: For people that really do have the allergy, the test says "Yes" 90% of the time For people that do not have the allergy, the test says "Yes" 5% of the time ("false positive") If 1.3% of the population have the allergy, and Hazel's test says "Yes", what are the chances that Hazel really does have the allergy? Analysis: this is a false positive problem. The question is to find the conditional probability π(π΄ π΅) , where π΅ is being tested positive, and π΄ is she really does have allergy. From total probability: π π΅ π π΅ π΄ π(π΄) π π΅ π΄ π(π΄ ) 0.9 0.013 0.05 0.987 0.061 π π΄π΅ π(π΄ π΅) 0.013 0.9 0.192 19.2% π(π΅) 0.061 Hazel has only 19.2 percent chance of being really have allergy.

Suppose that a random variable π has the probability distribution density function 0 π₯ 1 π π₯ π/π₯ 4 π₯ 1 (a). Find the value of π. (b). Find the probability of π π 1 . (c). Find the probability of π 2 π 4 . (d). Find the mean and variance of the random variable π. Solution: (a). π π₯ ππ₯ π ππ₯ 1 π₯4 π 3π₯ 3 π 1, π 3 1 3 (b). π π 1 0 4 1 1 π₯ 3 8 64 2 3 3 (d). π 1 π₯ π₯ 4 ππ₯ 2 π₯ 2 1.5 1 3 πΈ π 2 π₯ 2 4 ππ₯ 3π₯ 1 3.0, 1 π₯ (c). π 2 π 4 1 4 3 ππ₯ 2 π₯4 πππ π πΈ π 2 π2 3 2.25 0.75

Example: The new Endeavor SUV has been recalled because 5% of the cars experience brake failure. The Tahoe dealership has sold 200 of these cars. What is the probability that fewer than 4% of the cars from Tahoe experience brake failure? Analysis: this is actually a binomial distribution problem, but can be solved as normal distribution problem. In binomial distribution, π 0.05 π 0.95 , π 200. π₯ 0.04 200 8, π π π π 8 F Z π From the approximation of binomial distribution as normal distribution, we have π ππ 200 0.05 10, π 2 πππ 200 0.05 0.95 9.5 π 3.08 8 10 π 0.64 3.08 From Table 3, can find the value of πΉ π .

Example: To estimate the spending of people during Christmas, a department store takes a random sample of 30 people. It finds out that the mean spending of the sample is 800 dollars and the standard deviation is 200. Assume that peopleβs spending is normally distributed, with 98 percent confidence, over what interval does the mean of population spending lie? Analysis: The problem has the sample size 30, the mean and standard deviation are both with the sample, further, it assumes that the population is normally distributed, therefore it is a t-distribution problem (Table 4 will be used). The equation relevant to this problem is: π π π₯ π‘πΌ/2 π π₯ π‘πΌ/2 π π π₯ 800, π 200, π 30,1 πΌ 98%, πΌ/2 0.01. Find from Table 4: π‘0.01 2.462, 200 200 800 2.462 π 800 2.462 30 30 Question: what is π£ 30, and Table 4 cannot give π‘πΌ/2 ?

Example: A sample of size 10 is used to estimate the mean height of a plant which has standard deviation 10 inches. What is the probability (or confidence) that the error is less than 5 inches in this estimation. Analysis: The problem has the sample size 10, the standard deviation is with the Population, that is π, therefore this is a central limit theorem problem (Table 3 will be used). The equation relevant to this problem is: π πΈ π§πΌ/2 π πΈ 5, π 10, π 10. Find π§πΌ/2 , then find πΌ. π§πΌ/2 πΈ π 5 10 π 10 Confidence factor 1 πΌ

Example: The number of calls for service at the DMV counter follow the Poisson distribution. The average service rate is 2 people per minute. What is the probability that the time between two calls is (a). Less than 1 minute (b). Greater than 5 minutes? Analysis: between two calls there is no call, so this corresponds to the x 0 case in Poisson distribution and is proportional to π πΌπ‘ (π πΌπ‘), or π π‘ πΌπ πΌπ‘ , π‘ 0 Here πΌ 2. Solution: (a). π π‘ 1 (b). π π‘ 5 1 1 2π‘ ππ‘ π π‘ ππ‘ 2π 0 0 2π 2π‘ ππ‘ 5

Example: the number of customers arriving at a bank can be described by a Poisson distribution. An average of 4 customers arrive per minute. What is the probability that the time between arrivals of two customers will be a) 15 seconds? b) at least 30 seconds? Analysis: arrival is a Poisson process with π ππ , the probability with no customer Between a given time interval t is π0 π π π 0 π π ππ‘ 0! ππ‘ 4π‘ Therefore π π‘ ππ 4π (need to be normalized) 1 a) a 4; π π‘ 15 π π π‘ 4 π 4 1 b) π π‘ 30 π π π‘ 2 π 4 1/4 4π‘ π ππ‘ 0 4π‘ π ππ‘ 1/2 π 2 1 π 1

Example: A random sample of size 20 is taken from a population With uniform distribution: 0.2 0 π₯ 5 π π₯ 0 ππ‘βπππ€ππ π What will be the variance of the sample mean? Analysis: from the central limit theorem, the variance of the sample mean From a continuous population is: π2 πππ π π The question is then to find the variance π 2 of the uniform distribution. Solution: for uniform distribution π 2.5, π2 πΈ π₯2 π2 5 0.2π₯ 2 ππ₯ 2.52 0 π2 πππ π π

Example: computer break-down per year are integers, 0, 1, 2, 3, . Assume the mean number of computer break-down per year is 11.6 with standard deviation of 3.3. Using a normal distribution, approximate the probability that there will be at least 8 (8 or more) break-downs in a given year, and the break-down between 9 and 15. Analysis: this is a normal distribution problem, the key to solve this problem is To convert the random variable to the standard normal distribution. π π₯ 8 1 πΉ π§ π 9 π₯ 15 πΉ π§ 8 11.6 1 0.1379 0.8621 3.3 15 11.6 9 11.6 πΉ π§ πΉ 1.03 πΉ( 0.78) 3.3 3.3

Example: A library loses, on average, 6 books per year. What are the probabilities it loses (a) 4 books on a given year (b) 10 books over a 2 year period Analysis: this is a Poisson process problem: π ππ ππ₯ π π π π₯ π π₯! (a) πΌ 6. Ξ» 6 1 Therefore π(4; 6) (b) πΌ 6. 64 π 6 4! 0.134 Ξ» 6 2 12 Therefore π 10; 12 1210 π 12 10! πΉ 10; 12 πΉ 9; 12 0.134

Example: An insurance company is reviewing its current policy rates. When originally setting the rates they believed that the average claim amount was 1,800. They are concerned that the true mean is actually higher than this, because they could potentially lose a lot of money. They randomly select 40 claims, and calculate a sample mean of 1,950. Assuming that the standard deviation of claims is 500, and set πΌ 0.05, test to see if the insurance company should be concerned. π»0 : Average claim amount is less or equal to 1,800. π»1 : Average claim amount is greater than 1,800. Known conditions: π 1800, π₯ 1950, π 500, π 40, πΌ 0.05 (π§πΌ 1.96). One-sided test: π»0 : π₯ π π/ π π§πΌ π₯ π 1950 1800 1.89 1.98 π/ π 500/ 40 Therefore π»0 is true.

Example: Trying to encourage people to stop driving to campus, the university claims that on average it takes people 30 minutes to find a parking space on campus. I donβt think it takes so long to find a spot. In fact I have a sample of the last five times I drove to campus, and I calculated π₯ 20. Assuming that the time it takes to find a parking spot is normal, and that π 6 minutes, then perform a hypothesis test with level πΌ 0.10 to see if my claim is correct. π»0 : On average it takes 30 minutes to find parking spot π»1 : It takes less than 30 minutes to find a parking spot Known conditions: π 30, π₯ 20, π 6, π 5, πΌ 0.1 (π§πΌ 1.28). One-sided test: π»0 : π₯ π π π π§πΌ π₯ π 20 30 3.73 1.28 π/ π 6/ 5 Therefore π»0 is false. π»1 is true.

Example: A sample of 40 sales receipts from a grocery store has π₯ 137 and π 30.2. Use these values and level of significance as 0.01 to test whether or not the mean of sales at the grocery store are different from 150. π»0 : The average of sales is 150. π»1 : The average of sales is not 150. Known conditions: π 150, π₯ 137, π 30.2, π 40, πΌ 0.01 (π§πΌ/2 2.58). Two-sided test: π»0 : π§πΌ π₯ π π/ π π§πΌ π₯ π 137 150 2.72 2.58 π/ π 30.2/ 40 Therefore π»0 is false. π»1 is true.

Example: A company owns 400 laptops. Each laptop has an 8% probability of not working. You randomly select 20 laptops for your salespeople. (a) What is the likelihood that 5 will be broken? . spending of the sample is 800 dollars and the standard deviation is 200. Assume that people's spending is normally distributed, with 98 percent

Related Documents: