1
ETB1100 Business Statistics, S2 2014
Assignment 2
Topic: Inference & Simple linear regression
Due date: 11 am, Thursday of Week 11 (16th October 2014)
• ASSIGNMENTS WITHOUT SIGNED COVER SHEETS WILL NOT BE ACCEPTED
• This assignment is to be submitted in a print form.
• A penalty of 10% per day will apply to late assignments, unless extension on assignment
due date has been sought with valid documentary evidence.
• Post your completed assignment in assignment box labelled ‘ETB1100 Bus Statistics’ and
located on the first floor of Berwick campus building 901 near Faculty of Bus & Eco Student
Services. (You must be able to provide a replacement copy of the assignment if this is
requested. Make sure you keep a printed copy and an electronic copy as back-up.)
Assignment Weight: 15%
Note: This assignment will be marked out 75 marks.
Data for Assignment 2: See file ETB1100_A2_Data.xls in Assignment 2 Section on the unit’s
website in Moodle.
Note: In answering parts of the questions where statistical inference is conducted, you must include
some or all of the following (as appropriate) for full credit:
1. the reason for using a particular formula or distribution
2. the null and alternative hypotheses
3. the critical value(s) used
4. the level of significance
5. the distribution of the statistic employed in a test, and
6. the conclusion.
• Answers to Question 3 of this assignment must be supported by Regression Summary Output
generated using Excel.
___________________________________________________________________________
Question 1 (7+2+2+6+4+4 = 25 marks)
The manager of a local fast-food restaurant is interested in improving the services provided to
customers who use the restaurant’s drive-up windows. As a first step in this process, the manager asks
his assistant to record the time (in minutes) it takes to serve a large number of customers at the final
window in the facility’s drive up system. For a random sample of 50 customers selected during the
busiest hours of the day, the service times are given in worksheet Qn1 of file ETB1100_A2_Data.xls.2
(a) Using this sample of service times, obtain a 90% confidence interval estimate for the mean
service time of all customers arriving during the busiest hours of the day at this fast-food
operation.
(b) Interpret the confidence interval constructed in part (a).
(c) What assumption did you make in obtaining the confidence interval in part (a)?
(d) Now using your random sample of 10 customers and their service times from Assignment 1
Question 2 obtain a 90% confidence interval estimate for the mean service time of all
customers arriving during the busiest hours of the day at this fast-food operation.
(e) Compare the widths of the confidence intervals obtained in parts (a) and (d), commenting and
giving a possible reason for the discrepancy.
(f) Explain which confidence interval, part (a) or (d), is likely to produce more stable (reliable)
estimate for the mean service times of all customers arriving during the busiest hours of the
day at this fast-food operation and why.
Question 2 (7+4+4 = 15 marks)
A quality control manager at a factory that manufactures memory cards for digital cameras wishes to
determine whether the mean data transfer rate of a large shipment of cards is equal to 10MB/s (that is,
10 Megabytes per second).
The quality control manager finds the data transfer rates for a random sample of 64 memory cards,
which are given in worksheet Qn 2 of file ETB1100_A2_Data.xls.
(a) On the basis of the given sample, can the manager conclude at the 5% level of significance
that there is no evidence of the mean data transfer rate to be different from 10 MB/s? Perform
the hypothesis test using the critical value approach.
(b) What is the p-value for the test you performed in (a)? Interpret this value.
(c) Would your conclusion change if it was known that the population standard deviation of the
transfer rates is 0.8? Show all details to support your answer.
Question 3 (4+2+6+4+4+6+6+3 = 35 marks)
The owner of the Pizza Delight restaurant chain would like to understand the relationship between the
quantities sold (Y) and the price of its speciality thick-crust pizza (X). The owner has gathered data on
the number of pizzas sold last month and the pizza price for his 20 outlets in Melbourne, Australia.
These data are provided in worksheet Qn3 of file ETB1100_A2_Data.xls.
(a) Obtain scatter plot of Y against X and describe the type of relationship between these variables.
The owner of the Pizza Delight restaurant chain has proposed the fitting of a linear regression model
between the number of pizzas sold and the price.
(b) On the basis of analysis in part (a), explain why do you think that it is reasonable to fit the model
proposed by the owner? 3
(c) Provide the theoretical linear regression model linking variables ‘Quantity sold’ and ‘Price’
which can be used by the owner. Use Excel to fit this model and report the estimated equation.
Has the sign of the estimate for ?1 turned out as indicated by the scatter plot?
(d) Interpret the estimated coefficients for ?0 and ?1.
(e) Report the value of coefficient of determination. Explain, what information about the usefulness
of this fitted model is conveyed by this value? Is this a useful model?
(f) At 5% level of significance, can we say that there is a significant negative linear relationship
between the quantity sold and the price of pizza? Show all details to support your answer.
(g) Showing all relevant details obtain a 90% confidence interval for the slope ?1? On the basis of
this interval, can we say that there is a significant negative linear relationship between the
quantity sold and the price of pizza? Explain.
(h) Predict quantity sold if the owner sets the price of thick-crust pizza at $20. Is this predicted value
a reliable estimate? Explain.
__________________________________