CS 1538 – Fall 2009 – Practice Questions and Guidelines for Final Exam

There will be a variety of questions on the final exam, which will be divided into a closed book and an open book section.  You will first be given the closed book section to complete.  When you finish that section and turn it in you will be given the open book section, which you will have the remainder of the exam time to complete. 

Below is an explanation of the types of questions and an example problem or two for each type.  The solutions follow in the latter part of this document.  Closed book problems are given first, followed by open book problems.  For additional examples of closed book problems, see the Fill in the Blanks + True / False and the Short Answer practice questions from the midterm exam (as well as those sections on the midterm exam itself).

I strongly recommend giving the problems a good try before looking at the solutions.  Note that the problems shown are only example problems and are not necessarily representative in number of the questions that will actually be on the exam.  Thoroughly read over the Review Sheet to see the material that you need to know for this exam.


CLOSED BOOK EXAMPLE PROBLEMS (see midterm practice questions and midterm exam for example problems from first part of the term)

Fill in the Blanks and True/False – these problems will deal mainly with definitions and possibly simple computations.  For false questions you will be expected to explain WHY they are false.

1)    If we determine a function F-1 that maps values from U[0,1) into some other distribution, we are using the ___________________________________________________ to generate that distribution.

 

2)    A b, or Type II error is the probability that _________________________________________________

 

3)    For a given a confidence level, we can REDUCE the b error value by ______________________________ _______________________________________________.

 

4)    The process of testing the implementation of a simulation model to ensure that it has been programmed correctly and debugged is called model __________________________________.

 

5)    If an output value of interest, Y, is averaged over 10 runs of a simulation, we call the result, Y_bar, the _________________________________________ of the actual mean value.

 

6)    (T/F) Interarrival times for a simulation are typically represented by decision variables.

 

7)    The t-distribution approaches a standard normal distribution as the degrees of freedom approach ¥.

 

8)    Comparing alternative system designs using common random numbers will likely have _______________________________ [higher, lower] sample variances than comparing using independent random numbers.

 


Short Answer – these problems will typically include definitions, comparisons and explanations.  There may be some calculations, but most of the calculations will be in the Problem Solving below. (see midterm practice questions and midterm exam for example problems from first part of the term)

1)    For pseudo-random number generators to be useful, they should produce sequences over U[0,1) that are uniform and independent.  Informally state what each of these properties means.  Give and justify an example of a sequence that is uniform over U[0,1) but not independent, and a sequence that is independent but not uniform over U[0,1).

2)    Consider the notion of Face Validity of a simulation model.  Explain what this is and give an example of a model that fails the Face Validity test.

3)    Consider the following result of a confidence interval test:

Y_bar – ta/2,f (s_bar(Y_bar)) ≤ µ ≤ Y_bar + ta/2,f (s_bar(Y_bar))

Explain in words (and pictures, if it will help) what this means.  Be detailed and specific, identifying each term and what the overall expression means.


OPEN BOOK EXAMPLE PROBLEMS

Problem Solving – these problems will be similar in nature to problems done in lecture and assigned in Written Assignment 3 and Written Assignment 4.  They will involve use of formulas / equations / procedures that we discussed in lecture as well as some problem-analyzing and problem-solving skills.   Be sure to bring your text and other printed documents with you to prepare you for these problems.  I will also allow you to view the online notes and to use Excel on the lab computers.  I have only included three problems below.  Refer to your homework problems and textbook / in-class examples for additional problems.

1)    A proposed random number generator produced the following sequence of values over U[0,1):

0.3   0.8  0.1  0.9  0.5

Using the Kolmogorov-Smirnov test, determine whether or not this sequence is uniform over U[0,1) with a level of significance of a = 0.05.  Be specific and show all work in the process.

2)    Develop a generator for the random variable whose cdf is

F(x) =  x/3                               for 0 ≤ x ≤ 2

            2/3 + (x-2)/24              for 2 < x ≤ 10

3)    Problem 10.2 from the Banks text:  System data for the job shop of Exercise 1 (no need to know Exercise 1 to answer this problem, however) revealed that the average time spent by a job in the shop was approximately 4 working days.  The model made the following predictions on seven independent replications, for average time spent in the shop:

3.70   4.21  4.35  4.13  3.83  4.32  4.05

Is the model output consistent with system behavior?  Conduct a statistical test using a level of significance a = 0.01.


Miscellaneous – problems here may include simple traces, writing small code segments or other problems that don't fit into the other 3 categories.

Consider the set of interarrival times below.  Draw a histogram of these times (choosing a reasonable interval for the bins) and make a preliminary guess on the distribution based on that histogram.

62, 47, 71, 45, 65, 43, 58, 35, 55, 31, 51, 25, 52, 56, 49

 


 

 

 

 

 

 

SOLUTIONS

Don't look below until you have given all of the problems a GOOD try!



 

 

 

 

 

 


CLOSED BOOK PROBLEMS

Fill in the Blanks and True/False – these problems will deal mainly with definitions and possibly simple computations.  For false questions you will be expected to explain WHY they are false.

1)    If we determine a function F-1 that maps values from U[0,1) into some other distribution, we are using the _________INVERSE TRANSFORM TECHNIQUE_______________ to generate that distribution.

 

2)    A b, or Type II error is the probability that ___H0 will NOT be rejected when it should be_____

 

3)    For a given a confidence level, we can REDUCE the b error value by ____increasing the number of runs___.

 

4)    The process of testing the implementation of a simulation model to ensure that it has been programmed correctly and debugged is called model ________verification______________.

 

5)    If an output value of interest, Y, is averaged over 10 runs of a simulation, we call the result, Y_bar, the _________point estimate__________________ of the actual mean value.

 

6)    (T/F) Interarrival times for a simulation are typically represented by decision variables.  FALSE – interarrival times are typically uncontrollable variables, since they are determined external to the system.

 

7)    The t-distribution approaches a standard normal distribution as the degrees of freedom approach ¥.  TRUE

 

8)    Comparing alternative system designs using common random numbers will likely have ________lower__________________ [higher, lower] sample variances than comparing using independent random numbers.


Short Answer – these problems will typically include definitions, comparisons and explanations.  There may be some calculations, but most of the calculations will be in the Problem Solving below.

1)    For pseudo-random number generators to be useful, they should produce sequences over U[0,1) that are uniform and independent.  Informally state what each of these properties means.  Give and justify an example of a sequence that is uniform over U[0,1) but not independent, and a sequence that appears to be independent but not uniform over U[0,1).

Answer:  If the values are uniform, then the probability that a value will be anywhere within the range of values is equally likely.  If they are independent, then previous values do not (appear to) affect future values and there are no discernable patterns in the values.

Sequence that is uniform but not independent:  0.05, 0.15, 0.25, 0.35, 0.45, 0.55, 0.65, 0.75, 0.85, 0.95.  This sequence is uniform because it covers the range completely (it would easily pass a Komolgorov-Smirnov test for uniformity) but is not independent because each Xi+1 = Xi + 0.1.

Sequence that is independent but not uniform: 0.25, 0.12, 0.38, 0.53, 0.32, 0.44, 0.39, 0.04, 0.26, 0.19.  This sequence appears to be independent (there are no obvious correlations) but is clearly not uniform, since the maximum value is 0.53 and the average will be much less than the expected average of ~0.5.

2)    Consider the notion of Face Validity of a simulation model.  Explain what this is and give an example of a model that fails the Face Validity test.

Answer: See slide 238 in notes and p. 362 of text for definitions.  Consider as an example the Panera simulation.  If the input distribution from the doors is the same during all hours of operation, the model would not be valid since it is obvious to informed (and even uninformed) parties that this distribution should be different at different times of day.

3)    Consider the following result of a confidence interval test:

Y_bar – ta/2,f (s_bar(Y_bar)) ≤ µ ≤ Y_bar + ta/2,f (s_bar(Y_bar))

Explain in words (and pictures, if it will help) what this means.  Be detailed and specific, identifying each term and what the overall expression means.

 

Answer: The value µ is the "actual" mean value of the output variable of interest, Y.  Y_bar is the point estimate of this mean and s_bar is the standard error of the point estimator.  ta/2,f  is the value in the standard t-distribution with f degrees of freedom.  The product, ta/2,f (s_bar(Y_bar)), is the half-length of the confidence interval.  Overall, the expression is saying that the actual mean, µ , will be within a half-length of the point estimate with a  probability 1-a.

 


OPEN BOOK PROBLEMS

Problem Solving – these problems will be similar in nature to problems done in lecture and assigned in Written Assignment 3 and Written Assignment 4.  They will involve use of formulas / equations / procedures that we discussed in lecture as well as some problem-analyzing and problem-solving skills.   Be sure to bring your text with you to prepare you for these problems.  I will also allow you to view the online notes and to use Excel on the lab computers.  I have only included three problems below.  Refer to your homework problems and textbook / in-class examples for additional problems.

1)    A proposed random number generator produced the following sequence of values over U[0,1):

0.3  0.8  0.1  0.9  0.5

Using the Kolmogorov-Smirnov test, determine whether or not this sequence is uniform over U[0,1) with a level of significance of a = 0.05.  Be specific and show all work in the process.

Answer: First we sort the values in increasing order, yielding: 0.1, 0.3, 0.5, 0.7, 0.9.  Next we find the D+ and D- values, getting the maximum positive and negative differences of our sequence from the uniform function F(x) = x.  Setting things up, we get:

Ri:                    0.1       0.3       0.5       0.8       0.9
i/N:                  0.2       0.4       0.6       0.8       1.0
i/N-Ri:             0.1       0.1       0.1       0.0       0.1       D+ = max of these = 0.1
Ri-(i-1)/N:       0.1       0.1       0.1       0.2       0.1       D- = max fo these = 0.2

D = max(D+, D-) = 0.2

Looking up the critical value from Table A.8 for a = 0.05 with n = 5, we get 0.565.  Since D is less than this value, our values pass the K-S test.
 

2)    Develop a generator for the random variable whose cdf is

F(x) =  x/3                               for 0 ≤ x ≤ 2

            2/3 + (x-2)/24              for 2 < x ≤ 10

Answer: Consider the subrange 0 ≤ x ≤ 2.  Based on the cdf, let R = x/3.  Plugging in the range endpoints for x, we get 0 ≤ R ≤ 2/3.  Solving for x, we get x = 3R.
Consider the subrange 2 < x ≤ 10.  Based on the cdf, let R = 2/3 + (x-2)/24.  Plugging in the range endpoints for x, we get 2/3 < R ≤ 1.  Solving for x, we get x = 2 + 24(R – 2/3) = 24R – 14.  Now we can complete our generator as:

X =      3R                   0 ≤ R ≤ 2/3
            24R – 14         2/3 <  R ≤ 1

3)    Problem 10.2 from the Banks text:  System data for the job shop of Exercise 1 (no need to know Exercise 1 to answer this problem, however) revealed that the average time spent by a job in the shop was approximately 4 working days.  The model made the following predictions on seven independent replications, for average time spent in the shop:

3.70     4.21     4.35     4.13     3.83     4.32     4.05

Is the model output consistent with system behavior?  Conduct a statistical test using a level of significance a = 0.01.

Answer:  We assume µ = 4.  Conduct a t-test using the 7 values against the mean (using the TTEST function in EXCEL), revealing an answer of 0.38398.  Using the TINV function on this value with 6 degrees of freedom, we get the t-value = 0.913476.   Since our a level of significance is 0.01, we use the TINV function on this value with 6 degrees of freedom to get the critical value in this case = 3.7074.  Since our t-value is well under the critical value, we do not reject the null hypothesis.

Note that this problem could also be solved using the formulas in the text, in the following fashion:

Y_bar = 4.08  Sy = 0.2441 (using the formulas on p. 369 of the text)
From this we can use Equation 10.2 to get t0 = (Y_bar – µ)/(Sy/n1/2) = (4.084 – 4)/(0.2441/71/2) = 0.91.  The rest of the answer is the same as above.


Miscellaneous – problems here may include simple traces, writing small code segments or other problems that don't fit into the other 3 categories.

Consider the set of interarrival times below.  Draw a histogram of these times (choosing a reasonable interval for the bins) and make a preliminary guess on the distribution based on that histogram.

62, 47, 71, 45, 65, 43, 58, 35, 55, 31, 51, 25, 52, 56, 49

Answer:

Try intervals of 10 as follows:

20-29   30-39   40-49   50-59   60-69   70-79
-------   ------    ------    ------    ------    ------
    1         2          4          5          2          1

Without analysis, the result appears to be normal.  Naturally, analysis is needed to verify this assertion.