Engineering > QUESTIONS & ANSWERS > Georgia Institute Of Technology - ISYE 6501-2019 Midterm Quiz 1 With Correct Answers. (All)

Georgia Institute Of Technology - ISYE 6501-2019 Midterm Quiz 1 With Correct Answers.

Document Content and Description Below

11/11/2019 Step 2: Midterm Quiz 1 - GT Students (Launch Proctortrack first before taking the Midterm Quiz 1) | Step 2: Midterm Quiz 1 - GT St… Step 2: Midterm Quiz 1 - GT Students (Launch Procto... rtrack Õrst before taking the Midterm Quiz 1) 95 Minute Time Limit Instructions Work alone. Do not collaborate with or copy from anyone else. You may use any of the following resources: One sheet (both sides) of handwritten (not photocopied or scanned) notes If any question seems ambiguous, use the most reasonable interpretation (i.e. don't be like Calvin): Good Luck!    Question 0 -- Practice with Drag & Drop 0 points possible (ungraded)  Keyboard Help Some of the quiz questions are Drag-and-Drop. You'll need to drag one or more answers to a location. Some answers might not be used at all, and some answers will be used once. To get full credit you might need to drag more than one answer to some locations, just one answer to other locations, and some locations might not have any correct answers. Please do this quick practice question. The question will give you feedback to make sure you've done it correctly, but the real quiz questions will not. FEEDBACK  Correctly placed 3 items.  Good work! You have completed this drag and drop problem. Note that: (1) There are two places you could've put (x=2,y=3); either one would be correct. (2) One location (x+y=2) had nothing dragged to it. Another location had two answers dragged to it. (3) One choice (x=1,y=7) was not dragged anywhere, since it wasn't correct for anything. You have used 6 of 10 attempts.  Drag each model or method to a category of question it is commonly used for. For models/methods that have more than one correct category, choose any one correct category; for models/methods that have no correct category listed, do not drag them. FEEDBACK  Correctly placed 8 items.  Misplaced 1 item. Submit You have used 1 of 1 attempts.  Reset  Show Answer CUSUM Principal component analysis Support vector machine k-means ARIMA CART Exponential smoothing k-nearest-neighbor Linear regression Logistic regression Random forest Cross validation GARCH Question 2 2.19/3.0 points (graded) Select all of the following models that are designed for use with time series data:  You have used 1 of 1 attempt  Did not place 3 required items.  Good work! You have completed this drag and drop problem.  Final attempt was used, highest score is 9.0 k-nearest-neighbor Principal component analysis ARIMA  k-means CUSUM  Logistic regression GARCH  Exponential smoothing  Random forest Linear regression Support vector machine Answers are displayed within the problem Information for Questions 3a, 3b, 3c Figures A and B show the training data for a soft classiÕcation problem, using two predictors (x and x ) to separate between black and white points. The dashed lines are the classiÕers found using SVM. Figure A uses a linear kernel, and Figure B uses a nonlinear kernel that required Õtting 16 parameter values. Figure A Figure B Question 3a 2.4/3.0 points (graded) 3a. Select all of the following statements that are true.  1 2 Figure A's classiÕer is based only on the value of x2 . Figure A has fewer classiÕcation errors in the training data. Figure A's classiÕer has a wider margin in the training data.  11/11/2019 Step 2: Midterm Quiz 1 - GT Students (Launch Proctortrack first before taking the Midterm Quiz 1) | Step 2: Midterm Quiz 1 - GT St… Answers are displayed within the problem Answers are displayed within the problem  You have used 1 of 1 attempt Question 3b 2.25/3.0 points (graded) 3b. Select all of the following statements that are true.  You have used 1 of 1 attempt Question 3c 1.5/3.0 points (graded) 3c. Select all of the following statements that are true. Figure A's classiÕer incorrectly classiÕes exactly 4 white points in the training data.  Figure A shows that the black point (7.2,1.4) is an outlier.  Figure B's classiÕer has a narrower margin in the training data.  Figure B's classiÕer is more likely to be over-Õt.  Figure B's classiÕer incorrectly classiÕes exactly 5 white points in the training data. Figure B shows that the black point (7.2,1.4) should be white. Answers are displayed within the problem Answers are displayed within the problem  You have used 1 of 1 attempt Question 3d 3.0/3.0 points (graded) In the soft classiÕcation SVM model where we select coeÞcients ... to minimize 3d. Select each of the following cases when we would want to decrease the value of .  You have used 1 of 1 attempt A new point at (3,3) would be classiÕed as white by Figure B's classiÕer.  A new point at (3,3) would be classiÕed as white by a -nearest-neighbor algorithm for . k 5 ≤ k ≤ 10 In Figure A, if the training data had 1000 more white points to the right of the classiÕer, a 1000-nearest-neighbor algorithm would classify a new point at (3,3) as white.  We want a larger margin even if it induces more classiÕcation errors in the training set. We are willing to accept a smaller margin in order to reduce classiÕcation errors in the training set. Neither. Answers are displayed within the problem Question 3e 0.99/3.0 points (graded) 3e. In the hard classiÕcation SVM model, it might be desirable to not put the classiÕer in a location that has equal margin on both sides... (select all correct answers):  You have used 1 of 1 attempt Information for Questions 4a, 4b, 4c Seven diàerent regression models have been Õtted, using diàerent sets of variables. The Õgure below shows the resulting adjusted R-squared value for various models, as measured by cross-validation. ...because moving the classiÕer will usually result in fewer classiÕcation errors in the validation data. ...because moving the classiÕer will usually result in fewer classiÕcation errors in the test data. ...when the costs of misclassifying the two types of points are signiÕcantly diàerent.  Question 4a 0.0/3.0 points (graded) Which of the models would you expect to perform worst on a test data set?  You have used 1 of 1 attempt Model 6, because it has a slightly lower Adjusted than Model 5 and uses one more predictor. R2 Model 2, because it's the simplest of those with a high Adjusted R2 . Model 5, because it has the highest Adjusted R2 . Model 1, because it has much lower Adjusted R2 .  Answers are displayed within the problem Answers are displayed within the problem Question 4b 1.5/3.0 points (graded) Under which of the following conditions would Model 3 be the most appropriate to use (select all correct answers)?  You have used 1 of 1 attempt Additional Information for Question 4c The table below shows the Akaike Information Criterion (AIC), Corrected AIC, and Bayesian Information Criterion (BIC) for each of the models.  Data collection for x6 is too expensive for it to be used in the model. Government regulations require using x2 for this sort of model. It is important to Õnd the simplest good model that includes x3 .  The value of x3 is not known in time for use in the model.  Answers are displayed within the problem 6 -1.31 1.35 15.90 7 0.19 3.71 19.31 Question 4c 0.75/3.0 points (graded) Based on the table above and the Õgure shown for Question 4a, select all of the following statements that are correct.  You have used 1 of 1 attempt Information for all parts of Question 5 Atlanta’s main library has collected the following day-by-day data over the past six years (more than 2000 data points): x = Number of books borrowed from the library on that day x = Day of the week x = Temperature x = Amount of rainfall x = Whether the library was closed that day x = Whether public schools were open that day Adjusted (see Õgure above 4a) and BIC (see table above 4c) give qualitatively opposite evaluations of Model 7.  R2 Among Models 2 and 4, AIC suggests that Model 2 is as likely as Model 4 to be better. e(-5.67-(-4.77))/2 = 63.8% Among Models 2 and 4, AIC suggests that Model 4 is as likely as Model 2 to be better.  e(-5.67-(-4.77))/2 = 63.8% BIC suggests that Model 7 is very likely to be better than Model 5. Answers are displayed within the problem Question 5a 2.0/2.0 points (graded) Select all data that are categorical (including binary data):  You have used 1 of 1 attempt Questions 5b and 5c 0.0/4.0 points (graded) The library believes that if it was hotter yesterday, fewer books will be borrowed today (and if it was cooler yesterday, more books will be borrowed today), so they add a new predictor: = temperature the day before b. If the library is correct that on average, if it was hotter yesterday, fewer books will be borrowed today (and if it was cooler yesterday, more books will be borrowed today), what sign (positive or negative) would you expect the new predictor's coeÞcient to have? Number of books borrowed from the library on that day Day of the week Temperature Amount of rainfall Whether the library was closed that day Whether public schools were open that day Positive, because the response (books borrowed today) is a positive number 11/11/2019 Step 2: Midterm Quiz 1 - GT Students (Launch Proctortrack first before taking the Midterm Quiz 1) | Step 2: Midterm Quiz 1 - Answers are displayed within the problem  c. Does make the model autoregressive?  You have used 1 of 1 attempt Information for Question 5d The library believes that as the temperature gets either too cold or too hot, more people come indoors to the library to borrow books. They have Õt the data to a quadratic function (see the Õgure below). Negative, because higher values of decrease the response (books borrowed today)  x7 Positive, because higher values of increase the response (books borrowed today) x7 x7 No, because the model does not use previous response data to predict the day response.  t Yes, because the model uses day t - 1 data to predict day t circulation. Yes, because the model uses both day and day temperature data as predictors. t - 1 t Answers are displayed within the problem Question 5d 0.0/4.0 points (graded) How would you incorporate the new information above into the library's regression model?  You have used 1 of 1 attempt Question 5e-i,ii 6.0/6.0 points (graded) Add a (temperature)2 variable to the model.  Replace the temperature variable with a (temperature)2 variable in the model. Change the model to estimate the square root of the books borrowed, as a function of temperature, day of the week, inches of rainfall, whether the day is a holiday, and whether schools were open. Answers are displayed within the problem The library has built a triple exponential smoothing (Holt-Winters) model of the number of books borrowed each day, using a multiplicative annual cycle of seasonality. i. Every Wednesday, local schools bring children to visit the library and check out books. So, the number of books borrowed on those days is much higher than an average day. The model only has an annual seasonal cycle length, not a weekly one. Is the model likely to over-predict or under-predict books borrowed on Wednesdays? ii. Is the model likely to over-predict or under-predict books borrowed on Thursdays, when it is open for a full business day? You have used 1 of 1 attempt Question 5e-iii 3.0/3.0 points (graded) iii. Aside from seasonal and trend eàects, the library believes that the random variation in books borrowed each day is small. Should they expect the best value of (the baseline smoothing constant) to be: Over-predict Under-predict Neither Over-predict Under-predict Neither Answers are displayed within the problem Answers are displayed within the problem You have used 1 of 1 attempt Information for Questions 5f, 5g, 5h The library would like to compare the regression and exponential smoothing models to determine which is a better predictor, using the mean absolute error |(books borrowed) – (model’s estimate)|/n as a measure of prediction quality. Question 5f 0.0/4.0 points (graded) Select the best of the following four options for splitting the data:  You have used 1 of 1 attempt  70% for training, 15% for validation, 15% for test  15% for training, 70% for validation, 15% for test 15% for training, 15% for validation, 70% for test 55% for training, 15% for cross-validation, 15% for validation, 15% for test Question 5h 3.0/4.0 points (graded) The person who built these models discovered that although the regression model performed much better on the training set, the two models performed about the same on the validation set: Question 5g 4/4 points (graded)  Keyboard Help Match each data set with its purpose. Drag the purpose next to the appropriate data set. Answers are displayed within the problem Mean absolute error (training set) Mean absolute error (validation set) Regression model 117 152 Exponential smoothing model 148 153 Select all of the reasonable suggestions below:  You have used 1 of 1 attempt Question 5i 2.01/3.0 points (graded) Fewer books are borrowed on Fridays than any other day. The library would like to determine whether there has been a change in the Friday eàect on borrowing, over the past forty years (for this part only, assume there are forty years of data available). Select all of the approaches that might reasonably be correct. To choose between the models, we should see which one does better on the training set. The regression model is clearly better, because it does better on the training set and about the same on the validation set. The exponential smoothing model is probably Õt too much to random patterns (i.e., it is overÕt), because it performs much worse than the regression model on the training set. If there had been 20 models, the one that performed best on the validation set would probably not perform as well on the test set as it did on the validation set. on the number of additional books borrowed on the average Friday compared to the average other day over the past forty years.  11/11/2019 Step 2: Midterm Quiz 1 - GT Students (Launch Proctortrack first before taking the Midterm Quiz 1) | Step 2: Midterm Quiz 1 - GT St… https://courses.edx.org/courses/course-v1:GTx+ISYE6501x+2T2019b/courseware/a8e7783f3b6d4b21bbf5720bb6f02a92/7d4f7b9ba5e84153af… 19/24 Answers are displayed within the problem  You have used 1 of 1 attempt Information for Questions 6a, 6b A logistic regression model was built to model the probability that a retailer’s inventory of a popular product will run out before the next delivery from the manufacturer, based on a number of factors (amount of current inventory, past demand, promotions, etc.). If the logistic regression’s output is greater than a threshold value p, the retailer pays an additional amount for a quick delivery to avoid running out. There are three confusion matrices below, for three diàerent threshold values of p: Question 6a 0.0/3.0 points (graded) Let be the cost of paying for a quick delivery (if the model's output is above p). Let be the cost of running out of inventory. Select all of the statements that are correct: Use exponential smoothing (with ) to Õnd the seasonal mulitplier values for each Friday, and use CUSUM on those values.  L = 7 Ct Build a regression model for each of the forty years, and use CUSUM on the coeÞcients of the Friday variable.  11/11/2019 Step 2: Midterm Quiz 1 - GT Students (Launch Proctortrack first before taking the Midterm Quiz 1) | Step 2: Midterm Quiz 1 - GT St… https://courses.edx.org/courses/course-v1:GTx+ISYE6501x+2T2019b/courseware/a8e7783f3b6d4b21bbf5720bb6f02a92/7d4f7b9ba5e84153af… 20/24 Answers are displayed within the problem Answers are displayed within the problem  You have used 1 of 1 attempt Question 6b 3.0/3.0 points (graded) The retailer’s primary goal right now is to build its market share, so it estimates the cost C of running out to be 20 times worse than the cost D of paying for an early delivery (i.e., C = 20D). Which threshold value of p would you suggest?   Question 7 8/8 points (graded)  Keyboard Help The Õgures below each show a data set that will be used in k-means clustering algorithms (where distance between values is important). Each data set has two attributes. For each data set, drag to it the data preparations that are needed for k-means to work well on the data set. 11/11/2019 Step 2: Midterm Quiz 1 - GT Students (Launch Proctortrack first before taking the Midterm Quiz 1) | Step 2: Midterm Quiz 1 - GT St… https://courses.edx.org/courses/course-v1:GTx+ISYE6501x+2T2019b/courseware/a8e7783f3b6d4b21bbf5720bb6f02a92/7d4f7b9ba5e84153af… 21/24 Information for Questions 8a, 8b FEEDBACK  Correctly placed 4 items.  Good work! You have completed this drag and drop problem.  Final attempt was used, highest score is 8.0 A regression tree approach was used to describe the eàect of 7 diàerent covariates (x1 through x7) on monthly sales. The tree is shown below. In each model, only the signiÕcant covariates are shown. Question 8a 2.0/5.0 points (graded) Select all of the following statements that are true according to this regression tree:  You have used 1 of 1 attempt The eàect of x7 is the same regardless of the values of x1 and x2 .  x1 is only relevant when its value is small (less than 3.5). x4 is irrelevant when predicting monthly sales.  The regression tree can predict monthly sales better for values of greater than or equal to 3.5 than for values of less than 3.5.  x1 x1 The eàect of on sales is greatest when is small and is large ( and ). x3 x1 x2 x1 < 3.5 x2 > 2.0 11/11/2019 Step 2: Midterm Quiz 1 - GT Students (Launch Proctortrack first before taking the Midterm Quiz 1) | Step 2: Midterm Quiz 1 - GT St… Answers are displayed within the problem Answers are displayed within the problem Question 8b 2.01/3.0 points (graded) A random forest model was built for the same purpose, using the same 7 covariates. Which of the following statements are true?  You have used 1 of 1 attempt Information for Question 8c A data scientist has run principal component analysis on the 7 covariates, with the following results: Component Eigenvalue The random forest model uses many trees, but returns a single tree solution that can be analyzed. The random forest model uses a single tree solution. The random forest model can report the relative importance of each variable.  11/11/2019 Step 2: Midterm Quiz 1 - GT Students (Launch Proctortrack first before taking the Midterm Quiz 1) | Step 2: Midterm Quiz 1 - GT St… © All Rights Reserved Answers are displayed within the problem Question 8c 2.0/4.0 points (graded) Select all of the following statements that are correct:  You have used 1 of 1 attempt End of Quiz -- no answer required 0 points possible (ungraded) You have reached the end of the quiz. Please make sure you have hit "submit" on every answer. If you have remaining time and want to go back over any of the questions, do that before clicking "Next". It is likely that the Õrst principal component has much more predictive power than principal components 2-7.  It is likely that the Õrst original covariate has much more predictive power than covariates 2-7. It is likely that the last original covariate has much less predictive power than covariates 1-6. The Õrst principal component cannot contain information from all 7 original covariates. [Show More]

Last updated: 1 year ago

Preview 1 out of 24 pages

Add to cart

Instant download

GET ASSIGNMENT HELP