Georgia Tech Intro Analytics Modeling - ISYE-6501
Intro Analytics Modeling - ISYE-6501 Homework 7

Question 10.1

Using the same crime data set uscrime.txt as in Questions 8.2 and 9.1, find the best model you can using (a) a regression tree model, ... and (b) a random forest model. In R, you can use the tree package or the rpart package, and the randomForest package. For each model, describe one or two qualitative takeaways you get from analyzing the results (i.e., don’t just stop when you have a good model, but interpret it too).

Answer:

(a) a regression tree model

R code:

Plots:

Conclusion: Pruning the tree increased the residual mean deviance, implying a worse fit to the training data.

Plot:

Conclusion: It is best to use the maximum number of terminal nodes, 7, as it gives the lowest error.

Analysis: R² (0.72) was large. However, overfitting may be a problem. “Po1” is an important predictor. NW is more important than Pop. The model used only 3 predictors in assembling the tree.

(b) a random forest model
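Returning to part (a): the tree-fitting and pruning comparison described in the conclusions can be sketched roughly as follows. This is not the original answer's code, only a minimal illustration using the tree package named in the question. It assumes that uscrime.txt has a header row and a response column named Crime, and the helper names (train_sse, train_r2, n_leaves, compare_pruning) and the choice best = 4 are hypothetical, picked only for illustration.

```r
# Sketch only: assumes the 'tree' package is installed and that uscrime.txt
# has a header row and a response column named Crime (both assumptions).
library(tree)

# Training sum of squared errors of a fitted tree on data frame df.
train_sse <- function(fit, df) {
  sum((df$Crime - predict(fit, newdata = df))^2)
}

# Training R^2 = 1 - SSE / SST.
train_r2 <- function(fit, df) {
  1 - train_sse(fit, df) / sum((df$Crime - mean(df$Crime))^2)
}

# Number of terminal nodes (leaves) in a tree object.
n_leaves <- function(fit) {
  sum(fit$frame$var == "<leaf>")
}

# Fit a full tree, prune it to about `best` leaves, and compare training fit.
compare_pruning <- function(df, best) {
  full   <- tree(Crime ~ ., data = df)
  pruned <- prune.tree(full, best = best)
  list(
    full   = full,
    pruned = pruned,
    leaves = c(full = n_leaves(full), pruned = n_leaves(pruned)),
    sse    = c(full = train_sse(full, df), pruned = train_sse(pruned, df)),
    r2     = c(full = train_r2(full, df), pruned = train_r2(pruned, df))
  )
}

# Only runs if the data file is present in the working directory.
if (file.exists("uscrime.txt")) {
  uscrime <- read.table("uscrime.txt", header = TRUE)
  res <- compare_pruning(uscrime, best = 4)  # best = 4 is an arbitrary example
  print(res$leaves)
  print(res$sse)
  print(res$r2)
  print(summary(res$full))  # lists predictors used and residual mean deviance

  # Cross-validated deviance by tree size, for choosing the number of leaves.
  set.seed(1)
  cv <- cv.tree(res$full)
  print(cbind(size = cv$size, dev = cv$dev))
}
```

Training SSE cannot go down when a tree is pruned to one of its subtrees, so a comparison on training data alone will tend to favour the largest tree. The cross-validated deviance from cv.tree is probably the fairer basis for choosing the number of terminal nodes, especially given the overfitting concern raised in the analysis.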
Uploaded on: Sep 03, 2022
Number of pages: 15