
# Georgia Institute Of Technology ISYE 6501 Homework 10 Complete Solutions - Introduction To Analytics Modeling - GTX ISYE 6501

### Document Content and Description Below

ISYE6501 HOMEWORK 10

Question 14.1

The breast cancer data set breast-cancer-wisconsin.data.txt from http://archive.ics.uci.edu/ml/machine-learning-databases/breast-cancer-wisconsin/ (description a... t http://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+%28Original%29 ) has missing values.

1. Use the mean/mode imputation method to impute values for the missing data.
2. Use regression to impute values for the missing data.
3. Use regression with perturbation to impute values for the missing data.
4. (Optional) Compare the results and quality of classification models (e.g., SVM, KNN) built using (1) the data sets from questions 1, 2, and 3; (2) the data that remains after data points with missing values are removed; and (3) the data set when a binary variable is introduced to indicate missing values.

Question 15.1

Describe a situation or problem from your job, everyday life, current events, etc., for which optimization would be appropriate. What data would you need?

I worked at a bank where our fraud agents needed to review direct deposits that we had identified as suspicious. For that purpose, I built a logistic regression model to identify the deposits that are more likely to be suspicious. The challenge is that we can only use the model score to prioritize the queue. Deposits with higher amounts might also have a higher priority. Moreover, deposits are processed in batches at different times of the day, so the quantity of deposits can differ depending on the time of day, the day of the week, and the week of the month, and agents have limited time to review the suspicious deposits. Since agents are a limited resource, optimization might be appropriate for estimating the number of agents needed at any given time of the day.

The data that I would need is:

* The list of deposits at any given time/day, with the amount and the fraud score of each.
* The team budget, the maximum number of agents on weekdays vs. weekends, and the average time spent reviewing each deposit.
* The minimum penetration rate (reviewed deposits over total deposits)
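The three imputation approaches named in Question 14.1 (items 1 to 3) can be sketched as follows. This is a minimal illustration on made-up toy data, not the solution from the document: the function names, the `None` missing-value marker, and the use of a single predictor are all assumptions chosen to keep the example dependency-free. A regression on the real data set would typically use several predictors.

```python
import random
from statistics import mean, mode

# Toy data, invented for illustration: x is fully observed, y has gaps (None).
x = [1, 2, 3, 4, 5, 6]
y = [2.1, 3.9, None, 8.2, None, 11.8]

def impute_mean(values):
    """Replace None with the mean of the observed values (numeric data)."""
    fill = mean(v for v in values if v is not None)
    return [fill if v is None else v for v in values]

def impute_mode(values):
    """Replace None with the most common observed value (categorical data)."""
    fill = mode(v for v in values if v is not None)
    return [fill if v is None else v for v in values]

def fit_line(xs, ys):
    """Least-squares fit of y = a + b*x; returns (a, b, residual std. dev.)."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((xi - mx) ** 2 for xi in xs)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(xs, ys))
    b = sxy / sxx
    a = my - b * mx
    residuals = [yi - (a + b * xi) for xi, yi in zip(xs, ys)]
    sd = (sum(r * r for r in residuals) / max(len(xs) - 2, 1)) ** 0.5
    return a, b, sd

def impute_regression(xs, ys, perturb=False, seed=0):
    """Fill missing ys with the fitted value; optionally add Gaussian noise."""
    observed = [(xi, yi) for xi, yi in zip(xs, ys) if yi is not None]
    a, b, sd = fit_line([p[0] for p in observed], [p[1] for p in observed])
    rng = random.Random(seed)  # seeded so the perturbed result is repeatable
    filled = []
    for xi, yi in zip(xs, ys):
        if yi is None:
            yi = a + b * xi
            if perturb:
                # Noise scaled by the residual spread of the fitted line.
                yi += rng.gauss(0, sd)
        filled.append(yi)
    return filled

print(impute_mean(y))
print(impute_regression(x, y))
print(impute_regression(x, y, perturb=True))
```

Mean/mode imputation gives every missing entry the same value, plain regression places each imputed value exactly on the fitted line, and the perturbed variant adds noise so the imputed values keep some of the variability seen in the observed data.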

Last updated: 1 year ago


### Document information

Connected school, study & course

May 16, 2022

Number of pages

10

Written in

#### Seller

##### Tessa

