subject

The file UniversalBank. xls contains data on 5000 customers. The data include customer demographic information (age, income, etc.), the customer's relationship with the bank (mortgage, securities account, etc.), and the customer response to the last personal loan campaign (Personal Loan). Among these 5000 customers, only 480(= 9.6%) accepted the personal loan that was offered to them in the earlier campaign. In this exercise we focus on two predictors: Online (whether or not the customer is an active user of online banking services) and Credit Card (abbreviated CC below) (does the customer hold a credit card issued by the bank), and the outcome Personal Loan (abbreviated Loan below). Partition the data into training (60%) and validation (40%) sets.

a. Create a pivot table for the training data with Online as a column variable, CC as a row variable, and Loan as a secondary row variable. The values inside the cells should convey the count (how many records are in that cell).

b. Consider the task of classifying a customer that owns a bank credit card and is actively using online banking services. Looking at the pivot table, what is the probability that this customer will accept the loan offer? [This is the probability of loan acceptance (Loan = 1) conditional on having a bank credit card (CC = 1) and being an active user of online banking services (Online = 1).]

c. Create two separate pivot tables for the training data. One will have Loan (rows) as a function of Online (columns) and the other will have Loan (rows) as a function of CC.

d. Compute the following quantities [P(A|B) means "the probability of A given B"]:

i. P(CC = 1|Loan = 1) (the proportion of credit card holders among the loan acceptors)

ii. P(Online = 1|Loan = 1)

iii. P(Loan = 1) (the proportion of loan acceptors)

iv. P(CC = 1|Loan = 0)

v. P(Online = 1|Loan = 0)

vi. P(Loan = 0)

e. Use the quantities computed above to compute the naïve Bayes probability P(Loan = 1|CC = 1, Online = 1).

f. Compare this value with the one obtained from the crossed pivot table in (b). Which is a more accurate estimate?

g. In XLMiner, run naive Bayes on the data. Examine the "Conditional probabilities" table, and find the entry that corresponds to P(Loan = 1|CC = 1, Online = 1). Compare this to the number you obtained in (e).

ansver
Answers: 1

Another question on Computers and Technology

question
Computers and Technology, 22.06.2019 15:00
Atool that matches persoal skills qualities interests and talets to a career is called a
Answers: 1
question
Computers and Technology, 22.06.2019 21:50
Answer the following questions regarding your system by using the commands listed in this chapter. for each question, write the command you used to obtain the answer. a. what are the total number of inodes in the root filesystem? how many are currently utilized? how many are available for use? b. what filesystems are currently mounted on your system? c. what filesystems are available to be mounted on your system? d. what filesystems will be automatically mounted at boot time?
Answers: 1
question
Computers and Technology, 22.06.2019 23:30
What are listed in the vertical columns across the top of the event editor? a. file names b. conditions c. check marks d. action types
Answers: 1
question
Computers and Technology, 23.06.2019 02:00
In the context of an internet connection, llc stands for leased line connection liability limited company local loop complex local loop carrier
Answers: 1
You know the right answer?
The file UniversalBank. xls contains data on 5000 customers. The data include customer demographic i...
Questions
question
Mathematics, 09.03.2022 16:40
question
Mathematics, 09.03.2022 16:40
question
Mathematics, 09.03.2022 16:40
question
World Languages, 09.03.2022 16:40
question
Mathematics, 09.03.2022 16:40
question
Mathematics, 09.03.2022 16:40
question
Chemistry, 09.03.2022 16:40
question
Social Studies, 09.03.2022 16:40
Questions on the website: 13722366