1. Financial Condition of Banks. The file Banks.csv includes data on a sample of 20 banks. The “Financial Condition” column records an expert's judgment of the financial condition of each bank. This outcome variable takes one of two possible values, weak or strong. The predictors are two ratios used in the financial analysis of banks: TotLns&Lses/Assets is the ratio of total loans and leases to total assets, and TotExp/Assets is the ratio of total expenses to total assets. The goal is to use the two ratios to classify the financial condition of a new bank.

Run a logistic regression model (on the entire dataset) that models the status of a bank as a function of the two financial measures provided. Specify the success class as weak (this is similar to creating a dummy that is 1 for financially weak banks and 0 otherwise), and use the default cutoff value of 0.5.

a. Consider a new bank whose total loans and leases/assets ratio = 0.6 and total expenses/assets ratio = 0.11. From your logistic regression model, estimate the following four quantities for this bank (use R to do all the intermediate calculations; show your final answers to four decimal places): the logit, the odds, the probability of being financially weak, and the classification of the bank (use cutoff = 0.5).
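The four quantities in part (a) are linked by simple transformations of the linear predictor. The following is a minimal sketch in R of that chain. The coefficient values `b0`, `b1`, and `b2` are made-up placeholders, not the estimates from Banks.csv; in practice they would be taken from `coef()` of the fitted `glm()` model.

```r
# Hypothetical coefficients (placeholders, NOT the fitted values from Banks.csv).
# In practice these would come from coef(glm(..., family = "binomial")).
b0 <- -14.0   # intercept (assumed)
b1 <- 9.0     # coefficient for TotLns&Lses/Assets (assumed)
b2 <- 80.0    # coefficient for TotExp/Assets (assumed)

# New bank from part (a)
lns_ratio <- 0.6
exp_ratio <- 0.11

logit <- b0 + b1 * lns_ratio + b2 * exp_ratio   # linear predictor
odds  <- exp(logit)                             # odds of being weak
prob  <- odds / (1 + odds)                      # equivalent to plogis(logit)
label <- ifelse(prob > 0.5, "weak", "strong")   # cutoff = 0.5

print(round(c(logit = logit, odds = odds, probability = prob), 4))
print(label)
```

With real estimates, the same four lines give the logit, odds, probability, and classification requested.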

b. The cutoff value of 0.5 is used in conjunction with the probability of being financially weak. Compute the threshold that should be used if we want to make a classification based on the odds of being financially weak, and the threshold for the corresponding logit.
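The conversion between the three scales follows directly from the standard definitions of odds and logit. Writing $p$ for the probability of being financially weak, a short derivation is:

$$
\text{odds} = \frac{p}{1-p}, \qquad \text{logit} = \ln(\text{odds})
$$

$$
p = 0.5 \;\Rightarrow\; \text{odds} = \frac{0.5}{1-0.5} = 1, \qquad \text{logit} = \ln(1) = 0
$$

Because both transformations are monotonically increasing in $p$, the rule "classify as weak when $p > 0.5$" is equivalent to "odds $> 1$" and to "logit $> 0$".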

c. When a bank that is in poor financial condition is misclassified as financially strong, the misclassification cost is much higher than when a financially strong bank is misclassified as weak. To minimize the expected cost of misclassification, should the cutoff value for classification (which is currently at 0.5) be increased or decreased?
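One way to reason about part (c) is to compare the expected cost of each decision. The symbols below are assumptions introduced for illustration: $p$ is the estimated probability that the bank is weak, $C_{ws}$ is the cost of classifying a weak bank as strong, and $C_{sw}$ is the cost of classifying a strong bank as weak. Assuming the estimated probabilities are reasonably well calibrated:

$$
E[\text{cost} \mid \text{classify as strong}] = p \, C_{ws}, \qquad
E[\text{cost} \mid \text{classify as weak}] = (1-p) \, C_{sw}
$$

$$
\text{classify as weak when } \; p \, C_{ws} > (1-p) \, C_{sw}
\;\Longleftrightarrow\; p > \frac{C_{sw}}{C_{sw} + C_{ws}}
$$

When $C_{ws} > C_{sw}$, the right-hand side is below 0.5, so the cost-minimizing cutoff moves down from 0.5.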

2. Competitive Auctions on eBay.com. The file eBayAuctions.csv contains information on 1972 auctions transacted on eBay.com during May–June 2004. The goal is to use these data to build a model that will distinguish competitive auctions from noncompetitive ones. A competitive auction is defined as an auction with at least two bids placed on the item being auctioned. The data include variables that describe the item (auction category), the seller (his or her eBay rating), and the auction terms that the seller selected (auction duration, opening price, currency, day of week of auction close). In addition, we have the price at which the auction closed. The goal is to predict whether or not an auction of interest will be competitive.

Data preprocessing. Create dummy variables for the categorical predictors. These include Category (18 categories), Currency (USD, GBP, Euro), EndDay (Monday–Sunday), and Duration (1, 3, 5, 7, or 10 days).

a. Create pivot tables for the mean of the binary outcome (Competitive?) as a function of the various categorical variables (use the original variables, not the dummies). Use the information in the tables to reduce the number of dummies that will be used in the model. For example, categories that appear most similar with respect to the distribution of competitive auctions could be combined.
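A pivot table of this kind can be sketched in base R with `aggregate()`. The data frame below is a toy stand-in with made-up values, and the column name `Competitive` is an assumption (the actual column in eBayAuctions.csv is labeled "Competitive?").

```r
# Toy data frame standing in for eBayAuctions.csv (values are made up).
auctions <- data.frame(
  Category    = c("Books", "Books", "Toys", "Toys", "Music", "Music"),
  Competitive = c(1, 0, 1, 1, 0, 0)
)

# Mean of the binary outcome per category = proportion of competitive auctions
pivot <- aggregate(Competitive ~ Category, data = auctions, FUN = mean)

# Sorting by the proportion makes similar categories easy to spot
print(pivot[order(pivot$Competitive), ])
```

Categories with similar proportions in the sorted table are candidates for being merged into a single dummy.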

b. Split the data into training (60%) and validation (40%) datasets. Run a logistic model with all predictors with a cutoff of 0.5.

c. If we want to predict at the start of an auction whether it will be competitive, we cannot use the information on the closing price. Run a logistic model with all predictors as above, excluding price. How does this model compare to the full model with respect to predictive accuracy?
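The workflow for the 60/40 split and the comparison of a model with and without closing price might look like the sketch below. All data are simulated so that the block runs on its own; the column names (`OpenPrice`, `SellerRating`, `ClosePrice`, `Competitive`) and the helper `accuracy()` are assumptions for illustration, not the names in eBayAuctions.csv.

```r
set.seed(1)  # for a reproducible split

# Synthetic stand-in for the auction data (all values simulated).
n <- 500
auctions <- data.frame(
  OpenPrice    = runif(n, 1, 100),
  SellerRating = runif(n, 0, 5000)
)
auctions$ClosePrice <- auctions$OpenPrice + rexp(n, rate = 1 / 20)
lin <- 0.05 * (auctions$ClosePrice - auctions$OpenPrice) - 1
auctions$Competitive <- rbinom(n, size = 1, prob = plogis(lin))

# 60% training / 40% validation
train_idx <- sample(seq_len(n), size = round(0.6 * n))
train <- auctions[train_idx, ]
valid <- auctions[-train_idx, ]

# Share of correct classifications on a dataset at a given cutoff
accuracy <- function(fit, data, cutoff = 0.5) {
  p <- predict(fit, newdata = data, type = "response")
  mean(as.integer(p > cutoff) == data$Competitive)
}

full_fit    <- glm(Competitive ~ ., data = train, family = "binomial")
noprice_fit <- glm(Competitive ~ . - ClosePrice, data = train, family = "binomial")

print(c(full     = accuracy(full_fit, valid),
        no_price = accuracy(noprice_fit, valid)))
```

Comparing both models on the same validation set keeps the accuracy figures directly comparable.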

d. Interpret the meaning of the coefficient for closing price. Does closing price have a practical significance? Is it statistically significant for predicting competitiveness of auctions? (Use a 10% significance level.)
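In a logistic model, exponentiating a coefficient gives the multiplicative change in the odds for a one-unit increase in that predictor, and the p-value sits in the `Pr(>|z|)` column of `summary(fit)$coefficients`. A minimal sketch on simulated data (the variables `price` and `y` are made up, not the auction data):

```r
set.seed(2)
price <- runif(300, 1, 200)                          # simulated closing prices
y     <- rbinom(300, 1, plogis(-1 + 0.01 * price))   # simulated binary outcome
fit   <- glm(y ~ price, family = "binomial")

# Coefficient table columns: Estimate, Std. Error, z value, Pr(>|z|)
coefs <- summary(fit)$coefficients
beta  <- coefs["price", "Estimate"]
pval  <- coefs["price", "Pr(>|z|)"]

print(exp(beta))     # multiplicative change in odds per one-unit price increase
print(pval < 0.10)   # TRUE if significant at the 10% level
```

An odds ratio very close to 1 can still be statistically significant, which is why practical and statistical significance are asked about separately.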

e. Use stepwise selection (use function step() in the stats package or function stepAIC() in the MASS package) and an exhaustive search (use function glmulti() in package glmulti) to find the model with the best fit to the training data. Which predictors are used?
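A stepwise search with `step()` from the stats package can be sketched as follows. The data are simulated (only `x1` affects the outcome by construction), so the variable names are placeholders. The exhaustive search with `glmulti()` needs a separate package and is not shown here.

```r
set.seed(3)
n <- 400
d <- data.frame(x1 = rnorm(n), x2 = rnorm(n), x3 = rnorm(n))
d$y <- rbinom(n, 1, plogis(0.5 + 1.2 * d$x1))   # only x1 matters in this simulation

# Start from the model with all predictors
full <- glm(y ~ ., data = d, family = "binomial")

# Stepwise search by AIC, allowing terms to be dropped and re-added
best <- step(full, direction = "both", trace = 0)
print(formula(best))
```

Note that `step()` selects by AIC on the training data, which measures fit rather than validation error.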

f. Use stepwise selection and an exhaustive search to find the model with the lowest predictive error rate (use the validation data). Which predictors are used?
