Refer to the ROC curve:

As you move along the curve, what changes?
A. The priors in the population
B. The true negative rate in the population
C. The proportion of events in the training data
D. The probability cutoff for scoring
Refer to the exhibit.

Based on the control plot, which conclusion is justified regarding the means of the response?
A. All groups are significantly different from each other.
B. 2XL is significantly different from all other groups.
C. Only XL and 2XL are not significantly different from each other.
D. No groups are significantly different from each other.
Refer to the exhibit:

The box plot was used to analyze daily sales data following three different ad campaigns. The business analyst concludes that one of the assumptions of ANOVA was violated.
Which assumption has been violated and why?
A. Normality, because Prob > F < .0001.
B. Normality, because the interquartile ranges are different in different ad campaigns.
C. Constant variance, because Prob > F < .0001.
D. Constant variance, because the interquartile ranges are different in different ad campaigns.
Screening for non-linearity in binary logistic regression can be achieved by visualizing:
A. A scatter plot of binary response versus a predictor variable.
B. A trend plot of empirical logit versus a predictor variable.
C. A logistic regression plot of predicted probability values versus a predictor variable.
D. A box plot of the odds ratio values versus a predictor variable.
An analyst investigates Region (A, B, or C) as an input variable in a logistic regression model.
The analyst discovers that the probability of purchasing a certain item when Region = A is 1.
What problem does this illustrate?
A. Collinearity
B. Influential observations
C. Quasi-complete separation
D. Problems that arise due to missing values
The question will ask you to provide a missing statement. Given the following SAS program:

Which SAS statement will complete the program to correctly score the data set NEW_DATA?
A. Score data data=MYDIR.NEW_DATA out=scores;
B. Score data data=MYDIR.NEW_DATA output=scores;
C. Score data=HYDIR.NEU_DATA output=scores;
D. Score data=MYDIR, NEW DATA out=scores;
A marketing analyst assessed the effect of web page design (A, B, or C) on customers' intent to purchase an expensive product. The focus group was divided randomly into three sub-groups, each of which was asked to view one of the web pages and then give their intent to purchase on a scale from 0 to 100. The analyst also asked the customers to give their income, which was coded as: I (lowest), II (medium), or III (highest). After analyzing the data, the analyst claimed that there was significant interaction and the webpage design mainly influenced high income people.
Which graph supports the analyst's conclusion?


A. Option A
B. Option B
C. Option C
D. Option D
A non-contributing predictor variable (Pr > |t| = 0.658) is removed from an existing multiple linear regression model. What will be the result?
A. An increase in R-Square
B. A decrease in R-Square
C. A decrease in Mean Square Error
D. No change in R-Square
What is a benefit to performing data cleansing (imputation, transformations, etc.) on data after partitioning the data for honest assessment as opposed to performing the data cleansing prior to partitioning the data?
A. It makes inference on the model possible.
B. It is computationally easier and requires less time.
C. It omits the training (and test) data sets from the benefits of the cleansing methods.
D. It allows for the determination of the effectiveness of the cleansing method.
This question will ask you to provide a missing option.
A business analyst is investigating the differences in sales figures across 8 sales regions. The analyst is interested in viewing the regression equation parameter estimates for each of the design variables.
Which option completes the program to produce the regression equation parameter estimates?

A. Solve
B. Estimate
C. Solution
D. Est