Want to pass your Data Science and Big Data Analytics E20-007 exam in the very first attempt? Try Pass2lead! It is equally effective for both starters and IT professionals.
VCE
In which phase of the data analytics lifecycle do Data Scientists spend the most time in a project?
A. Discovery
B. Data Preparation
C. Model Building
D. Communicate Results
Assume you are performing an analysis to determine fraud detection on credit card usage. You will need to ensure higher-risk transactions. These may indicate that fraudulent credit card activity is retained in your data for analysis and not dropped as outliers during pre- processing.
What is the approach for loading data into the analytical sandbox for this analysis?
A. ELT
B. ETL
C. EDW
D. OLTP
You have fit a decision tree classifier using 12 input variables. The resulting tree used 7 of the 12 variables, and is 5 levels deep. Some of the nodes contain only 3 data points. The AUC of the model is
0.85. What is your evaluation of this model?
A. The tree is probably overfit. Try fitting shallower trees and using an ensemble method.
B. The AUC is high, and the small nodes are all very pure. This is an accurate model.
C. The tree did not split on all the input variables. You need a larger data set to get a more accurate model.
D. The AUC is high, so the overall model is accurate. It is not well-calibrated, because the small nodes will give poor estimates of probability.