Want to pass your Databricks Certified Associate Developer for Apache Spark 3.0 DATABRICKS-CERTIFIED-ASSOCIATE-DEVELOPER-FOR-APACHE-SPARK exam in the very first attempt? Try Pass2lead! It is equally effective for both starters and IT professionals.
VCE
Which of the following DataFrame operators is never classified as a wide transformation?
A. DataFrame.sort()
B. DataFrame.aggregate()
C. DataFrame.repartition()
D. DataFrame.select()
E. DataFrame.join()
The code block displayed below contains an error. When the code block below has executed, it should have divided DataFrame transactionsDf into 14 parts, based on columns storeId and
transactionDate (in this order). Find the error.
Code block:
transactionsDf.coalesce(14, ("storeId", "transactionDate"))
A. The parentheses around the column names need to be removed and .select() needs to be appended to the code block.
B. Operator coalesce needs to be replaced by repartition, the parentheses around the column names need to be removed, and .count() needs to be appended to the code block.
C. Operator coalesce needs to be replaced by repartition, the parentheses around the column names need to be removed, and .select() needs to be appended to the code block.
D. Operator coalesce needs to be replaced by repartition and the parentheses around the column names need to be replaced by square brackets.
E. Operator coalesce needs to be replaced by repartition.
Which of the following code blocks displays various aggregated statistics of all columns in DataFrame transactionsDf, including the standard deviation and minimum of values in each column?
A. transactionsDf.summary()
B. transactionsDf.agg("count", "mean", "stddev", "25%", "50%", "75%", "min")
C. transactionsDf.summary("count", "mean", "stddev", "25%", "50%", "75%", "max").show()
D. transactionsDf.agg("count", "mean", "stddev", "25%", "50%", "75%", "min").show()
E. transactionsDf.summary().show()