A data analyst has set up a SQL query to run every four hours on a SQL endpoint, but the SQL endpoint is taking too long to start up with each run.
Which of the following changes can the data analyst make to reduce the start-up time for the endpoint while managing costs?
A. Reduce the SQL endpoint cluster size
B. Increase the SQL endpoint cluster size
C. Turn off the Auto stop feature
D. Increase the minimum scaling value
E. Use a Serverless SQL endpoint
A data engineering team has created a Structured Streaming pipeline that processes data in micro-batches and populates gold-level tables. The micro-batches are triggered every minute.
A data analyst has created a dashboard based on this gold-level data. The project stakeholders want to see the results in the dashboard updated within one minute or less of new data becoming available within the gold-level tables.
Which of the following cautions should the data analyst share prior to setting up the dashboard to complete this task?
A. The required compute resources could be costly
B. The gold-level tables are not appropriately clean for business reporting
C. The streaming data is not an appropriate data source for a dashboard
D. The streaming cluster is not fault tolerant
E. The dashboard cannot be refreshed that quickly
Which of the following approaches can be used to ingest data directly from cloud-based object storage?
A. Create an external table while specifying the DBFS storage path to FROM
B. Create an external table while specifying the DBFS storage path to PATH
C. It is not possible to directly ingest data from cloud-based object storage
D. Create an external table while specifying the object storage path to FROM
E. Create an external table while specifying the object storage path to LOCATION
After running DESCRIBE EXTENDED accounts.customers;, the following was returned:
Now, a data analyst runs the following command:
DROP TABLE accounts.customers;
Which of the following describes the result of running this command?
A. Running SELECT * FROM delta.`dbfs:/stakeholders/customers` results in an error.
B. Running SELECT * FROM accounts.customers will return all rows in the table.
C. All files with the .customers extension are deleted.
D. The accounts.customers table is removed from the metastore, and the underlying data files are deleted.
E. The accounts.customers table is removed from the metastore, but the underlying data files are untouched.
Which of the following should data analysts consider when working with personally identifiable information (PII) data?
A. Organization-specific best practices for PII data
B. Legal requirements for the area in which the data was collected
C. None of these considerations
D. Legal requirements for the area in which the analysis is being performed
E. All of these considerations
Which of the following is an advantage of using a Delta Lake-based data lakehouse over common data lake solutions?
A. ACID transactions
B. Flexible schemas
C. Data deletion
D. Scalable storage
E. Open-source formats
A data analyst has been asked to count the number of customers in each region and has written the following query:

If there is a mistake in the query, which of the following describes the mistake?
A. The query is using count(*), which will count all the customers in the customers table, no matter the region.
B. The query is missing a GROUP BY region clause.
C. The query is using ORDER BY, which is not allowed in an aggregation.
D. There are no mistakes in the query.
E. The query is selecting region, but region should only occur in the ORDER BY clause.
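For reference, the sketch below shows one common way to count customers per region. It is not the query from the question, which is not reproduced here. The table contents, the customer_id column, and the number_of_customers alias are assumptions made for illustration, and SQLite (via Python's sqlite3 module) stands in for the Databricks SQL engine, so engine-specific behavior may differ.

```python
import sqlite3

# Hypothetical data: table contents and the customer_id column are assumptions.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customers (customer_id INTEGER, region TEXT)")
conn.executemany(
    "INSERT INTO customers VALUES (?, ?)",
    [(1, "east"), (2, "west"), (3, "east"), (4, "north"), (5, "west"), (6, "east")],
)

# GROUP BY yields one row per region; count(*) counts the rows inside each group.
# ORDER BY only sorts the grouped result and is independent of the aggregation.
rows = conn.execute(
    """
    SELECT region, count(*) AS number_of_customers
    FROM customers
    GROUP BY region
    ORDER BY region
    """
).fetchall()

for region, number_of_customers in rows:
    print(region, number_of_customers)
```

With this sample data, the result holds one row per distinct region rather than a single overall count.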
Which of the following statements about a refresh schedule is incorrect?
A. A query can be refreshed anywhere from 1 minute to 2 weeks.
B. Refresh schedules can be configured in the Query Editor.
C. A query being refreshed on a schedule does not use a SQL Warehouse (formerly known as SQL Endpoint).
D. A refresh schedule is not the same as an alert.
E. You must have workspace administrator privileges to configure a refresh schedule.
A data analyst creates a Databricks SQL Query where the result set has the following schema:
region STRING
number_of_customer INT
When the analyst clicks on the "Add visualization" button on the SQL Editor page, which of the following types of visualizations will be selected by default?
A. Violin Chart
B. Line Chart
C. Bar Chart
D. Histogram
E. There is no default. The user must choose a visualization type.
A data analyst has been asked to produce a visualization that shows the flow of users through a website. Which of the following is used for visualizing this type of flow?
A. Heatmap
B. Choropleth
C. Word Cloud
D. Pivot Table
E. Sankey