HOTSPOT
You have an Azure Synapse Analytics serverless SQL pool and an Azure Data Lake Storage Gen2 account.
You need to query all the files in the `csv/taxi/' folder and all its subfolders. All the files are in CSV format and have a header row.
How should you complete the query? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.
Hot Area:
You use the Vertipaq Analyzer to analyze tables in a dataset as shown in the Tables exhibit. (Click the Tables tab.)
The table relationships for the dataset are shown in the Relationships exhibit. (Click the Relationships tab.)
You need to reduce the model size by eliminating invalid relationships. Which column should you remove?
A. Sales[Sales Amount]
B. Sales[RowlD]
C. Sales[Sales ID]
D. Plan[RowlD]
You have an Azure subscription that contains an Azure Synapse Analytics workspace. You create an Azure Data Lake Storage Gen2 account and upload a CSV file named Filel.csv. You need to use Synapse Studio to query the data in Filel.csv by using a serverless SQL pool. Which Transact-SQL operator should you include in the query?
A. STRIMO_SPLIT
B. OPENOUERY
C. OPCNROWSET
D. OPEMDATASOURCE
You have a deployment pipeline for a Power BI workspace. The workspace contains two datasets that use import storage mode.
A database administrator reports a drastic increase in the number of queries sent from the Power BI service to an Azure SQL database since the creation of the deployment pipeline.
An investigation into the issue identifies the following:
One of the datasets is larger than 1 GB and has a fact table that contains more than 500 million rows. When publishing dataset changes to development, test, or production pipelines, a refresh is triggered against the entire dataset.
You need to recommend a solution to reduce the size of the queries sent to the database when the dataset changes are published to development, test, or production.
What should you recommend?
A. From Capacity settings in the Power Bl Admin portal, reduce the Max Intermediate Row Set Count setting.
B. Configure the dataset to use a composite model that has a DirectQuery connection to the fact table.
C. Enable the large dataset storage format for workspace.
D. From Capacity settings in the Power Bl Admin portal, increase the Max Intermediate Row Set Count setting.
You use an Apache Spark notebook in Azure Synapse Analytics to filter and transform data.
You need to review statistics for a DataFrame that includes:
The column name The column type The number of distinct values Whether the column has missing values
Which function should you use?
A. displayHTML()
B. display(df, summary=true)
C. %%configure
D. display(df)
E. %%lsmagic
You have a Power BI Premium capacity.
From the Power BI Premium Capacity Metrics app, you discover the following:
There is insufficient CPU to execute dataset refreshes.
Out-of-memory throttling occurs when the dataset is waiting.
You need to recommend a solution to resolve the performance issues.
Solution: You move the datasets to a larger capacity.
Does this meet the goal?
A. Yes
B. No
You have a Power BI tenant and an Azure subscription named Sub1. The Power BI tenant and Sub1 are linked to a single Azure AD tenant.
In Sub1, you create a storage account named storage1.
You need to configure a Power BI workspace to store dataflows in storage1. The solution must use the principle of least privilege.
Which three roles should you assign for storage1? Each correct answer presents part of the solution.
NOTE: Each correct selection is worth one point.
A. Reader
B. Storage Blob Data Contributor
C. Contributor
D. Owner
E. Storage Blob Data Owner
F. Storage Blob Data Reader
You have a PostgreSQL database named db1.
You have a group of data analysts that will create Power BI datasets. Each analyst will use data from a different schema in db1.
You need to simplify the process for the analysts to initially connect to db1 when using Power BI Desktop.
Which type of file should you use?
A. PBIT
B. PBIX
C. PBIDS
From Power Query Editor, you profile the data shown in the following exhibit.
The IoT GUID and IoT ID columns are unique to each row in the query.
You need to analyze IoT events by the hour and day of the year. The solution must improve dataset performance.
Solution: You split the IoT DateTime column into a column named Date and a column named Time.
Does this meet the goal?
A. Yes
B. No
You need to use Power BI to ingest data from an API. The API requires that an API key be passed in the headers of the request. Which type of authentication should you use?
A. organizational account
B. Basic
C. Web API
D. Anonymous