Azure Enterprise Data Analyst Associate DP-500 Practice Test Engine: Try These 162 Exam Questions
NEW QUESTION # 30
After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
You are using an Azure Synapse Analytics serverless SQL pool to query a collection of Apache Parquet files by using automatic schema inference. The files contain more than 40 million rows of UTF-8-encoded business names, survey names, and participant counts. The database is configured to use the default collation.
The queries use OPENROWSET and infer the schema shown in the following table.
You need to recommend changes to the queries to reduce I/O reads and tempdb usage.
Solution: You recommend defining a data source and view for the Parquet files. You recommend updating the query to use the view.
Does this meet the goal?
- A. Yes
- B. No
Answer: B
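Explanation:
Defining a data source and a view does not change the inferred column types. Instead, recommend using OPENROWSET with a WITH clause to explicitly specify the maximum length for businessName and surveyName.
The inferred varchar(8000) columns are far larger than the data requires. Declaring smaller sizes reduces I/O reads and tempdb usage.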
A SELECT...FROM OPENROWSET(BULK...) statement queries the data in a file directly, without importing the data into a table. SELECT...FROM OPENROWSET(BULK...) statements can also list bulk-column aliases by using a format file to specify column names, and also data types.
Reference: https://docs.microsoft.com/en-us/sql/t-sql/functions/openrowset-transact-sql
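As a rough sketch of the OPENROWSET ... WITH recommendation above, the Python snippet below assembles a serverless SQL query with an explicit schema. The storage path, the participantCount column name, the VARCHAR(200) lengths, and the helper name build_openrowset_query are all assumptions made for illustration; only businessName and surveyName come from the question. A UTF-8 collation is declared on the string columns because the files are UTF-8 encoded and the database uses the default collation.

```python
# Hypothetical storage path; placeholders must be replaced with real values.
PARQUET_PATH = "https://<storage-account>.dfs.core.windows.net/<container>/surveys/*.parquet"

# Explicit schema as (column name, T-SQL type).
# The lengths and the participantCount name are assumed, not taken from the exam.
COLUMNS = [
    ("businessName", "VARCHAR(200) COLLATE Latin1_General_100_BIN2_UTF8"),
    ("surveyName", "VARCHAR(200) COLLATE Latin1_General_100_BIN2_UTF8"),
    ("participantCount", "INT"),
]


def build_openrowset_query(path, columns):
    """Return a serverless SQL query that declares column types in a WITH clause."""
    select_list = ", ".join(name for name, _ in columns)
    with_clause = ",\n        ".join(f"{name} {sql_type}" for name, sql_type in columns)
    return (
        f"SELECT {select_list}\n"
        "FROM OPENROWSET(\n"
        f"        BULK '{path}',\n"
        "        FORMAT = 'PARQUET'\n"
        "    ) WITH (\n"
        f"        {with_clause}\n"
        "    ) AS survey_rows;"
    )


query = build_openrowset_query(PARQUET_PATH, COLUMNS)
print(query)
```

The generated statement can be pasted into a serverless SQL pool query window. The point of the sketch is only that every string column gets a declared length instead of the inferred varchar(8000).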
NEW QUESTION # 31
You use Azure Synapse Analytics and Apache Spark notebooks to You need to use PySpark to gain access to the visual libraries. Which Python libraries should you use?
- A. Matplotlib only
- B. TensorFlow only
- C. Seaborn only
- D. Matplotlib and TensorFlow
- E. Seaborn and TensorFlow
- F. Matplotlib and Seaborn
Answer: F
Explanation:
Matplotlib
You can render standard plotting libraries, like Matplotlib, using the built-in rendering functions for each library.
Matplotlib is a plotting library for the Python programming language and its numerical mathematics extension NumPy.
Additional libraries
Beyond these libraries, the Azure Synapse Analytics Runtime also includes the following set of libraries that are often used for data visualization:
Seaborn
Seaborn is a Python data visualization library based on matplotlib. It provides a high-level interface for drawing attractive and informative statistical graphics.
Reference:
https://seaborn.pydata.org/
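As a rough illustration of answer F, the sketch below draws one chart with Matplotlib and one with Seaborn. The sample data and column names are made up. In a Synapse notebook the data would typically come from a Spark DataFrame converted with toPandas(). The Agg backend line is an assumption added only so the sketch also runs outside a notebook.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend; only needed outside a notebook
import matplotlib.pyplot as plt
import pandas as pd
import seaborn as sns

# Made-up sample data standing in for something like spark_df.toPandas()
df = pd.DataFrame({
    "survey": ["A", "B", "C"],
    "participants": [120, 85, 140],
    "score": [3.9, 4.2, 3.5],
})

fig, (ax_left, ax_right) = plt.subplots(1, 2, figsize=(10, 4))

# Matplotlib: plain bar chart
ax_left.bar(df["survey"], df["participants"])
ax_left.set_title("Participants per survey (Matplotlib)")

# Seaborn: scatter plot drawn onto a Matplotlib axis
sns.scatterplot(data=df, x="participants", y="score", hue="survey", ax=ax_right)
ax_right.set_title("Score vs. participants (Seaborn)")

fig.tight_layout()
```

Seaborn draws onto Matplotlib axes, which is why the two libraries are normally used together rather than as alternatives.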
NEW QUESTION # 32
You need to recommend an automated solution to monitor Power BI user activity. The solution must meet the following requirements:
* Security admins must identify when users export reports from Power BI within five days of a new sensitivity label being applied to the artifacts in Power BI.
* Power BI admins must identify updates or changes to the Power BI capacity.
* The principle of least privilege must be used.
Which log should you include in the recommendation for each group? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.
Answer:
Explanation:
Box 1: the unified audit log in Microsoft 365
Security admins must identify when users export reports from Power BI within five days of a new sensitivity label being applied to the artifacts in Power BI.
Use the audit log
If your task is to track user activities across Power BI and Microsoft 365, you work with auditing in Microsoft 365 compliance or use PowerShell. Auditing relies on functionality in Exchange Online, which automatically supports Power BI.
You can filter the audit data by date range, user, dashboard, report, dataset, and activity type. You can also download the activities in a csv (comma-separated value) file to analyze offline.
Box 2: Power BI activity log
Power BI admins must identify updates or changes to the Power BI capacity.
Use the activity log
Power BI administrators can analyze usage for all Power BI resources at the tenant level by using custom reports that are based on the Power BI activity log.
Reference: https://docs.microsoft.com/en-us/power-bi/admin/service-admin-auditing
NEW QUESTION # 33
You are creating an external table by using an Apache Spark pool in Azure Synapse Analytics. The table will contain more than 20 million rows partitioned by date. The table will be shared with the SQL engines.
You need to minimize how long it takes for a serverless SQL pool to execute a query against the table.
In which file format should you recommend storing the table data?
- A. Apache Parquet
- B. Delta
- C. CSV
- D. JSON
Answer: A
Explanation:
Prepare files for querying
If possible, you can prepare files for better performance:
* Convert large CSV and JSON files to Parquet. Parquet is a columnar format. Because it's compressed, its file sizes are smaller than CSV or JSON files that contain the same data. Serverless SQL pool skips the columns and rows that aren't needed in a query if you're reading Parquet files. Serverless SQL pool needs less time and fewer storage requests to read it.
Reference: https://docs.microsoft.com/en-us/azure/synapse-analytics/sql/best-practices-serverless-sql-pool
https://stackoverflow.com/questions/65320949/parquet-vs-delta-format-in-azure-data-lake-gen-2-store
NEW QUESTION # 34
You have new security and governance protocols for Power BI reports and datasets. The new protocols must meet the following requirements.
* New reports can be embedded only in locations that require authentication.
* Live connections are permitted only for workspaces that use Premium capacity datasets.
Which three actions should you recommend performing in the Power BI Admin portal? Each correct answer presents part of the solution. NOTE: Each correct selection is worth one point.
- A. From the Premium per user settings, set XMLA Endpoint to Off.
- B. From Embed Codes, delete all the codes.
- C. From Tenant settings, set Publish to web to Disable.
- D. From Tenant settings, disable Allow XMLA endpoints and Analyze in Excel with on-premises datasets.
- E. From Capacity settings, set XMLA Endpoint to Read Write.
Answer: C,D,E
Explanation:
Reference:
https://powerbi.microsoft.com/en-us/blog/power-bi-february-service-update
NEW QUESTION # 35
You use Vertipaq Analyzer to analyze a model.
The Relationships tab contains the results shown in the following exhibit.
Use the drop-down menus to select the answer choice that completes each statement based on the information presented in the graphic.
NOTE: Each correct selection is worth one point.
Answer:
Explanation:
Box 1: Customer
There are 1804 invalid rows (records) in the Customer table.
Box 2: 1804
There is a very good video explaining the concept on
https://www.youtube.com/watch?v=XvZyo9Kgyio&list=PLU6II7MW-aiIREc1XdCYviQEj4Ui2pNDS&index=
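To make the "invalid rows" figure more concrete: in VertiPaq Analyzer, invalid rows on a relationship are rows on the many side whose key has no matching row on the one side. The sketch below counts them for a tiny made-up example. The key values and variable names are hypothetical and are not taken from the exhibit.

```python
# Hypothetical keys on the many side of a relationship (one entry per row)
many_side_keys = [1, 2, 2, 7, 7, 7, 9, 3]

# Hypothetical keys present on the one side of the relationship
one_side_keys = {1, 2, 3}

# Invalid rows: many-side rows whose key has no match on the one side
invalid_rows = sum(1 for key in many_side_keys if key not in one_side_keys)

# Missing keys: distinct key values that have no match on the one side
missing_keys = len(set(many_side_keys) - one_side_keys)

print(invalid_rows, missing_keys)
```

Here three rows carry key 7 and one row carries key 9, so four rows are invalid while only two distinct keys are missing.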
NEW QUESTION # 36
Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.
After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
From Power Query Editor, you profile the data shown in the following exhibit.
The IoT GUID and IoT ID columns are unique to each row in the query.
You need to analyze IoT events by the hour and day of the year. The solution must improve dataset performance.
Solution: You change the IoT DateTime column to the date data type.
Does this meet the goal?
- A. Yes
- B. No
Answer: B
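Converting the column to the date data type drops the time portion, so analysis by hour is no longer possible. A common alternative, sketched below under assumed data, is to split the value into a date column and an hour column: both grains remain available and each column has far fewer distinct values than the original timestamp. The event timestamps here are generated for illustration (one event per second over two days) and do not come from the exhibit.

```python
from datetime import datetime, timedelta

# Made-up events: one per second over two days
start = datetime(2024, 1, 1)
events = [start + timedelta(seconds=s) for s in range(2 * 24 * 60 * 60)]

# Cardinality of the original datetime column
distinct_datetimes = len(set(events))

# Cardinality after splitting into a date column and an hour column
distinct_dates = len({e.date() for e in events})
distinct_hours = len({e.hour for e in events})

print(distinct_datetimes, distinct_dates, distinct_hours)
```

For this sample the single datetime column holds 172800 distinct values, while the split columns hold 2 and 24. Lower cardinality generally compresses better in an imported model.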
NEW QUESTION # 37
You need to create the customized Power BI usage reporting. The Usage Metrics Report dataset has already been created. The solution must minimize development and administrative effort.
Which four actions should you perform in sequence? To answer, move the appropriate actions from the list of actions to the answer area and arrange them in the correct order.
Answer:
Explanation:
1 - From powerbi.com, create a new report.
2 - Add a report measure
3 - Add visuals to the report
4 - Publish the report to the Sales Analytics workspace
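The report measure in step 2 presumably computes the month-over-month percent change in report viewers that the Litware case study asks for. The arithmetic behind such a measure can be sketched outside DAX as follows. The monthly viewer counts and the function name percent_change are invented for illustration.

```python
def percent_change(previous, current):
    """Percent change from previous to current; None when previous is zero."""
    if previous == 0:
        return None
    return (current - previous) * 100 / previous


# Made-up distinct viewer counts per month
monthly_viewers = {"2024-01": 120, "2024-02": 150, "2024-03": 135}

months = sorted(monthly_viewers)
changes = {
    month: percent_change(monthly_viewers[prev], monthly_viewers[month])
    for prev, month in zip(months, months[1:])
}

print(changes)
```

With these numbers the change is 25 percent for February and minus 10 percent for March. The first month has no previous value and is left out.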
NEW QUESTION # 38
You need to implement object-level security (OLS) in the Power BI dataset for the sales associates.
Which three actions should you perform in sequence? To answer, move the appropriate actions from the list of actions to the answer area and arrange them in the correct order.
Answer:
Explanation:
1 - From Power BI Desktop, create a role for the sales associates.
2 - From Tabular Editor, set Object Level Security to None for the Customer[Email] column and save the changes.
3 - From Power BI Desktop, publish the dataset to the Sales Analytics workspace.
NEW QUESTION # 39
You are using an Azure Synapse notebook to create a Python visual.
You run the following code cell to import a dataset named Iris.
A sample of the data is shown in the following table.
You need to create the visual shown in the exhibit. (Click the Exhibit tab.)
How should you complete the Python code? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.
Answer:
Explanation:
NEW QUESTION # 40
You have an Azure Synapse Analytics serverless SQL pool.
You need to return a list of files and the number of rows in each file.
How should you complete the Transact-SQL statement? To answer, drag the appropriate values to the targets. Each value may be used once, more than once, or not at all. You may need to drag the split bar between panes or scroll to view content.
NOTE: Each correct selection is worth one point.
Answer:
Explanation:
Reference:
https://docs.microsoft.com/en-us/azure/synapse-analytics/sql/develop-openrowset
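The answer exhibit is not reproduced here. As a hedged sketch of the general technique, serverless SQL pool exposes a filename() function on the OPENROWSET alias, and grouping by it yields one row count per file. The Python helper below only assembles such a statement. The storage path, the Parquet format, the alias src, and the helper name build_file_rowcount_query are assumptions.

```python
# Hypothetical storage path; placeholders must be replaced with real values.
FILES_PATH = "https://<storage-account>.dfs.core.windows.net/<container>/data/*.parquet"


def build_file_rowcount_query(path):
    """Return a serverless SQL query that counts rows per source file."""
    return (
        "SELECT src.filename() AS file_name, COUNT_BIG(*) AS row_count\n"
        "FROM OPENROWSET(\n"
        f"        BULK '{path}',\n"
        "        FORMAT = 'PARQUET'\n"
        "    ) AS src\n"
        "GROUP BY src.filename();"
    )


rowcount_query = build_file_rowcount_query(FILES_PATH)
print(rowcount_query)
```

COUNT_BIG is used rather than COUNT so that very large files do not overflow an int result.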
NEW QUESTION # 41
You discover a poorly performing measure in a Power BI data model.
You need to review the query plan to analyze the amount of time spent in the storage engine and the formula engine.
What should you use?
- A. Performance analyzer in Power BI Desktop
- B. DAX Studio
- C. Vertipaq Analyzer
- D. Tabular Editor
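Answer: B
Explanation:
DAX Studio can capture the query plan and server timings for a DAX query, which break the elapsed time down into storage engine and formula engine time. Performance analyzer in Power BI Desktop reports only the DAX query, visual display, and other durations for each visual, so it is a starting point for finding the slow query rather than a tool for analyzing its plan.
Monitor report performance in Power BI Desktop using the Performance Analyzer. Monitoring will help you learn where the bottlenecks are, and how you can improve report performance.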
Monitoring performance is relevant in the following situations:
Your Import data model refresh is slow.
Your DirectQuery or Live Connection reports are slow.
Your model calculations are slow.
Slow queries or report visuals should be a focal point of continued optimization.
Reference: https://docs.microsoft.com/en-us/power-bi/guidance/monitor-report-performance
Topic 2, Litware, Inc. Overview
Litware, Inc. is a retail company that sells outdoor recreational goods and accessories. The company sells goods both online and at its stores located in six countries.
Azure Resources
Litware has the following Azure resources:
* An Azure Synapse Analytics workspace named synapseworkspace1
* An Azure Data Lake Storage Gen2 account named datalake1 that is associated with synapseworkspace1
* A Synapse Analytics dedicated SQL pool named SQLDW
Dedicated SQL Pool
SQLDW contains a dimensional model that contains the following table.
SQLDW contains the following additional tables.
SQLDW contains a view named dbo.CustomerPurchases that creates a distinct list of values from dbo.Customer[CustomerID], dbo.Customer[CustomerEmail], dbo.Product[ProductID], and dbo.Product[ProductName].
The sales data in SQLDW is updated every 30 minutes. Records in dbo.SalesTransactions are updated in SQLDW up to three days after being created. The records do NOT change after three days.
Power BI
Litware has a new Power BI tenant that contains an empty workspace named Sales Analytics.
All users have Power BI Premium per user licenses.
IT data analysts are workspace administrators. The IT data analysts will create datasets and reports.
A single imported dataset will be created to support the company's sales analytics goals. The dataset will be refreshed every 30 minutes.
Analytics Goals
Litware identifies the following analytics goals:
* Provide historical reporting of sales by product and channel over time.
* Allow sales managers to perform ad hoc sales reporting with minimal effort.
* Perform market basket analysis to understand which products are commonly purchased in the same transaction.
* Identify which customers should receive promotional emails based on their likelihood of purchasing promoted products.
Litware plans to monitor the adoption of Power BI reports over time. The company wants custom Power BI usage reporting that includes the percent change of users that view reports in the Sales Analytics workspace each month.
Security Requirements
Litware identifies the following security requirements for the analytics environment:
* All the users in the sales department and the marketing department must be able to see Power BI reports that contain market basket analysis and data about which customers are likely to purchase a product.
* Customer contact data in SQLDW and the Power BI dataset must be labeled as Sensitive. Records must be kept of any users that use the sensitive data.
* Sales associates must be prevented from seeing the CustomerEmail column in Power BI reports.
* Sales managers must be prevented from modifying reports created by other users.
Development Process Requirements
Litware identifies the following development process requirements:
* SQLDW and datalake1 will act as the development environment. Once feature development is complete, all entities in synapseworkspace1 will be promoted to a test workspace, and then to a production workspace.
* Power BI content must be deployed to test and production by using deployment pipelines.
* All SQL scripts must be stored in Azure Repos.
The IT data analysts prefer to build Power BI reports in Synapse Studio.
NEW QUESTION # 42
You are configuring a Power BI report for accessibility as shown in the following table.
You need to change the default colors of all three visuals to make the report more accessible to users who have color vision deficiency. Which two settings should you configure in the Customize theme window? Each correct answer presents part of the solution. NOTE: Each correct selection is worth one point.
- A. First-level elements colors
- B. Theme colors
- C. Sentiment colors
- D. Divergent colors
Answer: B,C
NEW QUESTION # 43
You have a deployment pipeline for a Power BI workspace. The workspace contains two datasets that use import storage mode.
A database administrator reports a drastic increase in the number of queries sent from the Power BI service to an Azure SQL database since the creation of the deployment pipeline.
An investigation into the issue identifies the following:
One of the datasets is larger than 1 GB and has a fact table that contains more than 500 million rows.
When publishing dataset changes to development, test, or production pipelines, a refresh is triggered against the entire dataset.
You need to recommend a solution to reduce the size of the queries sent to the database when the dataset changes are published to development, test, or production.
What should you recommend?
- A. Create a dataset parameter to reduce the fact table row count in the development and test pipelines.
- B. Enable the large dataset storage format for the workspace.
- C. In the dataset, change the fact table from an import table to a hybrid table.
- D. Turn off auto refresh when publishing the dataset changes to the Power BI service.
Answer: C
Explanation:
Hybrid tables
Hybrid tables are tables with incremental refresh that can have both import and direct query partitions. During a clean deployment, both the refresh policy and the hybrid table partitions are copied. When deploying to a pipeline stage that already has hybrid table partitions, only the refresh policy is copied. To update the partitions, refresh the table.
Refreshes are faster - Only the most recent data that has changed needs to be refreshed.
Reference: https://docs.microsoft.com/en-us/power-bi/create-reports/deployment-pipelines-best-practices
NEW QUESTION # 44
You have a Power BI tenant that is configured as shown in the following exhibit.
Use the drop-down menus to select the answer choice that completes each statement based on the information presented in the graphic. NOTE: Each correct answer is worth one point.
Answer:
Explanation:
NEW QUESTION # 45
You are planning a Power BI solution for a customer.
The customer will have 200 Power BI users. The customer identifies the following requirements:
* Ensure that all the users can create paginated reports.
* Ensure that the users can create reports containing AI visuals.
* Provide autoscaling of the CPU resources during heavy usage spikes.
You need to recommend a Power BI solution for the customer. The solution must minimize costs. What should you recommend?
- A. Power BI Premium per user
- B. Power BI Report Server
- C. Power BI Premium per capacity
- D. Power BI Pro per user
Answer: A
Explanation:
Announcing Power BI Premium Per User general availability and autoscale preview for Gen2.
Power BI Premium per user features and capabilities
* Pixel perfect paginated reports are available for operational reporting capabilities based on SSRS technology. Users can create highly formatted reports in various formats such as PDF and PPT, which are embeddable in applications and are designed to be printed or shared.
* Automated machine learning (AutoML) in Power BI enables business users to build ML models to predict outcomes without having to write any code.
* Etc.
Note:
Power BI empowers every business user and business analyst to get amazing insights with AI infused experiences. With Power BI Premium, we enable business analysts to not only analyze and visualize their data, but to also build an end-to-end data platform through drag and drop experiences. Everything from ingesting and transforming data at scale, to building automated machine learning models, and analyzing massive volumes of data is now possible for our millions of business analysts.
The Microsoft DP-500 exam covers a wide range of topics related to analytics solutions, including designing and implementing data storage solutions, designing and implementing data processing solutions, designing and implementing data security solutions, and designing and implementing data visualization solutions. It also covers topics such as data integration, data cleansing, and data transformation, as well as best practices for data governance and compliance.