New Year Special Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: 70special

Databricks Databricks-Certified-Data-Analyst-Associate Databricks Certified Data Analyst Associate Exam Exam Practice Test

Databricks Certified Data Analyst Associate Exam Questions and Answers

Testing Engine

  • Product Type: Testing Engine
$37.5  $124.99

PDF Study Guide

  • Product Type: PDF Study Guide
$33  $109.99
Question 1

In which of the following situations will the mean value and median value of variable be meaningfully different?

Options:

A.

When the variable contains no outliers

B.

When the variable contains no missing values

C.

When the variable is of the boolean type

D.

When the variable is of the categorical type

E.

When the variable contains a lot of extreme outliers

Question 2

A data analyst has been asked to count the number of customers in each region and has written the following query:

If there is a mistake in the query, which of the following describes the mistake?

Options:

A.

The query is using count('). which will count all the customers in the customers table, no matter the region.

B.

The query is missing a GROUP BY region clause.

C.

The query is using ORDER BY. which is not allowed in an aggregation.

D.

There are no mistakes in the query.

E.

The query is selecting region but region should only occur in the ORDER BY clause.

Question 3

A data analyst has a managed table table_name in database database_name. They would now like to remove the table from the database and all of the data files associated with the table. The rest of the tables in the database must continue to exist.

Which of the following commands can the analyst use to complete the task without producing an error?

Options:

A.

DROP DATABASE database_name;

B.

DROP TABLE database_name.table_name;

C.

DELETE TABLE database_name.table_name;

D.

DELETE TABLE table_name FROM database_name;

E.

DROP TABLE table_name FROM database_name;

Question 4

Which of the following is a benefit of Databricks SQL using ANSI SQL as its standard SQL dialect?

Options:

A.

It has increased customization capabilities

B.

It is easy to migrate existingSQL queries to Databricks SQL

C.

It allows for the use of Photon's computation optimizations

D.

It is more performant than other SQL dialects

E.

It is more compatible with Spark's interpreters

Question 5

A data analyst has created a Query in Databricks SQL, and now they want to create two data visualizations from that Query and add both of those data visualizations to the same Databricks SQL Dashboard.

Which of the following steps will they need to take when creating and adding both data visualizations to the Databricks SQL Dashboard?

Options:

A.

They will need to alter the Query to return two separate sets of results.

B.

They will need to add two separate visualizations to the dashboard based on the same Query.

C.

They will need to create two separate dashboards.

D.

They will need to decide on a single data visualization to add to the dashboard.

E.

They will need to copy the Query and create one data visualization per query.

Question 6

A data team has been given a series of projects by a consultant that need to be implemented in the Databricks Lakehouse Platform.

Which of the following projects should be completed in Databricks SQL?

Options:

A.

Testing the quality of data as it is imported from a source

B.

Tracking usage of feature variables for machine learning projects

C.

Combining two data sources into a single, comprehensive dataset

D.

Segmenting customers into like groups using a clustering algorithm

E.

Automatingcomplex notebook-based workflows with multiple tasks

Question 7

A data analyst has been asked to provide a list of options on how to share a dashboard with a client. It is a security requirement that the client does not gain access to any other information, resources, or artifacts in the database.

Which of the following approaches cannot be used to share the dashboard and meet the security requirement?

Options:

A.

Download the Dashboard as a PDF and share it with the client.

B.

Set a refresh schedule for the dashboard and enter the client's email address in the "Subscribers" box.

C.

Take a screenshot of the dashboard and share it with the client.

D.

Generate a Personal Access Token that is good for 1 day and share it with the client.

E.

Download a PNG file of the visualizations in the dashboard and share them with the client.

Question 8

An analyst writes a query that contains a query parameter. They then add an area chart visualization to the query. While adding the area chart visualization to a dashboard, the analyst chooses "Dashboard Parameter" for the query parameter associated with the area chart.

Which of the following statements is true?

Options:

A.

The area chart will use whatever is selected in the Dashboard Parameter while all or the other visualizations will remain changed regardless of their parameter use.

B.

The area chart will use whatever is selected in the Dashboard Parameter along with all of the other visualizations in the dashboard that use the same parameter.

C.

The area chart will use whatever value is chosen on the dashboard at the time the area chart is added to the dashboard.

D.

The area chart will use whatever value is input by the analyst when the visualization is added to the dashboard. The parameter cannot be changed by the user afterwards.

E.

The area chart will convert to a Dashboard Parameter.

Question 9

A data analyst created and is the owner of the managed table my_ table. They now want to change ownership of the table to a single other user using Data Explorer.

Which of the following approaches can the analyst use to complete the task?

Options:

A.

Edit the Owner field in the table page by removing their own account

B.

Edit the Owner field in the table page by selecting All Users

C.

Edit the Owner field in the table page by selecting the new owner's account

D.

Edit the Owner field in the table page by selecting the Admins group

E.

Edit the Owner field in the table page by removing all access

Question 10

A data analyst has created a user-defined function using the following line of code:

CREATE FUNCTION price(spend DOUBLE, units DOUBLE)

RETURNS DOUBLE

RETURN spend / units;

Which of the following code blocks can be used to apply this function to the customer_spend and customer_units columns of the table customer_summary to create column customer_price?

Options:

A.

SELECT PRICE customer_spend, customer_units AS customer_price FROM customer_summary

B.

SELECT price FROM customer_summary

C.

SELECT function(price(customer_spend, customer_units)) AS customer_price FROM customer_summary

D.

SELECT double(price(customer_spend, customer_units)) AS customer_price FROM customer_summary

E.

SELECT price(customer_spend, customer_units) AS customer_price FROM customer_summary

Question 11

The stakeholders.customers table has 15 columns and 3,000 rows of data. The following command is run:

After runningSELECT * FROM stakeholders.eur_customers, 15 rows are returned. After the command executes completely, the user logs out of Databricks.

After logging back in two days later, what is the status of thestakeholders.eur_customersview?

Options:

A.

The view remains available and SELECT * FROM stakeholders.eur_customers will execute correctly.

B.

The view has been dropped.

C.

The view is not available in the metastore, but the underlying data can be accessed with SELECT * FROM delta. `stakeholders.eur_customers`.

D.

The view remains available but attempting to SELECT from it results in an empty result set because data in views are automatically deleted after logging out.

E.

The view has been converted into a table.

Question 12

Data professionals with varying titles use the Databricks SQL service as the primary touchpoint with the Databricks Lakehouse Platform. However, some users will use other services like Databricks Machine Learning or Databricks Data Science and Engineering.

Which of the following roles uses Databricks SQL as a secondary service while primarily using one of the other services?

Options:

A.

Business analyst

B.

SQL analyst

C.

Data engineer

D.

Business intelligence analyst

E.

Data analyst

Question 13

Which of the following layers of the medallion architecture is most commonly used by data analysts?

Options:

A.

None of these layers are used by data analysts

B.

Gold

C.

All of these layers are used equally by data analysts

D.

Silver

E.

Bronze