Associate-Data-Practitioner Google Cloud Associate Data Practitioner (ADP Exam) Questions and Answers

Questions 4

You are designing a pipeline to process data files that arrive in Cloud Storage by 3:00 am each day. Data processing is performed in stages, where the output of one stage becomes the input of the next. Each stage takes a long time to run. Occasionally a stage fails, and you have to address

the problem. You need to ensure that the final output is generated as quickly as possible. What should you do?

Options:

Design a Spark program that runs under Dataproc. Code the program to wait for user input when an error is detected. Rerun the last action after correcting any stage output data errors.

Design the pipeline as a set of PTransforms in Dataflow. Restart the pipeline after correcting any stage output data errors.

Design the workflow as a Cloud Workflow instance. Code the workflow to jump to a given stage based on an input parameter. Rerun the workflow after correcting any stage output data errors.

Design the processing as a directed acyclic graph (DAG) in Cloud Composer. Clear the state of the failed task after correcting any stage output data errors.

Buy Now

Questions 5

Your team wants to create a monthly report to analyze inventory data that is updated daily. You need to aggregate the inventory counts by using only the most recent month of data, and save the results to be used in a Looker Studio dashboard. What should you do?

Options:

Create a materialized view in BigQuery that uses the SUM( ) function and the DATE_SUB( ) function.

Create a saved query in the BigQuery console that uses the SUM( ) function and the DATE_SUB( ) function. Re-run the saved query every month, and save the results to a BigQuery table.

Create a BigQuery table that uses the SUM( ) function and the _PARTITIONDATE filter.

Create a BigQuery table that uses the SUM( ) function and the DATE_DIFF( ) function.

Buy Now

Questions 6

Your organization has several datasets in BigQuery. The datasets need to be shared with your external partners so that they can run SQL queries without needing to copy the data to their own projects. You have organized each partner’s data in its own BigQuery dataset. Each partner should be able to access only their data. You want to share the data while following Google-recommended practices. What should you do?

Options:

Use Analytics Hub to create a listing on a private data exchange for each partner dataset. Allow each partner to subscribe to their respective listings.

Create a Dataflow job that reads from each BigQuery dataset and pushes the data into a dedicated Pub/Sub topic for each partner. Grant each partner the pubsub. subscriber IAM role.

Export the BigQuery data to a Cloud Storage bucket. Grant the partners the storage.objectUser IAM role on the bucket.

Grant the partners the bigquery.user IAM role on the BigQuery project.

Buy Now

Questions 7

You are working on a data pipeline that will validate and clean incoming data before loading it into BigQuery for real-time analysis. You want to ensure that the data validation and cleaning is performed efficiently and can handle high volumes of data. What should you do?

Options:

Write custom scripts in Python to validate and clean the data outside of Google Cloud. Load the cleaned data into BigQuery.

Use Cloud Run functions to trigger data validation and cleaning routines when new data arrives in Cloud Storage.

Use Dataflow to create a streaming pipeline that includes validation and transformation steps.

Load the raw data into BigQuery using Cloud Storage as a staging area, and use SQL queries in BigQuery to validate and clean the data.

Buy Now

Questions 8

Your company is migrating their batch transformation pipelines to Google Cloud. You need to choose a solution that supports programmatic transformations using only SQL. You also want the technology to support Git integration for version control of your pipelines. What should you do?

Options:

Use Cloud Data Fusion pipelines.

Use Dataform workflows.

Use Dataflow pipelines.

Use Cloud Composer operators.

Buy Now

Questions 9

Your organization uses a BigQuery table that is partitioned by ingestion time. You need to remove data that is older than one year to reduce your organization’s storage costs. You want to use the most efficient approach while minimizing cost. What should you do?

Options:

Create a scheduled query that periodically runs an update statement in SQL that sets the “deleted" column to “yes” for data that is more than one year old. Create a view that filters out rows that have been marked deleted.

Create a view that filters out rows that are older than one year.

Require users to specify a partition filter using the alter table statement in SQL.

Set the table partition expiration period to one year using the ALTER TABLE statement in SQL.

Buy Now

Questions 10

Your organization’s ecommerce website collects user activity logs using a Pub/Sub topic. Your organization’s leadership team wants a dashboard that contains aggregated user engagement metrics. You need to create a solution that transforms the user activity logs into aggregated metrics, while ensuring that the raw data can be easily queried. What should you do?

Options:

Create a Dataflow subscription to the Pub/Sub topic, and transform the activity logs. Load the transformed data into a BigQuery table for reporting.

Create an event-driven Cloud Run function to trigger a data transformation pipeline to run. Load the transformed activity logs into a BigQuery table for reporting.

Create a Cloud Storage subscription to the Pub/Sub topic. Load the activity logs into a bucket using the Avro file format. Use Dataflow to transform the data, and load it into a BigQuery table for reporting.

Create a BigQuery subscription to the Pub/Sub topic, and load the activity logs into the table. Create a materialized view in BigQuery using SQL to transform the data for reporting

Buy Now

Questions 11

You manage a Cloud Storage bucket that stores temporary files created during data processing. These temporary files are only needed for seven days, after which they are no longer needed. To reduce storage costs and keep your bucket organized, you want to automatically delete these files once they are older than seven days. What should you do?

Options:

Set up a Cloud Scheduler job that invokes a weekly Cloud Run function to delete files older than seven days.

Configure a Cloud Storage lifecycle rule that automatically deletes objects older than seven days.

Develop a batch process using Dataflow that runs weekly and deletes files based on their age.

Create a Cloud Run function that runs daily and deletes files older than seven days.

Buy Now

Questions 12

Your company has several retail locations. Your company tracks the total number of sales made at each location each day. You want to use SQL to calculate the weekly moving average of sales by location to identify trends for each store. Which query should you use?

Associate-Data-Practitioner Question 12

Options:

Option A

Option B

Option C

Option D

Buy Now

Questions 13

You manage an ecommerce website that has a diverse range of products. You need to forecast future product demand accurately to ensure that your company has sufficient inventory to meet customer needs and avoid stockouts. Your company's historical sales data is stored in a BigQuery table. You need to create a scalable solution that takes into account the seasonality and historical data to predict product demand. What should you do?

Options:

Use the historical sales data to train and create a BigQuery ML time series model. Use the ML.FORECAST function call to output the predictions into a new BigQuery table.

Use Colab Enterprise to create a Jupyter notebook. Use the historical sales data to train a custom prediction model in Python.

Use the historical sales data to train and create a BigQuery ML linear regression model. Use the ML.PREDICT function call to output the predictions into a new BigQuery table.

Use the historical sales data to train and create a BigQuery ML logistic regression model. Use the ML.PREDICT function call to output the predictions into a new BigQuery table.

Buy Now

Questions 14

You are a database administrator managing sales transaction data by region stored in a BigQuery table. You need to ensure that each sales representative can only see the transactions in their region. What should you do?

Options:

Add a policy tag in BigQuery.

Create a row-level access policy.

Create a data masking rule.

Grant the appropriate 1AM permissions on the dataset.

Buy Now

Questions 15

You need to create a new data pipeline. You want a serverless solution that meets the following requirements:

• Data is streamed from Pub/Sub and is processed in real-time.

• Data is transformed before being stored.

• Data is stored in a location that will allow it to be analyzed with SQL using Looker.

Which Google Cloud services should you recommend for the pipeline?

Options:

1. Dataproc Serverless

2. Bigtable

1. Cloud Composer

2. Cloud SQL for MySQL

1. BigQuery

2. Analytics Hub

1. Dataflow

2. BigQuery

Buy Now

Questions 16

You want to process and load a daily sales CSV file stored in Cloud Storage into BigQuery for downstream reporting. You need to quickly build a scalable data pipeline that transforms the data while providing insights into data quality issues. What should you do?

Options:

Create a batch pipeline in Cloud Data Fusion by using a Cloud Storage source and a BigQuery sink.

Load the CSV file as a table in BigQuery, and use scheduled queries to run SQL transformation scripts.

Load the CSV file as a table in BigQuery. Create a batch pipeline in Cloud Data Fusion by using a BigQuery source and sink.

Create a batch pipeline in Dataflow by using the Cloud Storage CSV file to BigQuery batch template.

Buy Now

Questions 17

You manage a web application that stores data in a Cloud SQL database. You need to improve the read performance of the application by offloading read traffic from the primary database instance. You want to implement a solution that minimizes effort and cost. What should you do?

Options:

Use Cloud CDN to cache frequently accessed data.

Store frequently accessed data in a Memorystore instance.

Migrate the database to a larger Cloud SQL instance.

Enable automatic backups, and create a read replica of the Cloud SQL instance.

Buy Now

Questions 18

You are designing an application that will interact with several BigQuery datasets. You need to grant the application’s service account permissions that allow it to query and update tables within the datasets, and list all datasets in a project within your application. You want to follow the principle of least privilege. Which pre-defined IAM role(s) should you apply to the service account?

Options:

roles/bigquery.jobUser and roles/bigquery.dataOwner

roles/bigquery.connectionUser and roles/bigquery.dataViewer

roles/bigquery.admin

roles/bigquery.user and roles/bigquery.filteredDataViewer

Buy Now

Answer:

Explanation:

roles/bigquery.jobUser:

This role allows a user or service account to run BigQuery jobs, including queries. This is necessary for the application to interact with and query the tables.

From Google Cloud documentation: "BigQuery Job User can run BigQuery jobs, including queries, load jobs, export jobs, and copy jobs."

roles/bigquery.dataOwner:

This role grants full control over BigQuery datasets and tables. It allows the service account to update tables, which is a requirement of the application.

From Google Cloud documentation: "BigQuery Data Owner can create, delete, and modify BigQuery datasets and tables. BigQuery Data Owner can also view data and run queries."

Why other options are incorrect:

B. roles/bigquery.connectionUser and roles/bigquery.dataViewer:

roles/bigquery.connectionUser is used for external connections, which is not required for this task. roles/bigquery.dataViewer only allows viewing data, not updating it.

C. roles/bigquery.admin:

roles/bigquery.admin grants excessive permissions. Following the principle of least privilege, this role is too broad.

D. roles/bigquery.user and roles/bigquery.filteredDataViewer:

roles/bigquery.user grants the ability to run queries, but not the ability to modify data. roles/bigquery.filteredDataViewer only provides permission to view filtered data, which is not sufficient for updating tables.

Principle of Least Privilege:

The principle of least privilege is a security concept that states that a user or service account should be granted only the permissions necessary to perform its intended tasks.

By assigning roles/bigquery.jobUser and roles/bigquery.dataOwner, we provide the application with the exact permissions it needs without granting unnecessary access.

Google Cloud Documentation References:

BigQuery IAM roles:https://cloud.google.com/bigquery/docs/access-control-basic-roles

IAM best practices:https://cloud.google.com/iam/docs/best-practices-for-using-iam

Questions 19

Your organization has several datasets in their data warehouse in BigQuery. Several analyst teams in different departments use the datasets to run queries. Your organization is concerned about the variability of their monthly BigQuery costs. You need to identify a solution that creates a fixed budget for costs associated with the queries run by each department. What should you do?

Options:

Create a custom quota for each analyst in BigQuery.

Create a single reservation by using BigQuery editions. Assign all analysts to the reservation.

Assign each analyst to a separate project associated with their department. Create a single reservation by using BigQuery editions. Assign all projects to the reservation.

Assign each analyst to a separate project associated with their department. Create a single reservation for each department by using BigQuery editions. Create assignments for each project in the appropriate reservation.

Buy Now

Questions 20

You work for an online retail company. Your company collects customer purchase data in CSV files and pushes them to Cloud Storage every 10 minutes. The data needs to be transformed and loaded into BigQuery for analysis. The transformation involves cleaning the data, removing duplicates, and enriching it with product information from a separate table in BigQuery. You need to implement a low-overhead solution that initiates data processing as soon as the files are loaded into Cloud Storage. What should you do?

Options:

Use Cloud Composer sensors to detect files loading in Cloud Storage. Create a Dataproc cluster, and use a Composer task to execute a job on the cluster to process and load the data into BigQuery.

Schedule a direct acyclic graph (DAG) in Cloud Composer to run hourly to batch load the data from Cloud Storage to BigQuery, and process the data in BigQuery using SQL.

Use Dataflow to implement a streaming pipeline using anOBJECT_FINALIZEnotification from Pub/Sub to read the data from Cloud Storage, perform the transformations, and write the data to BigQuery.

Create a Cloud Data Fusion job to process and load the data from Cloud Storage into BigQuery. Create anOBJECT_FINALIZE notification in Pub/Sub, and trigger a Cloud Run function to start the Cloud Data Fusion job as soon as new files are loaded.

Buy Now

Questions 21

Your team is building several data pipelines that contain a collection of complex tasks and dependencies that you want to execute on a schedule, in a specific order. The tasks and dependencies consist of files in Cloud Storage, Apache Spark jobs, and data in BigQuery. You need to design a system that can schedule and automate these data processing tasks using a fully managed approach. What should you do?

Options:

Use Cloud Scheduler to schedule the jobs to run.

Use Cloud Tasks to schedule and run the jobs asynchronously.

Create directed acyclic graphs (DAGs) in Cloud Composer. Use the appropriate operators to connect to Cloud Storage, Spark, and BigQuery.

Create directed acyclic graphs (DAGs) in Apache Airflow deployed on Google Kubernetes Engine. Use the appropriate operators to connect to Cloud Storage, Spark, and BigQuery.

Buy Now

Questions 22

Your team needs to analyze large datasets stored in BigQuery to identify trends in user behavior. The analysis will involve complex statistical calculations, Python packages, and visualizations. You need to recommend a managed collaborative environment to develop and share the analysis. What should you recommend?

Options:

Create a Colab Enterprise notebook and connect the notebook to BigQuery. Share the notebook with your team. Analyze the data and generate visualizations in Colab Enterprise.

Create a statistical model by using BigQuery ML. Share the query with your team. Analyze the data and generate visualizations in Looker Studio.

Create a Looker Studio dashboard and connect the dashboard to BigQuery. Share the dashboard with your team. Analyze the data and generate visualizations in Looker Studio.

Connect Google Sheets to BigQuery by using Connected Sheets. Share the Google Sheet with your team. Analyze the data and generate visualizations in Gooqle Sheets.

Buy Now

Questions 23

You are migrating data from a legacy on-premises MySQL database to Google Cloud. The database contains various tables with different data types and sizes, including large tables with millions of rowsand transactional data. You need to migrate this data while maintaining data integrity, and minimizing downtime and cost. What should you do?

Options:

Set up a Cloud Composer environment to orchestrate a custom data pipeline. Use a Python script to extract data from the MySQL database and load it to MySQL on Compute Engine.

Export the MySQL database to CSV files, transfer the files to Cloud Storage by using Storage Transfer Service, and load the files into a Cloud SQL for MySQL instance.

Use Database Migration Service to replicate the MySQL database to a Cloud SQL for MySQL instance.

Use Cloud Data Fusion to migrate the MySQL database to MySQL on Compute Engine.

Buy Now

Questions 24

Your organization has highly sensitive data that gets updated once a day and is stored across multiple datasets in BigQuery. You need to provide a new data analyst access to query specific data in BigQuery while preventing access to sensitive data. What should you do?

Options:

Grant the data analyst the BigQuery Job User IAM role in the Google Cloud project.

Create a materialized view with the limited data in a new dataset. Grant the data analyst BigQuery Data Viewer IAM role in the dataset and the BigQuery Job User IAM role in the Google Cloud project.

Create a new Google Cloud project, and copy the limited data into a BigQuery table. Grant the data analyst the BigQuery Data Owner IAM role in the new Google Cloud project.

Grant the data analyst the BigQuery Data Viewer IAM role in the Google Cloud project.

Buy Now

Questions 25

You have a Dataproc cluster that performs batch processing on data stored in Cloud Storage. You need to schedule a daily Spark job to generate a report that will be emailed to stakeholders. You need a fully-managed solution that is easy to implement and minimizes complexity. What should you do?

Options:

Use Cloud Composer to orchestrate the Spark job and email the report.

Use Dataproc workflow templates to define and schedule the Spark job, and to email the report.

Use Cloud Run functions to trigger the Spark job and email the report.

Use Cloud Scheduler to trigger the Spark job. and use Cloud Run functions to email the report.

Buy Now

Questions 26

You are using your own data to demonstrate the capabilities of BigQuery to your organization’s leadership team. You need to perform a one-time load of the files stored on your local machine into BigQuery using as little effort as possible. What should you do?

Options:

Write and execute a Python script using the BigQuery Storage Write API library.

Create a Dataproc cluster, copy the files to Cloud Storage, and write an Apache Spark job using the spark-bigquery-connector.

Execute the bq load command on your local machine.

Create a Dataflow job using the Apache Beam FileIO and BigQueryIO connectors with a local runner.

Buy Now

Exam Code: Associate-Data-Practitioner

Exam Name: Google Cloud Associate Data Practitioner (ADP Exam)

Last Update: Sep 11, 2025

Questions: 106

PDF + Testing Engine

$134.99

Testing Engine

$99.99

PDF (Q&A)

$84.99

Weekend Special Limited Time 65% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: netbudy65

Dumpsbuddy logo

Associate-Data-Practitioner Google Cloud Associate Data Practitioner (ADP Exam) Questions and Answers

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

PDF + Testing Engine

Testing Engine

PDF (Q&A)

Quick Links

Why Us

Unlimited Packages

Site Secure

We Accept