Spring Sale Special Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: buysanta

Exact2Pass Menu

Google Professional Data Engineer Exam

Last Update 12 hours ago Total Questions : 400

The Google Professional Data Engineer Exam content is now fully updated, with all current exam questions added 12 hours ago. Deciding to include Professional-Data-Engineer practice exam questions in your study plan goes far beyond basic test preparation.

You'll find that our Professional-Data-Engineer exam questions frequently feature detailed scenarios and practical problem-solving exercises that directly mirror industry challenges. Engaging with these Professional-Data-Engineer sample sets allows you to effectively manage your time and pace yourself, giving you the ability to finish any Google Professional Data Engineer Exam practice test comfortably within the allotted time.

Question # 31

Which of the following statements about the Wide & Deep Learning model are true? (Select 2 answers.)

A.

The wide model is used for memorization, while the deep model is used for generalization.

B.

A good use for the wide and deep model is a recommender system.

C.

The wide model is used for generalization, while the deep model is used for memorization.

D.

A good use for the wide and deep model is a small-scale linear regression problem.

Question # 32

Which software libraries are supported by Cloud Machine Learning Engine?

A.

Theano and TensorFlow

B.

Theano and Torch

C.

TensorFlow

D.

TensorFlow and Torch

Question # 33

When creating a new Cloud Dataproc cluster with the projects.regions.clusters.create operation, these four values are required: project, region, name, and ____.

A.

zone

B.

node

C.

label

D.

type

Question # 34

Which of the following is not true about Dataflow pipelines?

A.

Pipelines are a set of operations

B.

Pipelines represent a data processing job

C.

Pipelines represent a directed graph of steps

D.

Pipelines can share data between instances

Question # 35

Which role must be assigned to a service account used by the virtual machines in a Dataproc cluster so they can execute jobs?

A.

Dataproc Worker

B.

Dataproc Viewer

C.

Dataproc Runner

D.

Dataproc Editor

Question # 36

When a Cloud Bigtable node fails, ____ is lost.

A.

all data

B.

no data

C.

the last transaction

D.

the time dimension

Question # 37

Which of the following IAM roles does your Compute Engine account require to be able to run pipeline jobs?

A.

dataflow.worker

B.

dataflow.compute

C.

dataflow.developer

D.

dataflow.viewer

Question # 38

The _________ for Cloud Bigtable makes it possible to use Cloud Bigtable in a Cloud Dataflow pipeline.

A.

Cloud Dataflow connector

B.

DataFlow SDK

C.

BiqQuery API

D.

BigQuery Data Transfer Service

Question # 39

Your company has recently grown rapidly and now ingesting data at a significantly higher rate than it was previously. You manage the daily batch MapReduce analytics jobs in Apache Hadoop. However, the recent increase in data has meant the batch jobs are falling behind. You were asked to recommend ways the development team could increase the responsiveness of the analytics without increasing costs. What should you recommend they do?

A.

Rewrite the job in Pig.

B.

Rewrite the job in Apache Spark.

C.

Increase the size of the Hadoop cluster.

D.

Decrease the size of the Hadoop cluster but also rewrite the job in Hive.

Question # 40

You work for a manufacturing plant that batches application log files together into a single log file once a day at 2:00 AM. You have written a Google Cloud Dataflow job to process that log file. You need to make sure the log file in processed once per day as inexpensively as possible. What should you do?

A.

Change the processing job to use Google Cloud Dataproc instead.

B.

Manually start the Cloud Dataflow job each morning when you get into the office.

C.

Create a cron job with Google App Engine Cron Service to run the Cloud Dataflow job.

D.

Configure the Cloud Dataflow job as a streaming job so that it processes the log data immediately.

Go to page: