Professional-Data-Engineer Google Professional Data Engineer Exam exact Exam Questions

Google Professional Data Engineer Exam

Last Update 13 hours ago Total Questions : 400

The Google Professional Data Engineer Exam content is now fully updated, with all current exam questions added 13 hours ago. Deciding to include Professional-Data-Engineer practice exam questions in your study plan goes far beyond basic test preparation.

You'll find that our Professional-Data-Engineer exam questions frequently feature detailed scenarios and practical problem-solving exercises that directly mirror industry challenges. Engaging with these Professional-Data-Engineer sample sets allows you to effectively manage your time and pace yourself, giving you the ability to finish any Google Professional Data Engineer Exam practice test comfortably within the allotted time.

Question # 31

Which of the following statements about the Wide & Deep Learning model are true? (Select 2 answers.)

The wide model is used for memorization, while the deep model is used for generalization.

A good use for the wide and deep model is a recommender system.

The wide model is used for generalization, while the deep model is used for memorization.

A good use for the wide and deep model is a small-scale linear regression problem.

Question # 32

Which software libraries are supported by Cloud Machine Learning Engine?

Theano and TensorFlow

Theano and Torch

TensorFlow

TensorFlow and Torch

Question # 33

When creating a new Cloud Dataproc cluster with the projects.regions.clusters.create operation, these four values are required: project, region, name, and ____.

zone

node

label

type

Question # 34

Which of the following is not true about Dataflow pipelines?

Pipelines are a set of operations

Pipelines represent a data processing job

Pipelines represent a directed graph of steps

Pipelines can share data between instances

Question # 35

Which role must be assigned to a service account used by the virtual machines in a Dataproc cluster so they can execute jobs?

Dataproc Worker

Dataproc Viewer

Dataproc Runner

Dataproc Editor

Question # 36

When a Cloud Bigtable node fails, ____ is lost.

all data

no data

the last transaction

the time dimension

Question # 37

Which of the following IAM roles does your Compute Engine account require to be able to run pipeline jobs?

dataflow.worker

dataflow.compute

dataflow.developer

dataflow.viewer

Question # 38

The _________ for Cloud Bigtable makes it possible to use Cloud Bigtable in a Cloud Dataflow pipeline.

Cloud Dataflow connector

DataFlow SDK

BiqQuery API

BigQuery Data Transfer Service

Question # 39

Your company has recently grown rapidly and now ingesting data at a significantly higher rate than it was previously. You manage the daily batch MapReduce analytics jobs in Apache Hadoop. However, the recent increase in data has meant the batch jobs are falling behind. You were asked to recommend ways the development team could increase the responsiveness of the analytics without increasing costs. What should you recommend they do?

Rewrite the job in Pig.

Rewrite the job in Apache Spark.

Increase the size of the Hadoop cluster.

Decrease the size of the Hadoop cluster but also rewrite the job in Hive.

Question # 40

You work for a manufacturing plant that batches application log files together into a single log file once a day at 2:00 AM. You have written a Google Cloud Dataflow job to process that log file. You need to make sure the log file in processed once per day as inexpensively as possible. What should you do?

Change the processing job to use Google Cloud Dataproc instead.

Manually start the Cloud Dataflow job each morning when you get into the office.

Create a cron job with Google App Engine Cron Service to run the Cloud Dataflow job.

Configure the Cloud Dataflow job as a streaming job so that it processes the log data immediately.