ACA Big Data Certification Exam

Last Update 10 hours ago Total Questions : 78

The ACA Big Data Certification Exam content is now fully updated, with all current exam questions added 10 hours ago. Deciding to include ACA-BigData1 practice exam questions in your study plan goes far beyond basic test preparation.

You'll find that our ACA-BigData1 exam questions frequently feature detailed scenarios and practical problem-solving exercises that directly mirror industry challenges. Engaging with these ACA-BigData1 sample sets allows you to effectively manage your time and pace yourself, giving you the ability to finish any ACA Big Data Certification Exam practice test comfortably within the allotted time.

Question # 11

Synchronous development in DataWorks provides both wizard and script modes.

Score 1

A.

True

B.

False

Question # 12

If a MySQL database contains 100 tables and Jack wants to migrate all of them to MaxCompute using DataWorks Data Integration, the conventional method would require him to configure 100 data synchronization tasks. With the _______ feature in DataWorks, he can upload all tables at the same time.

Score 2

A.

Full-Database Migration feature

B.

Configure a MySQL Reader plug-in

C.

Configure a MySQL Writer plug-in

D.

Add data sources in Bulk Mode

Question # 13

Apache Spark, included in Alibaba E-MapReduce (EMR), is a fast and general-purpose cluster computing system. It provides high-level APIs in Java, Scala, Python, and R, and an optimized engine that supports general execution graphs. It also supports a rich set of higher-level tools. Which of the following tools is not included in Spark?

Score 2

A.

Spark SQL for SQL and structured data processing

B.

MLlib for machine learning

C.

GraphX for graph processing

D.

TensorFlow for AI

Question # 14

We use the MaxCompute tunnel command to upload the log.txt file to the t_log table. The t_log table is a partitioned table whose partition columns are (p1 string, p2 string). Which of the following commands is correct?

A.

tunnel upload log.txt t_log/p1="b1", p2="b2"

B.

tunnel upload log.txt t_log/(p1="b1", p2="b2")

C.

tunnel upload log.txt t_log/p1="b1"/p2="b2"

Question # 15

The data development mode in DataWorks has been upgraded to a three-level structure comprising _____, _____, and ______. (Number of correct answers: 3)

Score 2

A.

Project

B.

Solution

C.

Business flow

D.

Directory

Question # 16

A business flow in DataWorks integrates different node task types by business type; such a structure facilitates business code development. Which of the following descriptions of the node types is INCORRECT?

Score 2

A.

A zero-load node is a control node that does not generate any data. The virtual node is generally used as the root node for planning the overall node workflow.

B.

An ODPS SQL task allows you to edit and maintain the SQL code on the Web, and easily run, debug, and collaborate on the code.

C.

The PyODPS node in DataWorks can be integrated with MaxCompute Python SDK. You can edit the Python code to operate MaxCompute on a PyODPS node in DataWorks.

D.

The SHELL node supports standard SHELL syntax and the interactive syntax. The SHELL task can run on the default resource group.

Question # 17

One Alibaba Cloud account is entitled to join only one organization that uses DataWorks.

A.

True

B.

False

Question # 18

You are working on a project where you need to chain together MapReduce and Hive jobs. You also need the ability to use forks, decision points, and path joins. Which ecosystem project should you use to perform these actions?

A.

Apache HUE

B.

Apache Zookeeper

C.

Apache Oozie

D.

Apache Spark

Question # 19

MaxCompute SQL is suitable for processing massive data with low real-time requirements, and employs a syntax similar to that of SQL. The efficiency of data queries can be improved by creating proper indexes in the table.

A.

True

B.

False

Question # 20

You want to understand more about how users browse your public website. For example, you want to know which pages they visit prior to placing an order. You have a server farm of 100 web servers hosting your website. Which is the most efficient process for gathering the logs from these web servers into a traditional Hadoop ecosystem?

A.

Just copy them into HDFS using curl

B.

Ingest the server web logs into HDFS using Apache Flume

C.

Channel these clickstreams into Hadoop using Hadoop Streaming

D.

Import all user clicks from your OLTP databases into Hadoop using Sqoop
