ACA Big Data Certification Exam

Last Update: 8 hours ago. Total Questions: 78

The ACA Big Data Certification Exam content is now fully updated, with all current exam questions added 8 hours ago. Deciding to include ACA-BigData1 practice exam questions in your study plan goes far beyond basic test preparation.

You'll find that our ACA-BigData1 exam questions frequently feature detailed scenarios and practical problem-solving exercises that directly mirror industry challenges. Engaging with these ACA-BigData1 sample sets allows you to effectively manage your time and pace yourself, giving you the ability to finish any ACA Big Data Certification Exam practice test comfortably within the allotted time.

Question # 1

Distributed file systems like GFS and Hadoop are designed to have much larger block (or chunk) sizes, like 64 MB or 128 MB. Which of the following descriptions are correct? (Number of correct answers: 4)

Score 2

A.

It reduces clients' need to interact with the master because reads and writes on the same block (or chunk) require only one initial request to the master for block location information

B.

Since on a large block (or chunk) a client is more likely to perform many operations on a given block, it can reduce network overhead by keeping a persistent TCP connection to the metadata server over an extended period of time

C.

It reduces the size of the metadata stored on the master

D.

The servers storing those blocks may become hot spots if many clients are accessing the same small files

E.

If necessary to support even larger file systems, the cost of adding extra memory to the metadata server is a big price
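The relationship between block size and the number of block entries a master has to track can be checked with simple arithmetic. The following is a minimal sketch, not taken from GFS or Hadoop code: the helper name `block_count` and the 1 TiB file size are hypothetical, chosen only for illustration.

```python
MB = 1024 * 1024


def block_count(file_size_bytes: int, block_size_bytes: int) -> int:
    """Return how many blocks are needed to hold a file.

    Hypothetical helper: uses integer ceiling division, so a final
    partial block still counts as one block.
    """
    return (file_size_bytes + block_size_bytes - 1) // block_size_bytes


# Assumed example: a single 1 TiB file stored at different block sizes.
file_size = 1024 * 1024 * MB

for block_mb in (4, 64, 128):
    n = block_count(file_size, block_mb * MB)
    print(f"{block_mb:>3} MB blocks -> {n} block entries to track")
```

Under these assumptions, each doubling of the block size halves the number of block entries for the same file.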

Question # 2

Where is the metadata (e.g., table schemas) stored in Hive?

A.

Stored as metadata on the NameNode

B.

Stored along with the data in HDFS

C.

Stored in an RDBMS like MySQL

D.

Stored in ZooKeeper

Question # 3

There are various methods for accessing MaxCompute, for example, through the management console, the client command line, and the Java API. The command line tool odpscmd can be used to create, operate, or delete a table in a project.

Score 1

A.

True

B.

False

Question # 4

Function Studio is a web project coding and development tool independently developed by the Alibaba Group for function development scenarios. It is an important component of DataWorks. Function Studio supports several programming languages and platform-based function development scenarios except for ______ .

Score 2

A.

Real-time computing

B.

Python

C.

Java

D.

Scala

Question # 5

Alibaba Cloud Elastic MapReduce (E-MapReduce) is a big data processing solution to quickly process huge amounts of data. Based on open source Apache Hadoop and Apache Spark, E-MapReduce flexibly manages your big data use cases such as trend analysis, data warehousing, and analysis of continuously streaming data.

Score 1

A.

True

B.

False

Question # 6

Your company stores user profile records in an OLTP database. You want to join these records with web server logs you have already ingested into the Hadoop file system. What is the best way to obtain and ingest these user records?

Score 2

A.

Ingest with Hadoop streaming

B.

Ingest using Hive

C.

Ingest with sqoop import

D.

Ingest with Pig's LOAD command

Question # 7

_______ instances in E-MapReduce are responsible for computing and can quickly add computing power to a cluster. They can also scale up and down at any time without impacting the operations of the cluster.

Score 2

A.

Task

B.

Gateway

C.

Master

D.

Core

Question # 8

MaxCompute uses the project as its billing unit. Billing is based on three aspects: storage usage, computing resources, and data downloads. You pay for compute and storage resources by the day with no long-term commitments.

Score 1

A.

True

B.

False

Question # 9

MaxCompute Tunnel provides high-concurrency data upload and download services. Users can use the Tunnel service to upload data to or download data from MaxCompute. Which of the following descriptions about Tunnel is NOT correct?

Score 2

A.

MaxCompute Tunnel provides the Java programming interface for users

B.

MaxCompute provides two data import and export methods: using Tunnel Operation on the console directly or using TUNNEL written with Java

C.

If data fails to be uploaded, use the restore command to restore the upload from where it was interrupted

D.

Tunnel commands are mainly used to upload or download data. They provide the following functions: upload, download, resume, show, purge, etc.

Question # 10

DataWorks can be used to create all types of tasks and configure scheduling cycles as needed. The supported granularity levels of scheduling cycles include days, weeks, months, hours, minutes and seconds.

A.

True

B.

False
