Labour Day Sale Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: buysanta

Exact2Pass Menu

Question # 4

You want to make your model more frugal to reduce the cost of collecting and processing data.

You plan to do this by removing features that are highly correlated. You would like to create a heat

map that displays the correlation so that you can identify candidate features to remove.

Which Accelerated Data Science (ADS) SDK method is appropriate to display the comparability

between Continuous and Categorical features?

A.

pearson_plot()

B.

cramersv_plot()

C.

correlation_ratio_plot()

D.

corr()

Full Access
Question # 5

Which TWO statements are true about published conda environments?

A.

The odsc conda init command is used to configure the location of published conda en-vironments.

B.

They can be used in Data Science Jobs and model deployments.

C.

Your notebook session acts as the source to share published conda environment with team members.

D.

You can only create published conda environment by modifying a Data Science conde

E.

They are curated by Oracle Cloud Infrastructure (OCI) Data Science.

Full Access
Question # 6

As a data scientist, you have stored sensitive data in a database. You need to protect this data by

using a master encryption algorithm, which uses symmetric keys. Which master encryption

algorithm would you choose in the Oracle Cloud Infrastructure (OCI) Vault service?

A.

Triple Data Encryption Standard Algorithm

B.

Elliptical Curve Cryptography Digital Signature Algorithm

C.

Advanced Encryption Standard Keys

D.

Rivert-Shamir-Adleman Keys

Full Access
Question # 7

You realize that your model deployment is about to reach its utilization limit. What would you do to avoid the issue before requests start to fail?

A.

Update the deployment to use fewer instances.

B.

Delete the deployment.

C.

Reduce the load balancer bandwidth limit so that fewer requests come in.

D.

Update the deployment to use a larger virtual machine (mare CPUs/memory).

E.

Update the deployment to add more instances.

Full Access
Question # 8

You want to write a Python script to create a collection of different projects for your data science

team. Which Oracle Cloud Infrastructure (OCI) Data Science interface would you use?

A.

The OCI Software Development Kit (SDK)

B.

OCI Console

C.

Command line interface (CLI)

D.

Mobile App

Full Access
Question # 9

You have created a Data Science project in a compartment called Development and shared it

with a group of collaborators. You now need to move the project to a different compartment called

Production after completing the current development iteration.

Which statement is correct?

A.

Moving a project to a different compartment also moves its associated notebook sessions

and models to the new compartment.

B.

Moving a project to a different compartment requires deleting all its associated notebook

sessions and models first.

C.

You cannot move a project to a different compartment after it has been created.

D.

You can move a project to a different compartment without affecting its associated

notebook sessions and models

Full Access
Question # 10

You have created a conda environment in your notebook session. This is the first time you are

working with published conda environments. You have also created an Object Storage bucket with

permission to manage the bucket.

Which two commands are required to publish the conda environment?

A.

odac conda publish --slug

B.

odsc conda list --override

C.

odsc conda init --bucket_namespace --bucket_name

D.

odsc conda create --file manifest.yaml

E.

conda activate /home/datascience/conda/

Full Access
Question # 11

You are a data scientist with a set of text and image files that need annotation, and you want to use Oracle Cloud Infrastructure (OCI) Data Labeling. Which of the following THREE an-notation classes are supported by the tool.?

A.

Object Detection

B.

Named Entity Extraction

C.

Classification (single/multi label)

D.

Key-Point and Landmark

E.

Polygonal Segmentation

F.

Semantic Segmentation

Full Access
Question # 12

Six months ago, you created and deployed a model that predicts customer churn for a call center. Initially, it was yielding quality predictions. However, over the last two months, users have been questioning the credibility of the predictions. Which TWO methods customer churn would you employ to verify the accuracy of the model?

A.

Redeploy the model

B.

Retrain the model

C.

Operational monitoring

D.

Validate the model using recent data

E.

Drift monitoring

Full Access
Question # 13

You are a data scientist working inside a notebook session and you attempt to pip install a

package from a public repository that is not included in your conda environment. After running this

command, you get a network timeout error.

What might be missing from your networking configuration?

A.

FastConnect to an on-premises network.

B.

Primary Virtual Network Interface Card (VNIC).

C.

NAT Gateway with public internet access.

D.

Service Gateway with private subnet access

Full Access
Question # 14

As a data scientist, you create models for cancer prediction based on mammographic images.

The correct identification is very crucial in this case. After evaluating two models, you arrive at the

following confusion matrix.

Model 1 has Test accuracy is 80% and recall is 70%.

• Model 2 has Test accuracy is 75% and recall is 85%.

Which model would you prefer and why?

A.

Model 2, because recall is high.

B.

Model 1, because the test accuracy is high.

C.

Model 2, because recall has more impact on predictions in this use se.

D.

Model 1, because recall has lesser impact on predictions in this use case

Full Access
Question # 15

You are preparing a configuration object necessary to create a Data Flow application. Which THREE parameter values should you provide?

A.

The path to the arhive.zip file.

B.

The local path to your pySpark script.

C.

The compartment of the Data Flow application.

D.

The bucket used to read/write the pySpark script in Object Storage.

E.

The display name of the application.

Full Access
Question # 16

During a job run, you receive an error message that no space is left on your disk device. To solve the problem, you must increase the size of the job storage. What would be the most effi-cient way to do this with Data Science Jobs?

A.

On the job run, set the environment variable that helps increase the size of the storage.

B.

Your code using too much disk space. Refactor the code to identify the problem.

C.

Edit the job, change the size of the storage of your job, and start a new job run.

D.

Create a new job with increased storage size and then run the job.

Full Access
Question # 17

You have built a machine model to predict whether a bank customer is going to default on a

loan. You want to use Local Interpretable Model-Agnostic Explanations (LIME) to understand a

specific prediction. What is the key idea behind LIME?

A.

Global behaviour of a machine learning model may be complex, while the local behaviour

may be approximated with a simpler surrogate model.

B.

Model-agnostic techniques are more interpretable than techniques that are dependent on

the types of models.

C.

Global and local behaviours of machine learning models are similar.

D.

Local explanation techniques are model-agnostic, while global explanation techniques are

not

Full Access
Question # 18

You are a data scientist leveraging the Oracle Cloud Infrastructure (OCI) Language AI service for

various types of text analyses. Which TWO capabilities can you utilize with this tool?

A.

Topic classification

B.

Table extraction

C.

Sentiment analysis

D.

Sentence diagramming

E.

Punctuation correction

Full Access
Question # 19

You are asked to prepare data for a custom-built model that requires transcribing Spanish video

recordings into a readable text format with profane words identified.

Which Oracle Cloud service would you use?

A.

OCI Translation

B.

OCI Language

C.

OCI Speech

D.

OCI Anomaly Detection

Full Access
Question # 20

You have just received a new data set from a colleague. You want to quickly find out summary information about the data set, such as the types of features, total number of observations, and data distributions, Which Accelerated Data Science (ADS) SDK method from the AD&Dataset class would you use?

A.

Show_in_notebook{}

B.

To_xgb{}

C.

Compute{}

D.

Show_corr{}

Full Access
Question # 21

You have an embarrassingly parallel or distributed batch job on a large amount of data that you

consider running using Data Science Jobs. What would be the best approach to run the workload?

A.

Create the job in Data Science Jobs and start a job run. When it is done, start a new job run

until you achieve the number of runs required.

B.

Create the job in Data Science Jobs and then start the number of simultaneous jobs runs

required for your workload.

C.

Reconfigure the job run because Data Science Jobs does not support embarrassingly parallel

workloads.

D.

Create a new job for every job run that you have to run in parallel, because the Data Science

Jobs service can have only one job run per job.

Full Access
Question # 22

You are a data scientist building a pipeline in the Oracle Cloud Infrastructure (OCI) Data Science

service for your machine learning project. You want to optimize the pipeline completion time by

running some steps in parallel. Which statement is true about running pipeline steps in parallel?

A.

Steps in a pipeline can be run only sequentially.

B.

Pipeline steps can be run in sequence or in parallel, as long as they create a directed acyclic

graph (DAG).

C.

All pipeline steps are always run in parallel.

D.

Parallel steps cannot be run if they are completely independent of each other.

Full Access
Question # 23

You train a model to predict housing prices for your city. Which two metrics from the

Accelerated Data Science (ADS) ADSEvaluator class can you use to evaluate the regression model?

A.

Explained Variance Score

B.

F-1 Score

C.

Weighted Precision

D.

Weighted Recall

E.

Mean Absolute Error

Full Access
Question # 24

Which of the following TWO non-open source JupyterLab extensions has Oracle Cloud In-frastructure (OCI) Data Science developed and added to the notebook session experience?

A.

Environment Explorer

B.

Table of Contents

C.

Command Palette

D.

Notebook Examples

E.

Terminal

Full Access