Which of the following best describes how discrete data differs from continuous data?
A data analyst needs to create a dashboard to help identify trends in the data sets. Which of the following is an appropriate consideration for dashboard development?
A report is scheduled to run and be distributed at the end of business each day. On Mondays, one of the recipients opens the previous week's reports and combines them to calculate the weekly totals and projections for the coming week. This is a tedious process, and the recipient asks an analyst for help. Which of the following should the analyst recommend?
An analyst for a small business with multiple locations is using each location’s quarterly sales reports from last year to create a single revenue report for the year. Which of the following data mining techniques should the analyst use to complete this task?
An analyst wants to test the association between the number of doors in a car and the number of gears in the car. Which of the following is the best test to use?
Consider two different datasets, one with gas prices and the other with food prices. Which of the following measures is most affected by outliers?
Which of the following techniques should an analyst use to analyze a data set to get a snapshot of basic measures of central tendency?
Given the data below:
In which of the following file formats is the data presented?
A business intelligence team wants to create a new dashboard in order to solve a problem statement. Which of the following is the correct order of steps the team should take?
Given the image below:
The data should be cleaned because of the presence of:
A data analyst is creating a report that will provide information about various regions, products, and time periods. Which of the following formats would be the most efficient way to deliver this report?
Which of the following data analysis tools increases the efficiency of data visualizations?
An organization wants to evaluate whether project activities are within the set projections and in line to meet the desired project targets. Which of the following types of analysis is best suited for this situation?
Consider this dataset showing the retirement age of 11 people, in whole years:
54, 54, 54, 55, 56, 57, 57, 58, 58, 60, 60
This table shows a simple frequency distribution of the retirement age data.
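As a minimal sketch, a simple frequency distribution for the ages listed above could be computed with Python's standard library. The variable names are illustrative only.

```python
from collections import Counter

# Retirement ages from the dataset above, in whole years.
ages = [54, 54, 54, 55, 56, 57, 57, 58, 58, 60, 60]

# Count how many times each distinct age occurs.
frequency = Counter(ages)

# Print one row per distinct age, in ascending order.
for age in sorted(frequency):
    print(age, frequency[age])
```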
Which of the following can be used to translate data into another form so it can only be read by a user who has a key or a password?
A data analyst wants to create "Income Categories" that would be calculated based on the existing variable "Income". The "Income Categories" would be as follows:
Income category 1: less than $1.
Income category 2: more than $1 and less than $20,000.
Income category 3: more than $20,001 and less than $40,000.
Income category 4: more than $40,001.
Which of the following data manipulation techniques should the data analyst use to create "Income Categories"?
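One possible way to derive such a category variable from "Income" is a small conditional function. This is a sketch only: the function name is hypothetical, and because the stated ranges leave boundary values (exactly $1, $20,000 to $20,001, and so on) unassigned, the handling of those values below is an assumption.

```python
def income_category(income):
    """Map an income value to category 1-4.

    Assumption: values that fall in the gaps left by the stated
    ranges are assigned to an adjacent category; the source does
    not say where they belong.
    """
    if income < 1:
        return 1
    if income <= 20_000:
        return 2
    if income <= 40_000:
        return 3
    return 4


# Example incomes (made up for illustration).
categories = [income_category(x) for x in (0, 500, 30_000, 50_000)]
print(categories)
```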
Which of the following would be considered non-personally identifiable information?
Standardized tests are given to students in the middle of each month, and the results are ready by the end of the month. The superintendent needs a quick view of test performance. Which of the following would be the best recommendation to meet the superintendent's requirements?
What subset of Structured Query Language (SQL) is used to add, remove, modify, or retrieve the information stored within a relational database?
Which of the following is most likely to be used as a data-mining ETL tool?
Which of the following summary statements upholds integrity in data reporting?
After the daily ETL jobs are completed, the data in the reports does not appear complete, and a lot of data seems to be missing. Which of the following concepts should be used to assess and investigate further?
The ACME Corporation hired an analyst to detect data quality issues in their Excel documents. Which of the following are the most common issues? (Select TWO)
An analyst needs to provide a chart to identify the composition between the categories of the survey response data set:
Which of the following charts would be BEST to use?
Which of the following is the most appropriate to consider when creating a schema of a central group broken into detailed subcategories?
A data analyst is working for a shipping company and calculating the volume of boxes according to the following formula: volume = height × width × depth. Which of the following variable types describes volume?
An analyst has been tracking company intranet usage and has been asked to create a chart to show the most-used/most-clicked portions of a homepage that contains more than 30 links. Which of the following visualizations would BEST illustrate this information?
A gambler thinks that a coin is fair and is equally likely to turn up heads or tails when the coin is flipped. Which of the following tests should the gambler use to test this hypothesis?
A data analyst is designing a dashboard that will provide a story of sales and determine which site is providing the highest sales volume per customer. The analyst must choose an appropriate chart to include in the dashboard. The following data is available:
Which of the following types of charts should be considered?
A data analyst must separate the column shown below into multiple columns for each component of the name:
Which of the following data manipulation techniques should the analyst perform?
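As an illustration of separating one name column into components, here is a minimal sketch. The column itself is not shown above, so the sample value and the assumption that the components are separated by single spaces are both hypothetical.

```python
# Hypothetical full-name value; the actual column is not shown in the question.
full_name = "Maria Elena Lopez"

# Break the single string into its space-separated components.
first, middle, last = full_name.split(" ")

print(first)
print(middle)
print(last)
```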
A data analyst needs to present the results of an online marketing campaign to the marketing manager. The manager wants to see the most important KPIs and measure the return on marketing investment. Which of the following should the data analyst use to BEST communicate this information to the manager?
Given the following data tables:
Which of the following MDM processes needs to take place FIRST?
An analyst is designing a dashboard to determine which site has the highest percentage of new customers. The analyst must choose an appropriate chart to include in the dashboard. The following data is available:
Which of the following types of charts should be considered to BEST display the data?
An analyst wants to combine two data sets into a single spreadsheet. Column names from the first spreadsheet are listed in rows in the second spreadsheet. Which of the following is the first step the analyst should take to combine the data sets?
A county in Illinois is conducting a survey to determine the mean annual income per household. The county is 427 sq mi (1,106 sq km). Which of the following sampling methods would MOST likely result in a representative sample?
Exhibit.
Which of the following logical statements results in Table B?
A)
B)
C)
D)
While reviewing survey data, a research analyst notices data is missing from all the responses to a single question. Which of the following methods would BEST address this issue?
Five dogs have the following heights in millimeters:
300, 430, 170, 470, 600
Which of the following is the standard deviation for the five dogs?
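The arithmetic can be checked with Python's statistics module. The question does not say whether the five dogs are the whole population or a sample, so this sketch computes both versions.

```python
import statistics

# Heights of the five dogs, in millimeters.
heights = [300, 430, 170, 470, 600]

mean_height = statistics.mean(heights)      # 394

# Population standard deviation (divide by n).
population_sd = statistics.pstdev(heights)  # about 147.32

# Sample standard deviation (divide by n - 1).
sample_sd = statistics.stdev(heights)       # about 164.71

print(mean_height, round(population_sd, 2), round(sample_sd, 2))
```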
Given the following data:
CustomerID    ItemBought    Date
Tre_234       Sofa          2022-09-08
216_Tre       Shoes         08/02/2021
215/Tre       Blanket       2021/06/20
045/Tre       Mug           12-26-2021
Tre-345       Lamp          31/08/2022
TREJD19       Bucket        2022'08/01
Which of the following best describes the main issue in the data set?
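One possible way to profile the Date values listed above is to test each against a single target pattern. Choosing ISO 8601 (YYYY-MM-DD) as the target is an assumption made for this sketch; the question does not name a preferred format.

```python
import re

# Date values copied from the data set above.
dates = [
    "2022-09-08",
    "08/02/2021",
    "2021/06/20",
    "12-26-2021",
    "31/08/2022",
    "2022'08/01",
]

# Assumed target format: four-digit year, two-digit month, two-digit day.
ISO_DATE = re.compile(r"^\d{4}-\d{2}-\d{2}$")

# Collect every value that does not follow the target pattern.
non_iso = [d for d in dates if not ISO_DATE.match(d)]

print(len(non_iso), "of", len(dates), "values do not match YYYY-MM-DD")
```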
A data analyst for a media company needs to determine the most popular movie genre. Given the table below:
Which of the following must be done to the Genre column before this task can be completed?
A sales manager wants quarterly sales reports broken down by unit and week. Which of the following data output lists includes the most necessary information?
A junior web developer is developing a new application where users can upload short videos. The first task is to create a homepage that shows the headline "Upload Your Short Videos" and a clickable button that says "upload now".
Which of the following HTML commands would help the developer to complete the task successfully?
Which of the following occurs if a 90% confidence interval increases to 95%?
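For a normal-based interval, the half-width is the critical value times the standard error, so the effect of the confidence level can be seen in the critical value alone. A minimal sketch, assuming a two-sided interval and the standard normal distribution (the helper name is illustrative):

```python
from statistics import NormalDist


def z_critical(confidence):
    """Two-sided critical value of the standard normal distribution."""
    return NormalDist().inv_cdf(0.5 + confidence / 2)


z90 = z_critical(0.90)  # about 1.645
z95 = z_critical(0.95)  # about 1.960

print(round(z90, 3), round(z95, 3))
```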
Which of the following describes the method of sampling in which elements of data are selected randomly from each of the small subgroups within a population?
The director of operations at a power company needs data to help identify where company resources should be allocated in order to monitor activity for outages and restoration of power in the entire state. Specifically, the director wants to see the following:
* County outages
* Status
* Overall trend of outages
INSTRUCTIONS:
Please select each visualization to fit the appropriate space on the dashboard and choose an appropriate color scheme. Once you have selected all visualizations, please select the appropriate titles and labels, if applicable. Titles and labels may be used more than once.
If at any time you would like to bring back the initial state of the simulation, please click the Reset All button.
Which of the following is an example of a strategy to reduce statistical errors?
Which of the following BEST describes the issue in which character values are mixed with integer values in a data set column?
A data analyst needs to apply quality control concepts to a data set for accuracy. Which of the following is the best way to do this?
A data analyst has been asked to organize the table below in the following ways:
* By sales from high to low
* By state in alphabetic order
Which of the following functions will allow the data analyst to organize the table in this manner?
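The two orderings can be sketched as follows. The table is not shown above, so the rows, column names, and values here are hypothetical.

```python
# Hypothetical rows standing in for the table, which is not shown.
rows = [
    {"state": "Ohio", "sales": 1200},
    {"state": "Alabama", "sales": 3400},
    {"state": "Texas", "sales": 2500},
]

# By sales from high to low.
by_sales = sorted(rows, key=lambda r: r["sales"], reverse=True)

# By state in alphabetic order.
by_state = sorted(rows, key=lambda r: r["state"])

print([r["state"] for r in by_sales])
print([r["state"] for r in by_state])
```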
A salesperson who is prospecting potential clients collected the following data:
Which of the following is an issue with this data?
Which of the following best describes the process of examining data for statistics and information about the data?
A survey asks participants to rate a company on a scale of one to ten. Which of the following best describes the rating variable?
Jenny wants to study the academic performance of undergraduate sophomores and wants to determine the average grade point average at different points during an academic year.
What best describes the data set she needs?
Under which of the following circumstances should the null hypothesis be accepted when α = 0.05?
A data analyst has been asked to create an ad-hoc sales report for the Chief Executive Officer (CEO).
Which of the following should be included in the report?
A database administrator is required to mask certain table columns containing PII in order to comply with the company privacy policy. Which of the following are the most likely types of information the administrator should mask? (Select two).
A database administrator needs to ensure only approved users can access specific database tables to perform financial functions. Which of the following is the best access control method for the administrator to use?
A data analyst received a large amount of third-party data that needs to be joined with in-house data files. After the data is joined, the analyst notices three columns all contain dates. Which of the following should the analyst do to maintain data consistency?
An analyst is working on a project for a director. During this process, the analyst pulled the data, created summarized tables and graphs with descriptions, created a report summary, and inserted all items into a report. After writing the report, which of the following would be the most appropriate next step?
A database consists of one fact table that is composed of multiple dimensions. Depending on the dimension, each one can be represented by a denormalized table or multiple normalized tables. This structure is an example of a:
An analyst is currently working on a ticket to revamp a company-wide dashboard that has been in use for five years. Which of the following should be the first step in the development process?
Which of the following types of analysis is used when comparing last week's sales to the previous week's sales?
A sales analyst needs to report how the sales team is performing to target. Which of the following files will be important in determining 2019 performance attainment?
An employer needs to maintain adequate office staffing during the winter and wants to track storm data. Which of the following data collection methods should the employer use?
Which of the following is a common data analytics tool that is also used as an interpreted, high-level, general-purpose programming language?
Which of the following would be used to store unstructured data from different sources?
A data set was recorded using multimedia technology. Which of the following is a necessary step on the way to interpretation?
Which of the following is the best approach to use to gain a general understanding of a data set?
John is working on an ELT process that sources data from six different source systems.
Looking at the source data, he finds that data about the same people exists in two of the six systems.
What does he have to make sure he checks for in his ELT process?
Choose the best answer.
A data set for sales per month includes the following data:
Which of the following cleaning and profiling methods should be applied to the data set?
A business unit made the following modification to the values in a table:
Which of the following data quality dimensions was applied in this scenario?
A data analyst has a set with more than 40,000 rows in the sample schema below:
The analyst would like to create one column that contains the customers’ birth dates. Which of the following data quality dimensions would BEST explain the reason for compilation?
Which of the following describes the use of a representative amount of data from a main repository?
Given the following tables:
Which of the following will be the dimensions from a FULL JOIN of the tables above?
Different people manually type a series of handwritten surveys into an online database. Which of the following issues will MOST likely arise with this data? (Choose two.)
An analyst computed a new variable of income per day in the household by multiplying the number of days worked by the number of people working in the household and the income earned per day. Which of the following is the correct name for this new variable?
A data analyst has been asked to derive a new variable labeled “Promotion_flag” based on the total quantity sold by each salesperson. Given the table below:
Which of the following functions would the analyst consider appropriate to flag “Yes” for every salesperson who has a number above 1,000,000 in the Quantity_sold column?
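The flagging rule described above amounts to a simple conditional. A minimal sketch: the function name is hypothetical, and returning "No" for everyone at or below the threshold is an assumption, since the question only specifies the "Yes" case.

```python
def promotion_flag(quantity_sold, threshold=1_000_000):
    """Return "Yes" when quantity_sold is above the threshold.

    Assumption: every other salesperson is flagged "No".
    """
    return "Yes" if quantity_sold > threshold else "No"


# Example quantities (made up for illustration).
flags = [promotion_flag(q) for q in (1_500_000, 1_000_000, 250_000)]
print(flags)
```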
Which of the following query optimization techniques involves examining only the data that is needed for a particular task?
Which of the following data types would a telephone number formatted as XXX-XXX-XXXX be considered?
A financial analyst is creating a daily billing report for a company. One night, the company's data warehouse did not update the data, which caused the data to be reported incorrectly the next day. Which of the following documentation elements should the analyst add to catch this error?
A data analyst must fulfill a request for information that is needed weekly and should be automatically emailed to a specific set of users. Which of the following types of reports should the analyst recommend?
A company's human resources department has asked a data analyst to categorize the income of all employees into five salary bands:
Which of the following types of functions would be the most appropriate to use?
Which of the following technologies would be best suited for creating a multiple linear regression model?
Which of the following explains why standardization of data field names is important to master data management concepts?
A data analyst needs to create a dashboard using the company's yearly revenue data sets. Which of the following would be the best way to plot the information to show the top-performing region?
Which one of the following would not normally be considered a summary statistic?
A data architect is designing a data solution for a retail clothing store chain. Each store has a database that tracks sales transactions. The data architect needs to create a summary table that will be used for a senior executive dashboard. The summary table should not contain duplicate store information. Which of the following should the data architect create?
Which of the following is a domain-specific language used in programming that is designed for managing data that is held in a relational data stream management system?
An analyst wants to create a historical data set for the past five years with each year in its own data set. Which of the following methods is the best way to create this historical data set?
An analyst is building a new dashboard for a user. After an initial conversation with the user, the analyst created a mock-up of the dashboard. Which of the following best explains why the analyst created the mock-up?
An analyst needs to know what data an organization possesses. Which of the following is the best document for the analyst to consult?