Halloween Special Sale - 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: spcl70

DA0-001 PDF

$33

$109.99

3 Months Free Update

  • Printable Format
  • Value of Money
  • 100% Pass Assurance
  • Verified Answers
  • Researched by Industry Experts
  • Based on Real Exams Scenarios
  • 100% Real Questions

DA0-001 PDF + Testing Engine

$52.8

$175.99

3 Months Free Update

  • Exam Name: CompTIA Data+ Certification Exam
  • Last Update: Oct 29, 2025
  • Questions and Answers: 363
  • Free Real Questions Demo
  • Recommended by Industry Experts
  • Best Economical Package
  • Immediate Access

DA0-001 Engine

$39.6

$131.99

3 Months Free Update

  • Best Testing Engine
  • One Click installation
  • Recommended by Teachers
  • Easy to use
  • 3 Modes of Learning
  • State of Art Technology
  • 100% Real Questions included

DA0-001 Practice Exam Questions with Answers CompTIA Data+ Certification Exam Certification

Question # 6

A data analyst needs to observe the relationship between two numeric variables and identify the clustering pattern as well as the outliers. Which of the following visualizations should the analyst use?

A.

Heat map

B.

Tree map

C.

Scatter plot

D.

Stacked chart

Full Access
Question # 7

An analyst in a consumer bank department wants to showcase the concentration of accounts opened in the United States by ZIP Code to describe the effectiveness of the bank's marketing campaigns. Which of the following would be the best way to visualize the data?

A.

A stacked chart

B.

A tree map

C.

A waterfall chart

D.

A geographic map

Full Access
Question # 8

The director of operations at a power company needs data to help identify where company resources should be allocated in order to monitor activity for outages and restoration of power in the entire state. Specifically, the director wants to see the following:

* County outages

* Status

* Overall trend of outages

INSTRUCTIONS:

Please, select each visualization to fit the appropriate space on the dashboard and choose an appropriate color scheme. Once you have selected all visualizations, please, select the appropriate titles and labels, if applicable. Titles and labels may be used more than once.

If at any time you would like to bring back the initial state of the simulation, please click the Reset All button.

DA0-001 question answer

Full Access
Question # 9

Given the diagram below:

DA0-001 question answer

Which of the following data schemas shown?

A.

Key-value pairs

B.

Online transactional processing

C.

Data Lake

D.

Relational database

Full Access
Question # 10

Given the table below:

DA0-001 question answer

Which of the following variables can be considered inconsistent, and how many distinct values should the variable have?

A.

Name, one

B.

Gender, two

C.

Level, three

D.

Code, four

E.

Region, five

Full Access
Question # 11

Which of the following is an example of a discrete variable?

A.

The temperature of a hot tub

B.

The height of a horse

C.

The time to complete a task

D.

The number of people in an office

Full Access
Question # 12

Which of the following activities occurs during the ETL process?

A.

Reviewing and addressing missing values

B.

Creating a dashboard

C.

Inserting a pivot table and pivot chart

D.

Multiplying unique data

Full Access
Question # 13

An analyst is working with a data set that lists individuals' first and last names in separate columns. Which of the following processes should the analyst use to combine the first and last names into a single spreadsheet cell?

A.

Transpose

B.

Blend

C.

Concatenate

D.

Merges

Full Access
Question # 14

Which of the following data types is best for representing count data?

A.

Discrete

B.

Referential

C.

Sequential

D.

Continuous

Full Access
Question # 15

Given the following grocery store orders:

DA0-001 question answer

If a query is made to the table with the following logic:

Order_Total > 132 OR (Order Total >= 25 AND Order_Total < 74)

Which of the following is the number of orders that will be returned by the query?

A.

Four

B.

Five

C.

Six

D.

Seven

Full Access
Question # 16

A database administrator is required to mask certain table columns containing Pll in order to comply with the company privacy policy. Which of the following are the most likely types of information the administrator should mask? (Select two).

A.

Government-issued ID

B.

Address

C.

Order ID

D.

Order date

E.

Customer ID

F.

Referral number

Full Access
Question # 17

A sales team wants visibility of current sales numbers, pipeline, and team performance. The team would also like to see calculations of individuals’ earned commissions and projected commissions based on sales, but they want that information to be kept confidential. Which of the following would be the BEST way to provide this visibility?

A.

Create a dashboard displaying a data refresh date so users know the current sales numbers and configure permissions to control access.

B.

Create a dashboard for sales numbers, pipeline, and team and individual performance for the management team.

C.

Create a dashboard with filters for the overall team, individuals, and management. Users can filter to see the data they want.

D.

Create a dashboard with views for team, individuals, and management. Configure permissions to control access.

Full Access
Question # 18

Which of the following differentiates a flat text file from other data types?

A.

Data is separated by a delimiter.

B.

Data is stored in defined rows.

C.

Data is defined with key-value pairs.

D.

Data is housed in a markup language.

Full Access
Question # 19

Given the following table:

DA0-001 question answer

Which of the following methods is the best way to describe the changes in the values in the table?

A.

Average

B.

Range

C.

Standard deviation

D.

Median

Full Access
Question # 20

During data cleansing, an analyst conducts measures of central tendency on a data set. Which of the following data is the analyst attempting to identify?

A.

Duplicate

B.

Missing

C.

Outlying

D.

Invalid

Full Access
Question # 21

A data analyst is setting up a data dashboard to monitor several ETL data streams to ensure that data is complete for later analysis. Which of the following audiences should the analyst target for this dashboard?

A.

Executives

B.

The management team

C.

Technical experts

D.

External vendors

Full Access
Question # 22

An organization wants to evaluate whether project activities are within the set projections and in line to meet the desired project targets. Which of the following types of analysis is best suited for this situation?

A.

Trend analysis

B.

Performance analysis

C.

Descriptive analysis

D.

Exploratory analysis

Full Access
Question # 23

An analyst needs to summarize the number of people in Chicago in 2022 using the following set of data:

DA0-001 question answer

Which of the following steps should the analyst use to provide results? (Select two).

A.

Aggregation

B.

Sorting

C.

Filtering

D.

Indexing

E.

Cleaning

F.

Replacing

Full Access
Question # 24

A dataset requires an analysis for investigating and discovering abnormalities. Which of the following best describes the nature of the exploratory analysis conducted?

A.

Summary of the data's main characteristics

B.

Best data tuning method

C.

Set of methods for cleaning the data

D.

Method of checking the quality of the data

Full Access
Question # 25

Which of the following technologies would be best suited for creating a multiple linear regression model?

A.

Microsoft Power Bl

B.

R

C.

SQL

D.

Tableau

Full Access
Question # 26

A data analyst has been asked to merge the tables below, first performing an INNER JOIN and then a LEFT JOIN:

DA0-001 question answer

Customer Table -

In-store Transactions –

DA0-001 question answer

Which of the following describes the number of rows of data that can be expected after performing both joins in the order stated, considering the customer table as the main table?

A.

INNER: 6 rows; LEFT: 9 rows

B.

INNER: 9 rows; LEFT: 6 rows

C.

INNER: 9 rows; LEFT: 15 rows

D.

INNER: 15 rows; LEFT: 9 rows

Full Access
Question # 27

A site reliability team wants to monitor the stability of their website. so they can proactively diagnose issues when they occur Which of the following deliverables would best suit their needs?

A.

A self-serve dashboard of website performance that updates in real time

B.

A weekly log report of site visits and user actions

C.

A portal that is refreshed daily and reports errors classified by type

D.

A daily summary email indicating website outages for the previous day

Full Access
Question # 28

A data analyst is attempting to understand how ice cream consumption is affected by different attributes. such as cost, temperature. and income level. Which of the following

regression analyses should the data analyst perform to understand this relationship?

A.

Logistic

B.

Ordinary least squares

C.

Cox

D.

Polynomial

Full Access
Question # 29

Which of the following statements would be used to append two tables that have the same number of columns?

A.

UNION ALL

B.

MERGE

C.

GROUP BY

D.

JOIN

Full Access
Question # 30

Which of the following best describes the law of large numbers?

A.

As a sample size decreases, its standard deviation gets closer to the average of the whole population.

B.

As a sample size grows, its mean gets closer to the average of the whole population

C.

As a sample size decreases, its mean gets closer to the average of the whole population.

D.

When a sample size doubles. the sample is indicative of the whole population.

Full Access
Question # 31

Emma is working in a data warehouse and finds a finance fact table links to an organization dimension, which in turn links to a currency dimension that not linked to the fact table.

What type of design pattern is the data warehouse using?

A.

Star.

B.

Sun.

C.

Snowflake.

D.

Comet.

Full Access
Question # 32

Which of the following is an example of a flat file?

A.

CSV file

B.

PDF file

C.

JSON file

D.

JPEG file

Full Access
Question # 33

An analyst has been tracking company intranet usage and has been asked to create a chat to show the most-used/most-clicked portions of a homepage that contains more than 30 links. Which of the following visualizations would BEST illustrate this information?

A.

Scatter plot

B.

Heat map

C.

Pie chart

D.

Infographic

Full Access
Question # 34

An e-commerce company recently tested a new website layout. The website was tested by a test group of customers, and an old website was presented to a control group. The table below shows the percentage of users in each group who made purchases on the websites:

DA0-001 question answer

Which of the following conclusions is accurate at a 95% confidence interval?

A.

In Germany, the increase in conversion from the new layout was not significant.

B.

In France, the increase in conversion from the new layout was not significant.

C.

In general, users who visit the new website are more likely to make a purchase.

D.

The new layout has the lowest conversion rates in the United Kingdom.

Full Access
Question # 35

Which of the following is the first step an analyst should perform upon receiving a business request for analysis?

A.

Determine the data needs and sources for analysis.

B.

Initiate the analysis for exploratory data analysis.

C.

Review the business questions to understand the scope.

D.

Finalize the methodology to solve the problem.

Full Access
Question # 36

Which of the following is the best variable formal to store a customer's age using the least possible amount of storage data?

A.

Int

B.

Float

C.

Char

D.

Double

Full Access
Question # 37

An analyst has received the requirements for an internal user dashboard. The analyst confirms the data sources and then creates a wireframe. Which of the following is the NEXT step the analyst should take in the dashboard creation process?

A.

Optimize the dashboard.

B.

Create subscriptions.

C.

Get stakeholder approval.

D.

Deploy to production.

Full Access
Question # 38

Which of the following would be the best way to identify multicollinear attributes in a data set?

A.

Correlation coefficient

B.

Chi-squared test

C.

Two-sample f-test

D.

Two-way ANOVA

Full Access
Question # 39

You have two databases tables that you would like to join together using a foreign key relationship.

What term best describes this action?

A.

Blending.

B.

Appending.

C.

Mixing.

D.

Merging.

Full Access
Question # 40

A data analyst needs to create a dashboard to help identify trends in the data sets. Which of the following is an appropriate consideration for dashboard development?

A.

Data sources and attributes

B.

Frequently asked questions

C.

A report from the data source

D.

A comparison of data sets

Full Access
Question # 41

A military commander would like to see the health scorecards of the troops daily and filter them based on gender and rank. Considering this data is PHI, which of the following would be the best way for the commander to view the information?

A.

An emailed report

B.

A password-protected dashboard

C.

A daily printout of a report

D.

A cloud-hosted spreadsheet

Full Access
Question # 42

A reporting analyst needs to create a report that refreshes automatically and is accessible to the entire sales organization. Which of the following tools is the most appropriate to use for this task?

A.

R

B.

Excel

C.

Tableau

D.

Python

Full Access
Question # 43

Which of following is a non-relational database?

A.

Neo4j

B.

SQLite

C.

MySQL

D.

PostgreSQL

Full Access
Question # 44

Which of the following is the best reason to use database views instead of tables?

A.

Views reduce the need for repetitive, complex data joins.

B.

Views allow for the storage of temporary data, whereas tables do not.

C.

Views allow for the joining of multiple data sources, whereas tables do not.

D.

Views can be used to restrict anonymous sensitive information.

Full Access
Question # 45

Which of the following best describes a business analytics tool with interactive visualization and business capabilities and an interface that is simple enough for end users to create their own reports and dashboards?

    Python

A.

R

B.

Microsoft Power Bl

C.

SAS

Full Access
Question # 46

A data analyst has a set with more than 40.000 rows in the sample schema below:

DA0-001 question answer

The analyst would like to create one column that contains the customers’ birth dates. Which of the following data quality dimensions would BEST explain the reason for compilation?

A.

Data accuracy

B.

Data completeness

C.

Data duplication

D.

Data integrity

Full Access
Question # 47

Which of the ing is the correct ion for a tab-delimited spre file?

A.

tap

B.

tar

C.

sv

D.

az

Full Access
Question # 48

Which of the following would a data analyst look for first if 100% participation is needed on survey results?

A.

Missing data

B.

Invalid data

C.

Redundant data

D.

Duplicate data

Full Access
Question # 49

An analyst wants to test the association between the number of doors in a car and the number of gears in the car. Which of the following is the best test to use?

A.

F-test

B.

Acceptance test

C.

Chi-squared test

D.

Z-test

Full Access
Question # 50

A junior web developer is developing a new application where users can upload short videos. The first task is to create a homepage that shows the headline "Upload Your Short Videos" and a clickable button that says "upload now".

Which of the following HTML commands would help the developer to complete the task successfully?

A.

< span >Upload Your Short Videos< /span >< button >upload now< /button >

B.

< p >Upload Your Short Videos< /p >< p >upload now< /p >

C.

< hl >Upload Your Short Videos< /h1 >< button >upload now< /button >

D.

< hl >Upload Your Short Videos< /h1 >< hl >upload now< /h1 >

Full Access
Question # 51

Given the following data set:

DA0-001 question answer

Which of the following is the best reason for cleansing the data?

A.

Duplicate data

B.

Imputed data

C.

Redundant data

D.

Corrupt data

Full Access
Question # 52

A JSON file is an example of:

A.

structured data.

B.

web data.

C.

machine data.

D.

processed data.

Full Access
Question # 53

Which of the following data types should an analyst use to provide the most flexibility when recording emails on a form?

A.

Alphanumeric

B.

Text

C.

Discrete

D.

Continuous

Full Access
Question # 54

A data analyst needs to create a data visualization that aids in un the cumulative impact of sequentially introduced values that are positive or negative. Which of the following

data visualization methods should the analyst use?

A.

A bubble chart

B.

A waterfall chart

C.

A scatter plot

D.

A line chart

Full Access
Question # 55

An analyst is reviewing the following data:

Car IDSpeed

123155

566436

564418

650567

546436

645638

Which of the following should the analyst include in the measures of central tendency for speed?

A.

Mode = 38 Range = 31 Mean = 42.5

B.

Range = 49 Max = 67 Min = 18

C.

Mode = 36 Max = 67 Min = 18

D.

Mode = 36 Median = 37 Mean = 41.5

Full Access
Question # 56

A research analyst collects ten data points from 1.000 specimens. The analyst will not need any additional data to complete the analysis and will not need to retrieve information by specifier. Which of the following is the best data structure for the analyst to use?

A.

NoSQL

B.

Flat file

C.

JSON

D.

Relational database

Full Access
Question # 57

Given the following graph:

DA0-001 question answer

Which of the following summary statements upholds integrity in data reporting?

A.

Sales are approximately equal for Product A and Product B across all strategies.

B.

Strategy 4 provides the best sales in comparison to other strategies.

C.

While Strategy 2 does not result in the highest sales of Product D, over all products it appears to be the most effective.

D.

Product D should be promoted more than the other products in all strategies.

Full Access
Question # 58

An analyst is updating a customer contacts database with information obtained from a survey of new customers. Which of the following data manipulation techniques should the analyst use?

A.

Join

B.

Append

C.

Transform

D.

Blend

Full Access
Question # 59

Which one of the following is a common data warehouse schema?

A.

Snowflake.

B.

Square.

C.

Spiral.

D.

Sphere.

Full Access
Question # 60

Each month an analyst needs to execute a data pull for the two prior months. Which of the following is the most efficient function for the analyst to use?

A.

Logical

B.

Date

C.

Aggregate

D.

System

Full Access
Question # 61

A healthcare data analyst notices that one data set in the column for BloodPressure contains several outliers that need to be replaced with meaningful values. Which of the following data manipulation techniques should the analyst use?

A.

Recode

B.

Impute

C.

Append

D.

Reduction

Full Access
Question # 62

Which of the following data types best describe 4Ac1? (Select two).

A.

Alphanumeric

B.

Symbolic

C.

Numeric

D.

Float

E.

Boolean

F.

String

Full Access
Question # 63

Which of the following data analysis tools increases the efficiency of data visualizations?

A.

SQL

B.

Microsoft Excel

C.

SAS

D.

RapidMiner

Full Access
Question # 64

A business intelligence team wants to create a new dashboard in order to solve a problem statement. Which of the following is the correct order of steps the team should take?

A.

Determine business needs, find data sources, validate the data, create a mock-up, and analyze the information.

B.

Find data sources, determine business needs, validate the data, create a mock-up. and analyze the information.

C.

Create a mock-up, validate the data, analyze the information, determine business needs, and find data sources.

D.

Validate the data, find data sources, analyze the information, and determine business needs.

Full Access
Question # 65

Which of the following programming languages are best suited for analysis and machine-learning applications? (Select two).

A.

Ruby

B.

Rust

C.

PHP

D.

Python

E.

Kotlin

F.

R

Full Access
Question # 66

Given the following report:

DA0-001 question answer

Which of the following components need to be added to ensure the report is point-in-time and static? (Choose two.)

A.

A control group for the phrases

B.

A summary of the KPIs

C.

Filter buttons for the status

D.

The date when the report was last accessed

E.

The time period the report covers

F.

The date on which the report was run

Full Access
Question # 67

Which of the following BEST describes standard deviation?

A.

A measure that is used to establish a relationship between two variables

B.

A measure of how data is distributed

C.

A measure of the amount of dispersion of a set of values

D.

A measure that is used to find the significant difference between variables

Full Access
Question # 68

Which one of the following values will appear first if they are sorted in descending order?

A.

Aaron.

B.

Molly.

C.

Xavier.

D.

Adam.

Full Access
Question # 69

A data analyst needs to perform a full outer join of a customer's orders using the tables below:

DA0-001 question answer

Which of the following is the mean of the order quantity?

A.

73.5

B.

76.5

C.

78.8

D.

81.5

Full Access
Question # 70

A cereal manufacturer wants to determine whether the sugar content of its cereal has increased over the years. Which of the following is the appropriate descriptive statistic to use?

A.

Frequency

B.

Percent change

C.

Variance

D.

Mean

Full Access
Question # 71

Which of the following can be used to translate data into another form so it can only be read by a user who has a key or a password?

A.

Data encryption.

B.

Data transmission.

C.

Data protection.

D.

Data masking.

Full Access
Question # 72

A company needs a report that provides executives an overview and regional managers with both an overview and specifics. Which of the following reporting elements will achieve these results?

A.

Observations and insights

B.

Live data feed

C.

Drill-down function

D.

Access permissions

Full Access
Question # 73

Which of the following best describe qualitative data? (Select two).

A.

Discrete

B.

Ordinal

C.

Batch

D.

Continuous

E.

Nominal

F.

Real-time

Full Access
Question # 74

Which of the following is the best reason for removing data outliers?

A.

Data varies significantly from others.

B.

Data is redundant in the table.

C.

Data is duplicated in the whole range.

D.

Data is missing from the table.

Full Access
Question # 75

Which one of the following in NOT a common data integration tool?

A.

XSS

B.

ELT

C.

ETL

D.

APIs

Full Access
Question # 76

The senior management team at a company receives a detailed sales report at the end of each quarter. The report is several pages long and includes data from dozens of offices across the country. The team wants a better way to get a quick snapshot of what is included in the report. Which of the following modifications would best meet this requirement?

A.

Modifying documentation elements to include reference data sources

B.

Modifying the font size and style so important data points are more visible

C.

Modifying the report to include a summary section with observations and insights

D.

Modifying the report layout so it is easier to follow and understand

Full Access
Question # 77

Given the customer table below:

DA0-001 question answer

Which of the following chart types is the most appropriate to represent the average spending of active customers vs. inactive customers?

A.

Pie chart

B.

Heat graph

C.

Scatter plot

D.

Line chart

Full Access
Question # 78

Given the information in the following tables:

DA0-001 question answer

Which of the following describes merging these tables to create a master file that includes all transactions for both online and in-store sales?

A.

Data audit

B.

Data completeness

C.

Data validation

D.

Data consolidation

Full Access
Question # 79

A data engineer is creating a database field to capture whether a customer likes vanilla ice cream. Which of the following data types is the best to capture this information?

A.

Integer

B.

Boolean

C.

Categorical

D.

Numeric

Full Access
Question # 80

Given the following report:

DA0-001 question answer

Which of the following components need to be added to ensure the report is point-in-time and static? (Select two).

A.

A control group for the phrases

B.

A summary of the KPIs

C.

Filter buttons for the status

D.

The date when the report was last accessed

E.

The time period lhe report covers

F.

The date on which the report was run

Full Access
Question # 81

Amanda needs to create a dashboard that will draw information from many other data sources and present it to business leaders.

Which one of the following tools is least likely to meet her needs?

A.

QuickSight.

B.

Tableau.

C.

Power BI.

D.

SPSS Modeler.

Full Access
Question # 82

Consider this dataset showing the retirement age of 11 people, in whole years:

54, 54, 54, 55, 56, 57, 57, 58, 58, 60, 60

This tables show a simple frequency distribution of the retirement age data.

DA0-001 question answer

A.

56

B.

55

C.

57

D.

54

Full Access
Question # 83

Which of the following is an example of a data-mining ETL tool?

A.

SSIS

B.

Stata

C.

SPSS

D.

Cognos

Full Access
Question # 84

Which of the following query optimization techniques involves examining only the data that is needed for a particular task?

A.

Making a temporary table

B.

Creating a flat file

C.

Indexing documents

D.

Creating an execution plan

Full Access
Question # 85

Which one of the following is a measure of dispersion?

A.

Variance.

B.

Mode.

C.

Median.

D.

Mean.

Full Access
Question # 86

Which of the following report types is most appropriate for a high-level, year-end report requested by a Chief Executive Officer?

A.

Dynamic

B.

Recurring

C.

Ad hoc

D.

Self-service

Full Access
Question # 87

Which of the following is a control measure for preventing a data breach?

A.

Data transmission

B.

Data attribution

C.

Data retention

D.

Data encryption

Full Access
Question # 88

Which of the following is a characteristic of a relational database?

A.

It utilizes key-value pairs.

B.

It has undefined fields.

C.

It is structured in nature.

D.

It uses minimal memory.

Full Access
Question # 89

Which of the following techniques should an analyst use to analyze a data set to get a snapshot of basic measures of central tendency?

A.

Forecasting

B.

Trend analysis

C.

Gap analysis

D.

Descriptive statistics

Full Access
Question # 90

Given the diagram below:

DA0-001 question answer

Which of the following types of sampling is depicted in the image?

A.

Stratified

B.

Random

C.

Cluster

D.

Systematic

Full Access
Question # 91

Which of the following is an example of PII?

A.

Age

B.

Name

C.

Ethnicity

D.

Gender

Full Access
Question # 92

Which of the following BEST describes the issue in which character values are mixed with integer values in a data set column?

A.

Duplicate data

B.

Missing data

C.

Data outliers

D.

Invalid data type

Full Access
Question # 93

Given the image below:

DA0-001 question answer

The data should be cleaned because of the presence of:

A.

outlier

B.

non-parametric data.

C.

multicollinearity.

D.

invalid data.

Full Access
Question # 94

A county in Illinois is conducting a survey to determine the mean annual income per household. The county is 427sq mi (2.65q km). Which of the following sampling methods would MOST likely result in a representative sample?

A.

A stratified phone survey of 100 people that is conducted between 2:00 p.m. and 3:00 p.m.

B.

A systematic survey that is sent to 100 single-family homes in the county

C.

Surveys sent to ten randomly selected homes within 5mi (8km) of the county’s office

D.

Surveys sent to 100 randomly selected homes that are reflective of the population

Full Access
Question # 95

Which of the following is the correct data type for text?

A.

Boolean

B.

String

C.

Integer

D.

Float

Full Access
Question # 96

A database administrator needs to increase performance on a large dimension table. Which of the following is the best way to accomplish this task?

A.

Sampling

B.

Partitioning

C.

Windowing

D.

Sorting

Full Access
Question # 97

A data analyst is designing a dashboard that will provide a story of sales and determine which site is providing the highest sales volume per customer. The analyst must choose an appropriate chart to include in the dashboard. The following data is available:

DA0-001 question answer

Which of the following types of charts should be considered?

A.

Include a line chart using the site and average sales per customer.

B.

Include a pie chart using the site and sales to average sales per customer.

C.

Include a scatter chart using sales volume and average sales per customer.

D.

Include a column chart using the site and sales to average sales per customer.

Full Access
Question # 98

Given the following table:

DA0-001 question answer

Which of the following describes the data quality issues with theagedata?

A.

Completeness

B.

Consistency

C.

Accuracy

D.

Manipulation

Full Access
Question # 99

When analyzing the values of two variables, you decide to convert both variables so they are on a scale of 0 to 1.

What term describes this action?

A.

Filtering.

B.

Normalization.

C.

Transposition.

D.

Aggregation.

Full Access
Question # 100

An analyst runs a report on a daily basis, and the number of datapoints must be validated before the data can be analyzed. The number of datapoints increases each day by approximately 20% of the total number from the day before. On a given day, the number of datapoints was 8,798. Which of the following should be the total number of datapoints on the next day?

A.

7,038

B.

9,600

C.

10,600

D.

10,800

Full Access
Question # 101

A business intelligence engineer needs to reduce the size of a data model for reporting purposes. The data set contains more than one million rows, and the table has a date-time column named Date. Which of the following should the analyst do to complete this task?

A.

Change the data type of the Date column to text.

B.

Trim the date.

C.

Round the hour of the Date column to the start of the hour.

D.

Split the Date column into two columns—time and date.

Full Access
Question # 102

The number of phone calls that the call center receives in a day is an example of:

A.

continuous data.

B.

categorical data.

C.

ordinal data.

D.

discrete data.

Full Access
Question # 103

A user imports a data file into the accounts payable system each day. On a regular basis. the field input is not what the system is expecting. so it results in an error for the row and a broken import process. To resolve the issue, the user opens the file, finds the error in the row, and manually corrects it before attempting the import again. The import sometimes breaks on subsequent attempts. though. Which of the following changes should be made to this process to reduce the number of errors?

A.

Delete all incorrect inputs and upload the corrected file.

B.

Have the user manually review the file for data completeness before loading it

C.

Create a data field to data type validator to run the file through prior to import.

D.

Spot-check the file prior to import to catch and correct field errors.

Full Access
Question # 104

A data analyst needs to create a dashboard using the company's yearly revenue data sets. Which of the following would be the best way to plot the information to show the top-performing region?

A.

A line chart

B.

A waterfall chart

C.

A heat map

D.

A stacked bar chart

Full Access
Question # 105

Five dogs have the following heights in millimeters:

300, 430, 170, 470, 600

Which of the following is the mean height for the five dogs?

A.

394mm

B.

405mm

C.

493mm

D.

504mm

Full Access
Question # 106

A table in a hospital database has a column for patient height in inches and a column for patient height in centimeters. This is an example of:

A.

dependent data.

B.

duplicate data.

C.

invalid data

D.

redundant data

Full Access
Question # 107

You are working with a dataset and need to swap the values in rows with those in columns.

What action do you need to perform?

A.

Recording

B.

Filtering.

C.

Aggregation.

D.

Transposition.

Full Access
Question # 108

A data set has the following values:

DA0-001 question answer

Which of the following is the best reason for cleansing the data?

A.

Invalid data

B.

Redundant data

C.

Data outliers

D.

Missing data

Full Access