New Year Special Sale - 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: spcl70

Practice Free DA0-001 CompTIA Data+ Certification Exam Exam Questions Answers With Explanation

We at Crack4sure are committed to giving students who are preparing for the CompTIA DA0-001 Exam the most current and reliable questions . To help people study, we've made some of our CompTIA Data+ Certification Exam exam materials available for free to everyone. You can take the Free DA0-001 Practice Test as many times as you want. The answers to the practice questions are given, and each answer is explained.

Question # 6

Which of the following report types is most appropriate for a high-level, year-end report requested by a Chief Executive Officer?

A.

Dynamic

B.

Recurring

C.

Ad hoc

D.

Self-service

Question # 7

Which of the following defines the policies and procedures for managing the master data?

A.

Data administration

B.

Data stewardship

C.

Data ownership

D.

Data governance

Question # 8

A data analyst wants to create "Income Categories" that would be calculated based on the existing variable "Income". The "Income Categories" would be as follows:

Income category 1: less than $1.

Income category 2: more than $1 and less than $20,000.

Income category 3: more than $20,001 and less than $40,000.

Income category 4: more than $40,001.

Which of the following data manipulation techniques should the data analyst use to create "Income Categories"?

A.

Data merge

B.

Derived variables

C.

Data blending

D.

Data append

Question # 9

Which of the following is a domain-specific language used in programming that is designed for managing data that is held in a relational data stream management system?

A.

SAS

B.

SQL

C.

Python

D.

R

Question # 10

An analyst reviews the following table:

DA0-001 question answer

Which of the following data types is represented in the values in the RefNo column?

A.

Numeric

B.

Real Number

C.

Currency

D.

Alphanumeric

Question # 11

An analyst wants to include a graph in a quarterly sales report that shows the comparison between two quantitative variables. Which of the following visual diagrams can the analyst use to most effectively represent this relationship?

A.

Bar graph

B.

Heat map

C.

Pie chart

D.

Histogram

Question # 12

A customer survey reveals 90% positive feedback. Which of the following statistical methods would be best to utilize to determine the reliability of a data set and predict how a larger sample of customers over the same time period might respond?

A.

Calculate a high variance on survey responses.

B.

Calculate the maximum range of the survey responses.

C.

Calculate a low standard deviation on survey responses.

D.

Remove any data more than 4 standard deviation from the mean.

Question # 13

An analyst is designing a dashboard that will provide a story of the sales and sales customer ratio. The following data is available:

DA0-001 question answer

Which of the following charts should the analyst consider including in the dashboard?

A.

A column chart with site and sales

B.

A line chart with site and sales

C.

A pie chart with site and sales

D.

A scatter chart with site and sales

Question # 14

Which of the following file formats is best suited to start exploratory analysis within statistical software?

A.

CSV

B.

XLSM

C.

XML

D.

JSON

Question # 15

An analyst has been asked to validate data quality. Which of the following are the BEST reasons to validate data for quality control purposes? (Choose two.)

A.

Retention

B.

Integrity

C.

Transmission

D.

Consistency

E.

Encryption

F.

Deletion

Question # 16

Which of the following types of analyses is best to use when tracking sales revenue against quarterly targets?

A.

Trend

B.

Performance

C.

Link

D.

Scope

Question # 17

Which of the ing is the correct ion for a tab-delimited spre file?

A.

tap

B.

tar

C.

sv

D.

az

Question # 18

Five dogs have the following heights in millimeters:

300, 430, 170, 470, 600

Which of the following is the mean height for the five dogs?

A.

394mm

B.

405mm

C.

493mm

D.

504mm

Question # 19

A reporting analyst is creating a dashboard that shows the year-over-year performance for a sales organization. Which of the following is the best visual for the analyst use to illustrate the organization's performance?

A.

Pie chart

B.

Scatter plot

C.

Heat map

D.

Line chart

Question # 20

Given the table below:

DA0-001 question answer

Which of the following variables can be considered inconsistent, and how many distinct values should the variable have?

A.

Name, one

B.

Gender, two

C.

Level, three

D.

Code, four

E.

Region, five

Question # 21

Which of the following is a characteristic of a relational database?

A.

It utilizes key-value pairs.

B.

It has undefined fields.

C.

It is structured in nature.

D.

It uses minimal memory.

Question # 22

A data analyst needs to create a weekly recurring report on sales performance and distribute it to all sales managers. Which of the following would be the BEST method to automate and ensure successful delivery for this task?

A.

Use scheduled report delivery.

B.

Implement subscription access delivery.

C.

Print out a copy.

D.

Upload the report to the server.

Question # 23

Amanda needs to create a dashboard that will draw information from many other data sources and present it to business leaders.

Which one of the following tools is least likely to meet her needs?

A.

QuickSight.

B.

Tableau.

C.

Power BI.

D.

SPSS Modeler.

Question # 24

Under which of the following circumstances should the null hypothesis be accepted when a = 0.05?

A.

When p is 0.00003

B.

When p is 0.001

C.

When p is 0.04

D.

When p is 0.06

Question # 25

A data analyst needs to present the results of an online marketing campaign to the marketing manager. The manager wants to see the most important KPIs and measure the return on marketing investment. Which of the following should the data analyst use to BEST communicate this information to the manager?

A.

A real-time monitor that allows the manager to view performance the day the campaign was launched

B.

A sell-service dashboard that allows the manager to look at the company’s annual budget performance

C.

A spreadsheet of the raw data from all marketing campaigns and channels

D.

A summary with statistics, conclusions, and recommendations from the data analyst

Question # 26

Which of the following types of analysis would be best for an analyst to use to examine the relationships between authors who cited other authors in a library of research papers?

A.

Linguistic analysis

B.

Trend analysis

C.

Link analysis

D.

Performance analysis

Question # 27

Different people manually type a series of handwritten surveys into an online database. Which of the following issues will MOST likely arise with this data? (Choose two.)

A.

Data accuracy

B.

Data constraints

C.

Data attribute limitations

D.

Data bias

E.

Data consistency

F.

Data manipulation

Question # 28

A data analyst who works for a government agency is required to obtain the average income of citizens. The list of citizens is given in the following table:

DA0-001 question answer

A value for one citizen's income is missing. Which of the following approaches should the data analyst take to solve this issue?

A.

Replace the missing value with the average of the rest of the unemployed citizens.

B.

Insert the value 0 into the field with the missing value.

C.

Impute the mean of the other citizens' incomes into the field with the missing value.

D.

Exclude employed citizens from the analysis.

Question # 29

Given the below:

DA0-001 question answer

Which of the following numbers represents a Type I error?

A.

1

B.

2

C.

3

D.

4

Question # 30

Given the diagram below:

DA0-001 question answer

Which of the following data schemas shown?

A.

Key-value pairs

B.

Online transactional processing

C.

Data Lake

D.

Relational database

Question # 31

An analyst modified a data set that had a number of issues. Given the original and modified versions:

DA0-001 question answer

Which of the following data manipulation techniques did the analyst use?

A.

Imputation

B.

Recoding

C.

Parsing

D.

Deriving

Question # 32

Given the customer table below:

DA0-001 question answer

Which of the following chart types is the most appropriate to represent the average spending of active customers vs. inactive customers?

A.

Pie chart

B.

Heat graph

C.

Scatter plot

D.

Line chart

Question # 33

A military commander would like to see the health scorecards of the troops daily and filter them based on gender and rank. Considering this data is PHI, which of the following would be the best way for the commander to view the information?

A.

An emailed report

B.

A password-protected dashboard

C.

A daily printout of a report

D.

A cloud-hosted spreadsheet

Question # 34

Which of the following is the most appropriate to consider when creating a schema of a central group broken into detailed subcategories?

A.

Relational

B.

Hierarchical

C.

Snowflake

D.

Star

Question # 35

An analyst notices changes in sales ratios when analyzing a quarterly report. Which of the following is the analyst conducting?

A.

A gap analysis

B.

A link analysis

C.

A trend analysis

D.

A statistical analysis

Question # 36

Which of the following tools would be best to use to calculate the interquartile range, median, mean, and standard deviation of a column in a table that has 5.000.000 rows?

A.

Microsoft Excel

B.

R

C.

Snowflake

D.

SQL

Question # 37

You are working with a dataset and want to change the names of categories that you used fordifferent types of books.

What term best describes this action?

A.

Recording.

B.

Summarizing

C.

Aggregating.

D.

Filtering.

Question # 38

Which one of the following values will appear first if they are sorted in descending order?

A.

Aaron.

B.

Molly.

C.

Xavier.

D.

Adam.

Question # 39

A healthcare data analyst notices that one data set in the column for BloodPressure contains several outliers that need to be replaced with meaningful values. Which of the following data manipulation techniques should the analyst use?

A.

Recode

B.

Impute

C.

Append

D.

Reduction

Question # 40

A marketing analytics team received customer transaction data from two different sources. The data is complete and accurate; however, the field names appear to be inconsistent. Given the following tables:

DA0-001 question answer

Which of the following is considered best practice if the team wants to consolidate the files and conduct further analysis?

A.

Standardize the field names.

B.

Recode the data values.

C.

Overwrite the field names in one of the tables.

D.

Edit the field names in the data dictionary.

Question # 41

An analyst wants to determine whether a relationship between an individual's age and voting preferences exists. Which of the following is the best statistical method for the analyst to use?

A.

P-value

B.

Chi-squared

C.

F-test

D.

Z-score

Question # 42

An organizational document governs role-based and group-based requirements. Which of the following data requirements should be used?

A.

Security requirements

B.

Storage requirements

C.

Access requirements

D.

Use requirements

Question # 43

A company notifies its employees that emails will be automatically moved to a cloud-based server in 180 days. Which of the following describes this concept?

A.

Data deletion

B.

Data processing

C.

Data retention

D.

Data constraints

Question # 44

A data analyst has been asked to derive a new variable labeled “Promotion_flag” based on the total quantity sold by each salesperson. Given the table below:

DA0-001 question answer

Which of the following functions would the analyst consider appropriate to flag “Yes” for every salesperson who has a number above 1,000,000 in the Quantity_sold column?

A.

Date

B.

Mathematical

C.

Logical

D.

Aggregate

Question # 45

An analyst is currently working on a ticket for revamping a company-wide dashboard that has been in use for five years. Which of the following should be the first step in the development process?

A.

Talk to the group that made the request to determine the desired goal.

B.

Make changes to a frequently used report that is already in production.

C.

Build an additional dashboard with fewer views that are tailored toward each specific team.

D.

Develop a more streanMined dashboard to roll out by the next delivery date.

Question # 46

Which one of the following is a common data warehouse schema?

A.

Snowflake.

B.

Square.

C.

Spiral.

D.

Sphere.

Question # 47

An e-commerce company recently tested a new website layout. The website was tested by a test group of customers, and an old website was presented to a control group. The table below shows the percentage of users in each group who made purchases on the websites:

DA0-001 question answer

Which of the following conclusions is accurate at a 95% confidence interval?

A.

In Germany, the increase in conversion from the new layout was not significant.

B.

In France, the increase in conversion from the new layout was not significant.

C.

In general, users who visit the new website are more likely to make a purchase.

D.

The new layout has the lowest conversion rates in the United Kingdom.

Question # 48

You have two databases tables that you would like to join together using a foreign key relationship.

What term best describes this action?

A.

Blending.

B.

Appending.

C.

Mixing.

D.

Merging.

Question # 49

Which of the following is an example of a discrete data type?

A.

8in (20cm)

B.

5 kids

C.

2.5mi (4km)

D.

10.7lbs (4.9kg)

Question # 50

Which of the following describes the use of a representative amount of data from a main repository?

A.

Observation

B.

Delta load

C.

Web scraping

D.

Sampling

Question # 51

Given the following data:

CustomerID

ItemBought

Date

Tre_234

Sofa

2022-09-08

216_Tre

Shoes

08/02/2021

215/Tre

Blanket

2021/06/20

045/Tre

Mug

12-26-2021

Tre-345

Lamp

31/08/2022

TREJD19

Bucket

2022'08/01

Which of the following best describes the main issue in the data set?

A.

Inconsistent data

B.

Data mismatch

C.

Invalid data

D.

Redundant data

Question # 52

An analyst is currently working on a ticket to revamp a company-wide dashboard that has been in use for five years. Which of the following should be the first step in the development process?

A.

Talk to the group that made the request to determine the desired goal.

B.

Make changes to a frequently used report that is already in production.

C.

Build an additional dashboard with fewer views tailored toward each specific team.

D.

Develop a more streamlined dashboard to roll out by the next delivery date.

Question # 53

Which of the following best describes a 95% confidence interval?

A.

There is a 95% probability that a sample is within one standard deviation of the mean.

B.

A stated range may contain 95% of the population mean, 95% of the time.

C.

A set of ranges contains the population mean with 95% certainty.

D.

A range contains 95% of the population mean.

Question # 54

Which of the following is used for calculations and pivot tables?

A.

IBM SPSS

B.

SAS

C.

Microsoft Excel

D.

Domo

Question # 55

Given the following data sample:

DA0-001 question answer

Which of the following best describes the data quality issue?

A.

Data outlier

B.

Consistent data

C.

Duplicate data

D.

Invalid data

Question # 56

Which of the following is the best approach to use to gain a general understanding of a data set?

A.

Descriptive statistics

B.

Basic projections

C.

Gap analysis

D.

Trend analysis

Question # 57

A client wants a new report that will be automatically emailed to all global sales teams on a weekly basis. Each sales team must be able to view the sales for its region and the combined sales for all regions. Which of the following would be the most efficient method for meeting the requirements?

A.

Creating a single report with a region filter

B.

Creating report distribution lists for the sales teams in each region

C.

Creating a unique copy of the report for each sales team region

D.

Creating a unique copy of the report for each recipient

Question # 58

Given the following data set:

DA0-001 question answer

Which of the following is the best reason for cleansing the data?

A.

Duplicate data

B.

Imputed data

C.

Redundant data

D.

Corrupt data

Question # 59

Given the following table:

DA0-001 question answer

Which of the following describes the data quality issues with theagedata?

A.

Completeness

B.

Consistency

C.

Accuracy

D.

Manipulation

Question # 60

Which of the following is the best reason for removing data outliers?

A.

Data varies significantly from others.

B.

Data is redundant in the table.

C.

Data is duplicated in the whole range.

D.

Data is missing from the table.

Question # 61

A data analyst has been asked to create a sales report that calculates the rolling 12-month average for sales. If the report will be published on November 1, 2020, which of the following months shouts the report cover?

A.

October 1, 2019 to October 31, 2020

B.

October 31, 2020 to November 1, 2021

C.

November 1, 2019 to October 31, 2020

D.

October 31, 2019 to October 31, 2020

Question # 62

A web developer wants to ensure that malicious users can't type SQL statements when they asked for input, like their username/userid.

Which of the following query optimization techniques would effectively prevent SQL Injection attacks?

A.

Indexing.

B.

Subset of records.

C.

Temporary table in the query set.

D.

Parametrization.

Question # 63

A data analyst is creating a report that will provide information about various regions, products, and time periods. Which of the following formats would be themost efficient way to deliver this report?

A.

A workbook with multiple tabs for each region

B.

A daily email with snapshots of regional summaries

C.

A static report with a different page for every filtered view

D.

A dashboard with filters at the top that the user can toggle

Question # 64

A JSON file is an example of:

A.

structured data.

B.

web data.

C.

machine data.

D.

processed data.

Question # 65

A data analyst was asked to create a visual representation of sales for the first quarter of 2020. Which of the following visualizations should be used when a time element is present?

A.

A bubble chart

B.

A line chart

C.

A scatter plot

D.

An infographic

Question # 66

An analyst is working with the income data of suburban families in the United States. The data set has a lot of outliers, and the analyst needs to provide a measure that represents the typical income. Which of the following would BEST fulfill the analyst’s goal?

A.

Median

B.

Mean

C.

Mode

D.

Standard deviation

Question # 67

An analyst for a concert venue is analyzing the number of tickets sold for a recent event. Which of the following types of data is the number of sold tickets an example of?

A.

Ordinal

B.

Continuous

C.

Nominal

D.

Discrete

Question # 68

An analyst is reporting on the average income for a county and is reviewing the following data:

DA0-001 question answer

Which of the following is the reason the analyst would need to cleanse the data in this data set?

A.

Data completeness

B.

Data outliers

C.

Duplicate data

D.

Missing values

Question # 69

A data analyst is reviewing SQL code and sees a query that uses terms such as MIN, SUM, and COUNT. Which of the following types of functions best describes these terms?

A.

Aggregate

B.

Logical

C.

Filtering

D.

System

Question # 70

Each month an analyst needs to execute a data pull for the two prior months. Which of the following is the most efficient function for the analyst to use?

A.

Logical

B.

Date

C.

Aggregate

D.

System

Question # 71

A customer list from a financial services company is shown below:

DA0-001 question answer

A data analyst wants to create a likely-to-buy score on a scale from 0 to 100, based on an average of the three numerical variables: number of credit cards, age, and income. Which of the following should the analyst do to the variables to ensure they all have the same weight in the score calculation?

A.

Recode the variables.

B.

Calculate the percentiles of the variables.

C.

Calculate the standard deviations of the variables.

D.

Normalize the variables.

Question # 72

Which of the following is an example of a flat file?

A.

CSV file

B.

PDF file

C.

JSON file

D.

JPEG file

Question # 73

Given the information in the following tables:

DA0-001 question answer

Which of the following describes merging these tables to create a master file that includes all transactions for both online and in-store sales?

A.

Data audit

B.

Data completeness

C.

Data validation

D.

Data consolidation

Question # 74

A data analyst needs to observe the relationship between two numeric variables and identify the clustering pattern as well as the outliers. Which of the following visualizations should the analyst use?

A.

Heat map

B.

Tree map

C.

Scatter plot

D.

Stacked chart

Question # 75

Which of the following query optimization techniques involves examining only the data that is needed for a particular task?

A.

Making a temporary table

B.

Creating a flat file

C.

Indexing documents

D.

Creating an execution plan

Question # 76

Which of the following will MOST likely be streamed live?

A.

Machine data

B.

Key-value pairs

C.

Delimited rows

D.

Flat files

Question # 77

Which of the following is the correct data type for text?

A.

Boolean

B.

String

C.

Integer

D.

Float

Question # 78

Which of the following is a common data analytics tool that is also used as an interpreted, high-level, general-purpose programming language?

A.

SAS

B.

Microsoft Power B1

C.

IBM SPSS

D.

Python

Question # 79

A data analyst is designing a dashboard that will provide a story of sales and determine which site is providing the highest sales volume per customer The analyst must choose an appropriate chart to include in the dashboard. The following data is available:

DA0-001 question answer

Which of the following types of charts should be considered?

A.

Include a line chart using the site and average sales per customer.

B.

Include a pie chart using the site and sales to average sales per customer.

C.

Include a scatter chart using sales volume and average sales per customer.

D.

Include a column chart using the site and sales to average sales per customer.

Question # 80

Which of the following are reasons to conduct data cleansing? (Select two).

A.

To perform web scraping

B.

To track KPls

C.

To improve accuracy

D.

To review data sets

E.

To increase the sample size

F.

To calculate trends

Question # 81

Which of the following is a common data analytics tool that is also used as an interpreted, high-level, general-purpose programming language?

A.

SAS

B.

Microsoft Power BI

C.

IBM SPSS

D.

Python

Question # 82

Which of the following best describes the law of large numbers?

A.

As a sample size decreases, its standard deviation gets closer to the average of the whole population.

B.

As a sample size grows, its mean gets closer to the average of the whole population

C.

As a sample size decreases, its mean gets closer to the average of the whole population.

D.

When a sample size doubles. the sample is indicative of the whole population.

Question # 83

A data analyst is working with a team to create a dashboard for a client who requires on-demand access. Which of the following is the best delivery method to support the clients’ requirement?

A.

Email

B.

Scheduled

C.

Subscription

D.

Static

Question # 84

A database consists of one fact table that is composed of multiple dimensions. Each dimension is represented by a denormalized table. This structure is an example of a:

A.

Non-relational schema

B.

Galaxy schema

C.

Snowflake schema

D.

Star schema

Question # 85

An analyst wants to include a graph in a quarterly sales report that shows the comparison between two quantitative variables. Which of the following visual diagrams can the analyst use to most effectively represent this relationship?

A.

Bar graph

B.

Heat map

C.

Pie chart

D.

Scatter plot

Question # 86

Which of the following types of data manipulation functions should a data analyst use to implement a YES/NO condition in a spreadsheet?

A.

Text

B.

Statistical

C.

Financial

D.

Logical

Question # 87

A dataset requires an analysis for investigating and discovering abnormalities. Which of the following best describes the nature of the exploratory analysis conducted?

A.

Summary of the data's main characteristics

B.

Best data tuning method

C.

Set of methods for cleaning the data

D.

Method of checking the quality of the data

Question # 88

A data analyst is developing a data dictionary that aligns with a company's data management processes and policies. Which of the following best describes what should be included in the data dictionary?

A.

Information containing the links to business data

B.

Information explaining the business methodologies

C.

Information containing definitions of the business data

D.

Information describing the data analysis phases

Question # 89

A company wants to know how its customers interact with an e-commerce website based on clicks over items. Which of the following is the primary requirement for this report?

A.

Data content

B.

Frequency

C.

Filtering

D.

Views

Question # 90

A data scientist wants to see which products make the most money and which products attract the most customer purchasing interest in their company.

Which of the following data manipulation techniques would he use to obtain this information?

A.

Data append

B.

Data blending

C.

Normalize data

D.

Data merge

Question # 91

Which of the following techniques is used to quantify data?

A.

Decoding

B.

Enumeration

C.

Coding

D.

Structure

Question # 92

A database consists of one fact table that is composed of multiple dimensions. Each dimension is represented by a denormalized table. This structure is an example of a:

A.

non-relational schema.

B.

galaxy schema.

C.

snowflake schema.

D.

star schema.

Question # 93

Five dogs have the following heights in millimeters:

300,430, 170, 470, 600

Which of the following is the standard deviation for the five dogs?

A.

147mm

B.

154mm

C.

394 mm

D.

21,704mm

Question # 94

A data analyst is attempting to understand how ice cream consumption is affected by different attributes. such as cost, temperature. and income level. Which of the following

regression analyses should the data analyst perform to understand this relationship?

A.

Logistic

B.

Ordinary least squares

C.

Cox

D.

Polynomial

Question # 95

A data set has the following values:

DA0-001 question answer

Which of the following is the best reason for cleansing the data?

A.

Invalid data

B.

Redundant data

C.

Data outliers

D.

Missing data

Question # 96

Which of the following query statements would be used when filtering data in a relational database management system? (Select two).

A.

ORDER BY

B.

HAVING

C.

WHERE

D.

SELECT

E.

INSERT

F.

GROUP BY

Question # 97

An analyst wants to create a historical data set for the past five years with each year in its own data set. Which of the following methods is the best way to create this historical data set?

A.

Data transpose

B.

Data concatenation

C.

Data append

D.

Data normalization

Question # 98

A data analyst has removed the outliers from a data set due to large variances. Which of the following central tendencies would be the best measure to use?

A.

Range

B.

Mean

C.

Mode

D.

Median

Question # 99

A data analyst is asked to create a sales report for the second-quarter 2020 board meeting, which will include a review of the business’s performance through the second quarter. The board meeting will be held on July 15, 2020, after the numbers are finalized. Which of the following report types should the data analyst create?

A.

Static

B.

Real-time

C.

Self-service

D.

Dynamic

Question # 100

A database administrator needs to increase performance on a large dimension table. Which of the following is the best way to accomplish this task?

A.

Sampling

B.

Partitioning

C.

Windowing

D.

Sorting

Question # 101

An analyst is designing a dashboard to determine which site has the highest percentage of new customers. The analyst must choose an appropriate chart to include in the dashboard. The following data is available:

DA0-001 question answer

Which of the following types of charts should be considered to best display the data?

A.

Include a bar chart using the site and the percentage of new customers data.

B.

Include a line chart using the site and the percentage of new customers data.

C.

Include a pie chart using the site and percentage of new custorners data.

D.

Include a scatter chart using the site and the percent of new customers data.

Question # 102

A data analyst must fulfill a request for information that is needed weekly and should be automatically emailed to a specific set of users. Which of the following types of reports should theanalyst recommend?

A.

A self-service report

B.

A research report

C.

An ad hoc report

D.

An operational report

Question # 103

An analyst needs to create an analytics dashboard for an employee intranet site to improve the search functionality, display relevant information, and maintain an updated FAQ page. Which of the following visualizations would best represent what employees are searching for?

A.

A word cloud

B.

A histogram

C.

A pie chart

D.

A scatter plot

Question # 104

The current date is July 14, 2020. A data analyst has been asked to create a report that shows the company's year-over-year Q2 2020 sales. Which of the following reports should the analyst compare?

A.

Q2 2020 and Q4 2019

B.

YTD 2020 and YTD 2019

C.

Q2 2020 and Q2 2019

D.

Q2 2020 and Q2 2021

Question # 105

A collections manager has a team calling customers who are past due on their accounts in an attempt to collect payments. The manager receives the call list in the form of a printed report that is generated by the accounting department at the beginning of each week. Consequently, the collections team calls some customers who have made payments in the time since the report was last printed. Which of the following reporting enhancements could the accounting department implement to best reduce the number of calls on current accounts?

A.

Modify the date range on the report

B.

Include a time stamp on the report.

C.

Increase the frequency of report generation.

D.

Add a report run date to the report.

Question # 106

A data analyst is building a closed won quarter-over-quarter report for the sales team. Which of the following will be needed to complete this request?

A.

The report create date and closed dollar amount

B.

The closed won quarter and the closed dollar amount

C.

The segment and closed dollar amount

D.

The closed won year and sales leader name

Question # 107

An analyst needs to join two tables of data together for analysis. All the names and cities in the first table should be joined with the corresponding ages in the second table, if applicable.

DA0-001 question answer

Which of the following is the correct join the analyst should complete. and how many total rows will be in one table?

A.

INNER JOIN, two rows

B.

LEFT JOIN. four rows

C.

RIGHT JOIN. five rows

D.

OUTER JOIN, seven rows

Question # 108

A database administrator is required to mask certain table columns containing PII in order to comply with the company privacy policy. Which of the following are the most likely types of information the administrator should mask? (Select two).

A.

Government-issued ID

B.

Address

C.

Order ID

D.

Order date

E.

Customer ID

F.

Referral number

Question # 109

A column is being used to store strings of variable lengths. Performance is a concern, so the column needs to use as little space as possible. Which of the following data types best meets these requirements?

A.

char

B.

nchar

C.

varchar

D.

nvarchar

Question # 110

An analyst has been tracking company intranet usage and has been asked to create a chat to show the most-used/most-clicked portions of a homepage that contains more than 30 links. Which of the following visualizations would BEST illustrate this information?

A.

Scatter plot

B.

Heat map

C.

Pie chart

D.

Infographic

Question # 111

A data analyst for a media company needs to determine the most popular movie genre. Given the table below:

DA0-001 question answer

Which of the following must be done to the Genre column before this task can be completed?

A.

Append

B.

Merge

C.

Concatenate

D.

Delimit

Question # 112

Which of the following differentiates a flat text file from other data types?

A.

Data is separated by a delimiter.

B.

Data is stored in defined rows.

C.

Data is defined with key-value pairs.

D.

Data is housed in a markup language.

Question # 113

Which of the following describes the method of sampling in which elements of data are selected randomly from each of the small subgroups within a population?

A.

Simple random

B.

Cluster

C.

Systematic

D.

Stratified

Question # 114

Which of the following roles is responsible for ensuring an organization's data quality, security, privacy, and regulatory compliance?

A.

Data owner.

B.

Data steward.

C.

Data custodian.

D.

Data processor.

Question # 115

Consider the following dataset which contains information about houses that are for sale:

DA0-001 question answer

Which of the following string manipulation commands will combine the address and region namecolumns to create a full address?

full_address------------------------- 85 Turner St, Northern Metropolitan 25 Bloomburg St, Northern Metropolitan 5 Charles St, Northern Metropolitan 40 Federation La, Northern Metropolitan 55a Park St, Northern Metropolitan

A.

SELECT CONCAT(address, ' , ' , regionname) AS full_address FROM melb LIMIT 5;

B.

SELECT CONCAT(address, '-' , regionname) AS full_address FROM melb LIMIT 5;

C.

SELECT CONCAT(regionname, ' , ' , address) AS full_address FROM melb LIMIT 5

D.

SELECT CONCAT(regionname, '-' , address) AS full_address FROM melb LIMIT 5;

Question # 116

Which of the following explains why standardization of data field names is important to master data management concepts?

A.

The quality of the data is consistent and improved.

B.

The data looks more appealing.

C.

The colors in data visualization are enhanced.

D.

The data is decompressed.

Question # 117

An analyst wants to extract data from a variety of sources and store the data in a cloud-based environment prior to cleaning. Which of the following integration techniques should the analyst use?

A.

ETL

B.

API

C.

SQL

D.

ELT

Question # 118

Which of the following programming languages are best suited for analysis and machine-learning applications? (Select two).

A.

Ruby

B.

Rust

C.

PHP

D.

Python

E.

Kotlin

F.

R

DA0-001 PDF

$33

$109.99

3 Months Free Update

  • Printable Format
  • Value of Money
  • 100% Pass Assurance
  • Verified Answers
  • Researched by Industry Experts
  • Based on Real Exams Scenarios
  • 100% Real Questions

DA0-001 PDF + Testing Engine

$52.8

$175.99

3 Months Free Update

  • Exam Name: CompTIA Data+ Certification Exam
  • Last Update: Dec 14, 2025
  • Questions and Answers: 396
  • Free Real Questions Demo
  • Recommended by Industry Experts
  • Best Economical Package
  • Immediate Access

DA0-001 Engine

$39.6

$131.99

3 Months Free Update

  • Best Testing Engine
  • One Click installation
  • Recommended by Teachers
  • Easy to use
  • 3 Modes of Learning
  • State of Art Technology
  • 100% Real Questions included