New Year Special Sale - 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: spcl70

Practice Free H13-723_V2.0 HCIP-Big Data Developer V2.0 Exam Questions Answers With Explanation

We at Crack4sure are committed to giving students who are preparing for the Huawei H13-723_V2.0 Exam the most current and reliable questions . To help people study, we've made some of our HCIP-Big Data Developer V2.0 exam materials available for free to everyone. You can take the Free H13-723_V2.0 Practice Test as many times as you want. The answers to the practice questions are given, and each answer is explained.

Question # 6

existKafka, as follows aboutProducerWhat is wrong with the statement of sending data? (multiple choice)

A.

ProducerResponsible for production data,Consumerresponsible for consuming data,ProducerandConsumerneed to be established betweenSocket

connect

B.

Producercan either send data toBroker, and can send data toConsumersuperior

C.

ProducerAs a message producer, you can directly write data toZooKeeper

D.

Producerconnect any normalBrokerInstances can produce data

Question # 7

Flumesupport real-time data collection source Which is the type?

Taildir Log4jJMSThrift 12.FusionInsight HDofHive, user-definedUDFcan andHiveBuilt-inUDFrepeat name, In this case, the user-definedUDF.

A.

True

B.

False

Question # 8

aboutFusonInsight HDofSpark, which of the following programming languages can be used to developSparkapplication? (multiple choice)

A.

C

B.

Scala

C.

Java

D.

Python

Question # 9

forHBase rowkeyThe design principles described below are correct?

(multiple choice)

A.

Attribute values with high access weight are placed inRowKeyFront

B.

Attribute values with high access weight are placed inRowkeysecond half

C.

Attribute values with good dispersion are placed inRowkeyfirst half

D.

Attribute values with good dispersion are placed inRowkeysecond half

Question # 10

existFusionInsight HDUnder the client, to runMapReducegenerated by the applicationjarBag. Which command can be executed?

A.

hdfs dfs

B.

yarn jar

C.

yarn application

Question # 11

HDFS"Write once, read many"__________File access model. Therefore, it is recommended that a document

A.

After creating,

B.

writing and closing,

C.

do not modify it again.

D.

right or wrong

Question # 12

FusionInsight HDproduct, aboutKafkaThe component statement is correct?

(multiple choice)

A.

KafkaofProducerWhen sending a message, you can specify which message is sent byConsumerConsumption.

B.

while creatingTopc, the number of replicas must not be greater than the currently survivingBrokerThe number of instances, otherwise createTopicWill will fail.

C.

KafkaAfter the installation is complete, you can no longer configure the data storage directory.

D.

KafkaMetadata information will be stored inZooKeepersuperior.

Question # 13

Fusionlnsigt HD one deployed inTomcatapplication on theHBaseservice, it is recommended to use in this scenario machine account.

A.

True

B.

False

Question # 14

Fusionlnsigt HDofHiveWhat distributed computing frameworks can components run on? (multiple choice)

A.

Spark

B.

Streaming

C.

MapReduce

D.

HDFS

Question # 15

FusionInsight HDmiddle,Oozieclient'sJava APIwill be called when the task is run

OozieClientWhich method of the class?

A.

suspend

B.

getJobInfo

C.

submit

D.

Run

Question # 16

forHiveWhich of the descriptions about normal tables and external tables is incorrect?

A.

Create a normal table by default

B.

The essence of the external table is that it will already existHDFSThe file path above is associated with the table

C.

When deleting a normal table, both metadata and data are deleted at the same time

D.

When deleting an external table, only the external table data is deleted without the metadata

Question # 17

existSparkIn application development, which of the following codes can correctly count words?

A.

val counts=textFile.map(line=>line.split(" ')).map(-rd=>(word,1)).reduceByKey(_+_)

B.

val counts=textFile.flatMap(line=>line.split(" ")).map(word=>(word,1)).reduceByKey(_+_)

C.

val counts=textFile.map(line=>line.split(" ")).map(word=>(word,1)).groupByKey()

D.

val counts=textFile.flatmap(line=>line.split(" ")).map(-rd=>(word,1)).reduceByKey()

Question # 18

aboutHiveandHadoopThe relationship of other components, the following description is wrong?

A.

Hivefinally store the data inHDFSmiddle

B.

HQLable to passMapReduceperform tasks

C.

HiveYesHadoopThe platform's data warehouse tools

D.

HiverightHBasehave strong dependencies

Question # 19

HDFSclient withNWhen a copy writes a file, which of the following is true about the writing process? (multiple choice)

A.

eachDataNodemaximum storage1copies.

B.

Support multiple users to write to the same file at the same time.

C.

The first copy of the data block is placed preferentially on the node where the client writing the data block is located.

D.

Copied file blocks all exist on the same rack by default.

Question # 20

FusionInsight HDin, aboutOozieWhich of the following descriptions is correct? (multiple choice)

A.

submitcommand to change the status of the job toPREP

B.

startcommand allowsPREPThe status of the job changes toRUNNING

C.

runcommand to change the job status toRUNNING

D.

killCommands can act onPREP/SUSPEND/RUNNINGstatus job

Question # 21

existFusionInsight HDproductSolrDuring application development, you canSolr Admin UIright

CollectionDo some verification. Below aboutSolr Admin UIIs the statement correct? (multiple choice)

A.

clickClouddownTree,CheckSolr CloudmiddleCollections, a configuration set, andlive_nodesequivalence information

B.

clickCloud, see eachCollectionunder eachShardofReplicadistribution and status

enter aCollectionofShard ReplicaofCore Overviewinterface, you can view theRepicaSow

C.

Quoting the actual number of documents, storage size and location information

D.

Solruser group,Solr adminrole andSuper groupof users have access toSolr Admin UI

Question # 22

FusionInsight HDin the systemHDFSdefaultBlock Sizehow many?

A.

32MB

B.

64MB

C.

128MB

D.

256MB

Question # 23

aboutKafkaThe characteristics of the following description are correct? (multiple choice)

A.

KafkaIt is a high-throughput, distributed, publish-subscribe-based messaging system

B.

KafkaPersistence of messages

C.

KafkaApplicable to offline and online message consumption scenarios

D.

Kafkaguarantee eachPartitionmessages in order

Question # 24

FlinkusecheckpointThe mechanism guarantees fault tolerance in the operation of the application.

A.

True

B.

False

Question # 25

about the followingHBaseofBloomFilterCharacter understanding, which statement is incorrect?

A.

used to filter data

B.

Used to optimize random performance

C.

will increase storage consumption

D.

Can accurately determine that a certain data does not exist

Question # 26

pass throughHBaseofcreateTableThe method creates a table, what parameters must be passed in?

A.

Table Name

B.

Table names and column families(or table name andfamily)

C.

table names and columns

D.

can be empty

Question # 27

aboutRedisFeatures, the following description is correct? (multiple choice)

A.

low latency

B.

Rich data structures

C.

Support data persistence

D.

supportSQLquery interface

Question # 28

Fusionlnsigt HD which of the following components can be used for data collection? (multiple choice)

A.

Flume

B.

Loader

C.

HBase

D.

MapReduce

Question # 29

Which of the following is notMapReducespecialty?

A.

easy to program

B.

good scalability

C.

real-time computing

D.

High fault tolerance

Question # 30

set the block storage size to128M,HDFSWhen the client is writing a file, when writing a100M

How much storage space does a file of this size actually occupy?

A.

128M

B.

100M

C.

64M

D.

50M

Question # 31

FusionInsight HDWhich of the following belong toOozieofMapReduce Actionconfiguration item? (multiple choice)

A.

name-node

B.

source

C.

mapred.mapper.class

D.

job-tracker

Question # 32

Suppose there is an application with10Tables, each table has tens of millions of records, and the number of fields is about20indivual.

Currently usingRedisto cache this10The data of a table, the design of its data structure, which of the following is the best design?

A.

usehashstructure, and a table uses ahash key, a row in the table records forhash keyone offield.

B.

usehashstructure, each row record of each table uses onehash key,hash keyoffieldThe fields of the corresponding table records,

and designKEYEach table adds a different prefix to distinguish it.

C.

usestringstructure, each field of each row record of each table uses oneKEY.

D.

usestringstructure, each row record of each table uses oneKEY,valueAll fields recorded for a row in a table? ?back value of .

Question # 33

forN(N>1) copies of stored documents,HDFSThe client initiates a read file request. If the read replica node fails, the

If the connection fails, it will not go to other replica nodes for reading.

A.

True

B.

False

Question # 34

Which of the following is notMapReducespecialty?

A.

easy to program

B.

good scalability

C.

real-time computing

D.

High fault tolerance

Question # 35

existFusionInsight HDmiddle,FlumeIn a configuration file, if there are multiplesource,butsourceThe name cannot be repeated.

A.

True

B.

False

Question # 36

existKafkamiddle,ProducerThis can be done by configuring the synchronization parameters (producer.type) to ensure that data is sent in order.

A.

True

B.

False

Question # 37

RedisofLISTData structure, suitable for which of the following scenarios? (multiple choice)

A.

Build queuing systems, such as message queues

B.

uniqOperations, such as getting the ranking value of all data in a certain period of time

C.

get latestNdata operations: for example, for a certain microblog, get the latest10comments

D.

Simulate stack operations

Question # 38

FusionInsight HDin, belonging toStreamingWhat are the roles of the service? (multiple choice)

A.

Nimbus

B.

Supervisor

C.

Broker

D.

quorumpeer

Question # 39

existHBaseWhile the application is running, the application can write data while creating the table.

A.

True

B.

False

Question # 40

FusionInsight HDofHiveIn the application, there are the following scenarios:? ? ?Storage files have higher? ?efficiency, and most

Minute? ?Only a part of the letter is involved in the file, this scenario is suitable for using a column file (ORC F??)storage.

A.

True

B.

False

Question # 41

Fusionlnsiht HDmiddle,Oozieclient'sJava APIwill be called when the task is runOozieClientWhich method of the class?

A.

suspend

B.

getJoblnfo

C.

submit

D.

run

Question # 42

Which of the following causesHDFSofNameNodeEntersafemode(safe read-only mode)? (multiple choice)

A.

Active and standbyNameNodeThe disk space where the metadata is located is insufficient.

B.

The number of missing blocks exceeds the threshold.

C.

Lost replicas exceed the threshold.

D.

The corrupted replica exceeds the threshold.

Question # 43

FusionInsight HDin, aboutHivethe data load function (viaHiveofLOADcommand to import data), the following description

What is wrong?

A.

available fromHDFSdirectly into thehivesurface

B.

available fromHiveServerThe local hard disk of the node is directly loaded into thehivesurface

C.

It can be directly loaded from the local hard disk of the node where the client is located.hivesurface

D.

HiveThe data loading process will not parse the specific file content, mainly the process of file transfer

Question # 44

useFusionInsight HDofHiveQuery data, when the amount of data in the intermediate results of the query is large, you can choose to compress the intermediate data

Structured data for better performance.

Which of the following areHiveRelevant parameters for intermediate result compression? (multiple choice)

A.

hive.exec.compress.intermediate

B.

hive.intermediate.compression.codec

C.

hive.exec.compress.output

twenty three

D.

hive.intermediate.compression.type

Question # 45

In useSolrWhen performing a full-text search, you canwtThe parameter specifies the response format of the query result.

aboutSolrThe response format of the query result, which of the following statements is wrong?

A.

supportCSVandJSON

B.

supportCSV,JSONandHTML

C.

supportCSV,JSONandXML

Question # 46

Spark Streamingavailable fromKafkaReceive data and perform calculations, and the calculation results can only be stored inHDFS, can no longer be written backKafka.

A.

True

B.

False

Question # 47

HDFSclient withNWhen a copy writes a file, which of the following is true about the writing process?

(multiple choice)

A.

eachDataNodemaximum storage1copy.

B.

Support multiple users to write to the same file at the same time.

C.

The first copy of the data block is placed preferentially on the node where the client writing the data block is located.

D.

Copied file blocks all exist on the same rack by default.

Question # 48

existMapReduceIn the development framework,InputFormatWhat is the function of the class?

A.

mergemapThe result of the stage has the samekeyvalue key-value pair

B.

put allmapThe results of the stages are merged to get the final result

C.

rightmapThe results of the stages are sorted and merged

D.

Divide the input data into one by one. and split this further into pairs, then asmapfunction input of

Question # 49

existHBaseWhich of the following interfaces or classes does not need to be involved in the implementation of business logic for writing data?

A.

Put

B.

HTable

C.

HBaseAdmin

D.

PutList

Question # 50

FusionInsight HDsystem, aboutHiveofJDB, CInterface type, which of the following descriptions is correct?

A.

In a network isolation environment,ZooKeeperofIPCan be configured as a businessIP, can also be configured to manageIP.

B.

HiveUnlike traditional databases, soJDB, CThe established connection does not need to be closed.

C.

In a secure cluster environment, establish andHiveBefore connecting, you need to use unified authentication to log in.

D.

useJDB, CsubmitSQL, you canSQLA semicolon ";" is added after the statement.

Question # 51

Hadoopin the systemYARNassigned toContainerThe memory size, which can be passed through the parameter

yarn.app.mapreduce.am.resource.mb(yarn.nodemanager.resource.memory-mb) to set.

( )

A.

True

B.

False

Question # 52

A large-scale production enterprise plans to transform its internal logistics data and sales data into big data. The design department gives

Based on the analysis of the data storage scheme, which of the following statements are correct?

(multiple choice)

A.

HBaseIt can store massive data and support dynamic expansion, which can fulfill the storage requirements of logistics and sales data.

B.

HbaseIt supports efficient random reading, and can complete real-time analysis and command of the logistics situation through reasonable design.

C.

Logistics data is very sparse,HBaseCan handle sparse data efficiently.

D.

HBaseIt can be built on ordinary commercial servers, and the construction cost is relatively low.

Question # 53

SparkWhat kind of parameter setting methods does the application have?

(multiple choice)

A.

In the application configuration filespark-defaults.confconfigure in

B.

When submitting your application, pass--confmake settings

C.

In the application code, passSparkContextofsetPropertymethod to set

D.

In the application code, passSparkConfobject to set

Question # 54

forHBase rowkeyThe design principles described below are correct? (multiple choice)

A.

Attribute values with high access weight are placed inRowKeyFront

B.

Attribute values with high access weight are placed inRowkeysecond half

C.

Attribute values with good dispersion are placed inRowkeyfirst half

D.

Attribute values with good dispersion are placed inRowkeysecond half

Question # 55

FusionInsight HDmiddle,OoziesupportHDFS Action.

A.

True

B.

False

Question # 56

FusionInsight HD a completeStreaming CQLThe application contains at least which of the following part? (multiple choice)

A.

Create an input stream

B.

Create output stream

C.

Business action statements (such asSELECT,INSERT,JIONWait)

D.

Apply commit statement

Question # 57

existKafkain, aboutConsumerWhich statement is correct? (multiple choice)

A.

ConsumerMainly for data consumption

B.

ConsumerNeed to specify when consuming dataTopic

C.

multipleConsumerconfigure the samegroup.idwhen theseConsumerConsumed data will not be duplicated

D.

newConsumerAPIConsumption data does not need to be specifiedZooKeeperaddress

Question # 58

FusionInsigt HD which of the following components can be used for data collection? (multiple choice)

A.

Flume

B.

Loader

C.

HBase

D.

MapReduce

Question # 59

due toSparkis a memory-based computing engine, therefore, aSparsThe amount of data that the application can handle cannot exceed the amount allocated for this

indivualSparkThe total memory of the application.

A.

True

B.

False

Question # 60

Kerberossafety certificate.

A.

True

B.

False

Question # 61

YARNIn the service, if you want to give the queueQueueAset capacity to30%, which parameter should be configured?

A.

yarm.scheduler.capacity.rot.QueueA. user-limit-factor

B.

yam.scheduler.capacity.root.QueueA. minimum-userlimit-percent

C.

yarm.scheduler.capacity.root.QueueA. capacity yarm.scheduler.capacity.rot.QueueA

Question # 62

Which of the following is notMapReducespecialty?

A.

Easy to program.

B.

Good scalability

C.

Calculate in real time

D.

High fault tolerance

Question # 63

fusioninsight, an application to chargeKerberosSecurity certification, which of the following options are

must be used? (multiple choice)

A.

krb5.confconfiguration file

B.

existFusionInsight ManagerUser account applied on

C.

user accountkeytabdocument

D.

jaas.confconfiguration file

Question # 64

FSpark Streamingapplication, in aJVM, there can only be one at a time

StreamingContextin active state

A.

True

B.

False

Question # 65

existHBaseWhich of the following interfaces or classes does not need to be involved in the implementation of business logic for writing data?

A.

Put

B.

HTable

C.

HBaseAdmin

D.

Puttist

Question # 66

forHBase rowkeyThe design principles described below are correct? (multiple choice)

A.

Attribute values with high port access weights are placed inRowKeyFront

B.

Attribute values with high port access weights are placed inRowkeysecond half

C.

The attribute value with good dispersion is placed inRowkeyfirst half

D.

The attribute value with good dispersion is placed inRowkeysecond half

Question # 67

forFusionInsight HDplatformHBasecomponent, adding a secondary index needs to define the secondary index Which properties? (multiple choice)

A.

index name

B.

index column

C.

Index column type (and index column length)

D.

the name of the column family to which the indexed column belongs,

Question # 68

Below aboutZooKeeperThe statement is wrong ().

A.

If ZooKeeperIf there is an interruption in the process of synchronizing the message, after the failure is recovered, the transmission status before the failure can be adjusted according to the the state continues to synchronize, that is, support for resuming transmission from a breakpoint.

B.

ZooKeeperUse a custom atomic message protocol to ensure the consistency of node data in the entire system.

C.

ZooKeeperThe cluster is elected at startupLeaderRole.

D.

LeaderAfter a node receives a data change request, it first writes to disk and then writes to memory.

Question # 69

Rcboft0system, abouteWhich of the provided file formats is not in column format?

A.

ORC(High-performance column store)

B.

Parquetcolumn store

C.

RCFileRow-column storage

D.

TextFile

Question # 70

aboutKafkaofProducer, is the following statement correct? (multiple choice)

A.

Produceris a message producer

B.

ProducerProduction data needs to be specifiedTopic

C.

Can play multiple at the same timeProducerprocess to the sameTopicsend data

D.

ProducerNeed to connect first when producing dataZooKeeper, and then connectBroker

Question # 71

which come downRedisIs the command an atomic operation?

A.

INCR(increment the stored digital value by one)

B.

HINCRBY(for the hash tablekeydomain infieldvalue plus incrementincrement)

C.

LPUSH(for linked listspushoperate)

D.

All of the above

Question # 72
A.

The port preferentially downloads data from the nearest DataNode.

B.

The port file data is first returned from the DataNode to the NameNode, and then downloaded from the NameNode to the client.

C.

If the client and the connected DataNode fail while reading, the client will abandon the failure

node, to connect to the node where its replica is located.

D.

The port supports multiple clients reading the same file data from the DataNode at the same time.

Question # 73

Which of the following causesHDFSofNameNodeEntersafemode(safe read-only mode)?

(multiple choice)

A.

Active and standbyNameNodeThe disk space where the metadata is located is insufficient.

B.

The number of missing blocks on the port exceeds the threshold.

C.

The missing copies of the port exceed the threshold.

D.

The corrupted copy exceeds the threshold.

Question # 74

FusionInsight HDsystem, aboutHiveofJDB, CInterface type, which of the following descriptions is correct of?

A.

In a network isolation environment,ZooKeeperofIPCan be configured as a businessIP, can also be configured to manageIP.

B.

hiveUnlike traditional databases, soJDB, CThe established connection does not need to be closed.

C.

In a secure cluster environment, establish andHiveBefore connecting, you need to use unified authentication to log in.

D.

useJDB, CsubmitSQL, you canSQLPut a semicolon after the statement";"

Question # 75

Regarding the Hive service of the FusionInsight HD platform and its WebHCat development interface, the following description does not

the correct one is

A.

Support for REST-based query requests

B.

The returned data format of WebHCat is XML

C.

WebHCat provides external services based on HTTP and HTTPS protocols

D.

You can create tables, query and other operations through WebHCat

Question # 76

existSparkIn application development, which of the following codes can correctly count words?

A.

rval counts = textFile.map(line => line.split("")).map(word =>(word,

1)).reduceByKey(+)

B.

val counts = textFile.flatMap(line => line.split(")).map(word =>(word,

1)).reduceByKey(+)

C.

vrval counts = textFie.map(line => line.split(")).map(word =>(word,

1)).groupByKey( )

D.

vrval counts = textFile.flatMap(line => line.split("")).map(word =>(word,

1)).groupByKey( )

Question # 77

existHBaseWhile the application is running, the application can write data while creating the table.

A.

True

B.

False

Question # 78

In useSolrWhen performing a full-text search, you canwtThe parameter specifies the response format of the query result. close

AtSolrThe response format of the query result, which of the following statements is wrong?

A.

supportCSVandJSON

B.

supportCSV,JSONandHTML

C.

supportCSV,JSONandXML

Question # 79

HBasedata fileHFileone ofKeyValueWhat information does the format contain?

(multiple choice)

A.

Key

B.

Value

C.

timeStamp

D.

KeyType

Question # 80

existKafkamiddle,ProducerThis can be done by configuring the synchronization parameters (producer.type), to ensure that the data press

A.

sent in order.

B.

True

C.

False

Question # 81

FHumeofproperties.propertiesMultiple configurations can be configured in the configuration filechannelto transmit data.

A.

True

B.

False

Question # 82

FusionInsight HDin, aboutOoziejob management for suspended jobs

Which of the following operations? (multiple choice)

A.

mouthkill

B.

mouthrun

C.

mouthresume(wake)

D.

mouthsubmit

Question # 83

existHBaseIn application development, when a table'sRowkeyRange and distribution are known, pre-score is recommendedregion,

Please call the following code (fragment) to pre-sort a tableregionAfter this table will create severalregion?

//Create a pre-partitionregionthe tablebyte??I splits = new byte?4];splits?0?=Bytes.toBytes

("A");splits?1?=Bytes.toBytes("H");splits?2?=Bytes.toBytes("O");splits

?3?=Bytes.toBytes("U");admin.createTable(htd,splits);

A.

3

B.

4

C.

5

D.

6

Question # 84

FusionInsight HDin, belonging toStreamingWhat are the roles of the service? (multiple choice)

A.

Nimbus

B.

Supervisor

C.

Broker

D.

quorumpeer

Question # 85

Fusioninsight HDin useStreamingofA, CKWhich of the following statements is true

of? (multiple choice)

A.

enabledAckerLater,StreamingIt will identify the failure to sendTupleAnd automatically resend, no need for human intervention pre.

B.

AckerThe message that will identify the processing timeout or processing failure isfail.

C.

fromspoutstarted, formedTupleFailure of any link in the tree will mark the entire tree as failed.

D.

The application needs to be inspoutoffail() interface method to implement message resend logic.

Question # 86

Spark StreamingAs a micro-batch stream processing engine, it converts the data of each time slice into OneRDDa partition in the

A.

True

B.

False

Question # 87

HDFSmiddleNameNodeUsed to store metadata for the file system.

A.

True

B.

False

Question # 88

existSpark, which of the following statements about broadcast variables is correct? (multiple choice)

A.

mouth broadcast variable in eachTasksave a copy

B.

mouth broadcast variable in eachExecutorsave a copy

C.

Port broadcast variables are read-only

D.

port broadcast variable viaSparkContext.broadcast) method to generate

Question # 89

YarmWhich role is to manage individual node resources (CPU/Memory)of?

A.

NodeManager

B.

ResourceManager

C.

DataNode

D.

NameNode

Question # 90

aboutKafkaInsufficient disk capacity alarm, which of the following analysis is incorrect for the possible reasons?

A.

The disk configuration (such as the number of disks, disk size, etc.) used to store Kafka data cannot meet the current industry requirements.

service data traffic, causing the disk usage to reach the upper limit

B.

The data storage time is configured too long, and the accumulated data reaches the upper limit of the disk usage.

C.

Unreasonable business planning results in uneven data distribution and some disks reach the upper limit of usage

D.

Broker node failure caused

Question # 91

Fusionlnsigt HD which of the following components can be used for data collection? (multiple choice)

A.

Flume

B.

Loader

C.

HBase

D.

MapReduce

Question # 92

existFusionInsight HD where can I viewMapReduceThe result of running the application?

A.

FusionInsight ManagerofWebinterface

B.

Loader Webinterface

C.

Solr Webinterface

D.

Yarn Webinterface

Question # 93

aboutFusionInsight HDplatformHiveservice, itsWebHCatDevelopment interface, the following description does not the correct one is?

A.

Support based onRESTquery request

B.

WebHCatThe return data format isXML

C.

WebHCatbased onHTTPandHTTPSAgreement to provide services to the outside world

D.

able to passWebHCatCreate tables, query, etc.

Question # 94

for running onMapRestoeThe application on the platform, on which this application is basedjarpackage is placed where?

A.

HIVE

B.

HBASE

C.

HDFS

D.

DB

Question # 95

Tonganeng's societyHDofe, user-definedUDFcan andHmachine built-inUDFclass name,

In this case, the user-definedUDF.

A.

True

B.

False

Question # 96

forSpakstreamingapplication, in aJVM, there can only be one at a time

Streamingcontextis active.

A.

True

B.

False

Question # 97

solris a high-performance, basedLucenefull-text search service.SolrxrightLuceneto expand,

Loss of fruit supportSolrCloudmodel.

A.

True

B.

False

Question # 98

existFusionInsight HDproduct,KafkaWhat is the role name of the service?

A.

Producer

B.

Broker

C.

Consumer

D.

ZooKeeper

Question # 99

existSparkmiddle,SparkSQLis an independent module that does not depend onSparkCoretstandtasQaon.eaR. operation, etc.

A.

True

B.

False

Question # 100

HDFSclient withNcopy toHDFSWhen writing a file, if one of the copies fails to write, all

If there is a copy, it will return a failure.

A.

True

B.

False

Question # 101

FusionInsight Managerinterface, when receivedKafkaInsufficient disk capacity alarm, and the alarm's

When the cause has been ruled out for the hard disk hardware failure, the system administrator needs to consider expanding the capacity to solve this problem.

A.

True

B.

False

Question # 102

existMapReduceIn application development of , which of the followingJAVA, Class responsible for managing and running a computational task?

A.

lob

B.

Context

C.

FileSystem

D.

Configuration

Question # 103

FusionInsight HDin, belonging toStreamingWhat are the methods of data distribution? (multiple choice)

A.

Shuffle Grouping

B.

Field Grouping

C.

Local Grouping

D.

Direct Grouping

Question # 104

Fusioninsight HDWhat are the supported ways to submitOozieOperation? (multiple choice)

A.

ShellOrder

B.

Java API

C.

Hue UI

D.

Rest API

Question # 105

Suppose there is an application that needs to be accessed frequentlyOracleThe user table in the database, in order to improve performance, introduceRedisto cache

account information.

For this scene,RedisWhich of the following is the best data structure choice for ?

A.

list

B.

set

C.

hash

D.

sorted set

Question # 106

aboutKafkaofPeoducer, is the following statement correct? (multiple choice)

A.

Produceris a message producer

B.

ProducerProduction data needs to be specifiedTopic

C.

Can play multiple at the same timeProducerprocess to the sameTopicsend data

D.

ProducerNeed to connect first when producing dataZooKeeper, and then connectBroker

Question # 107

Which of the following business applications is notHiveApplicable scenarios?

A.

Real-time real-time online data analysis

B.

Data mining (user behavior analysis, interest division, regional display)

C.

Data Aggregation (Daily/Weekly user clicks, click ranking)

D.

Non-real-time analysis (log analysis, statistical analysis)

Question # 108

about the followingHDFSWhat is the correct process for the client to read the file? (multiple choice)

A.

priority from the nearestDataNodeDownload data.

B.

file data fromDataNodeback toNameNode, then fromNameNodeDown to the client.

C.

If the client and the connectedDataNodeIf there is a failure in reading, then the client will give up the applause node and connect the node where other replicas are located.

D.

Supports multiple clients fromDataNoderead the same file data.

Question # 109

aboutStreamingdisaster recovery capability, which of the following statements is correct? (multiple choice)

A.

SupervisorAfter the process exits, it can beNimbusAutomatically identify and pull up without affecting running business.

B.

WorkerCan be used after abnormal exitSupervisorPulls up automatically without manual intervention.

C.

When a node fails, the tasks on the node will be reassigned to other normal nodes without manual intervention.

D.

NimbusAfter the failure, the backupNimbusIt will take over automatically without affecting the running business.

Question # 110

HiveWhich of the following table types are supported? (multiple choice)

A.

Partition Table

B.

bucket table

C.

Tilt table

D.

partition+bucket table

Question # 111

Fusionlnsigt HDWhat distributed computing frameworks do big data platforms provide? (multiple choice)

A.

HBase

B.

Spark

C.

MapReduce

D.

Streaming

Question # 112

existKafkamiddle,ProducerThis can be done by configuring the synchronization parameters (producer.type) to ensure that data is sent in order.

A.

True

B.

False

Question # 113

existFusionInsight HDmiddle,SparkBy defaultYARNAs a cluster resource management system.

A.

True

B.

False

Question # 114

existStreamingin application development,BoltUse which of the following interfaces to sendTuple?

A.

to emit

B.

execute

C.

open

D.

nextTuple

Question # 115

FusionInsight HD a completeStreaming CQLWhich of the following parts does the application contain at least? (multiple choice)

A.

Create an input stream

B.

Create output stream

C.

Business action statements (such asSELECT,INSERT,JIONWait)

D.

Apply commit statement

Question # 116

SparkWhat kind of parameter setting methods does the application have? (multiple choice)

A.

in the application configuration filespark-defaults.confconfigure in

B.

When submitting your application, by-confmake settings

C.

In the application code, passSparkContextofsetPropertymethod to set

D.

In the application code, passSparkConfobject to set

Question # 117

existHBaseIn table design, it is common to multiple businessesrowkey, what are the possible countermeasures for situations that lead to data access contradictions?

slightly? (multiple choice)

A.

compromise method (rowkeyDesign takes into accountAandBbusiness needs)

B.

Redundancy method (use space for time factory efficiency)

C.

Create secondary index

D.

pre-scoreregion

Question # 118

existFusionInsight HDclient, executeskinit{account}command is to getKDCwhich content?

A.

Krb5.conf

B.

TGT

C.

ST

D.

jaas.conf

Question # 119

RDDasSparkThe core object, which has the following characteristics? (multiple choice)

A.

read only

B.

partition

C.

fault tolerance

D.

efficient

Question # 120

FusionInsight HDsystem, aboutHiveofJDB, CInterface type, which of the following descriptions is correct?

A.

In a network isolation environment,ZooKeeperofIPCan be configured as a businessIP, can also be configured to manageIP.

B.

HiveUnlike traditional databases, soJDB, CThe established connection does not need to be closed.

C.

In a secure cluster environment, establish andHiveBefore connecting, you need to use unified authentication to log in.

D.

useJDB, CsubmitSQL, you canSQLA semicolon ";" is added after the statement.

Question # 121

Suppose there is an application that needs to be accessed frequentlyOracleThe user table in the database, in order to improve performance, introduce

Redisto cache user information. For this scene,RedisWhich of the following is the best data structure choice for ?

A.

List

B.

Set

C.

Hash

D.

Sorted set

Question # 122

FusionInsight HDmiddle,StreamingWhat are the characteristics of? (multiple choice)

A.

stream processing

B.

high performance

C.

Scalable

D.

High fault tolerance

Question # 123

FusionInsight HD in which ways can you viewOozieDebug results of the job? (multiple choice)

A.

CheckOozieofJava APIWhether the returned result is as expected

B.

pass throughHueofworkflowDashboard to view program running results

C.

pass throughHue's file browser to see ifHDFSThe specified directory produces the expected file

D.

Oozieself-containedWebThe interface can also view the job execution results

Question # 124

Set the block storage size to128M,HDFSWhen the client is writing a file, when writing a100Msize file, real

How much storage space does it take up?

A.

128M

B.

100M

C.

64M

D.

50M

Question # 125

FusionInsight HDin, useStreamingThe command? way to submitexample.jarmiddleom huawei example

WrodCounttask, task name iswcTeat, is the following execution command correct?

A.

storm jar example jar wcTest om huawei example WordCount

B.

storm jar example jar wcTest WrodCount

C.

storm jar example jar om huawei example WordCount wcTest

D.

storm jar example jar WorkCount wcTest

Question # 126

passing throughFusionindight ManagerWhen the interface creates a user, you want the user to have access to allTopichave

Read and write permissions, but not createTopicpermissions should be addedkafkaWhich group of services?

A.

KafkaadminGroup

B.

KakfaGroup

C.

kafkasuperuserGroup

Question # 127

Spark Streamingavailable fromKafkaReceive data and perform calculations, and the calculation results can only be stored inHDFS, can no longer be written backKafka.

A.

True

B.

False

Question # 128

Which of the following options isYamcore functionality?

A.

storage

B.

Resource management

C.

search

D.

data transmission

Question # 129

Fusion Insight HDmiddle,StreamingWhich of the following scenarios is applicable? (multiple choice)

A.

Streaming data monitoring

B.

Real-time visit statistics of the website

C.

Offline log analysis

D.

Traffic flow analysis

Question # 130

FusionInsight HDin, aboutOoze, which of the following operations can be performed on a suspended job? (many

select)

A.

kill

B.

run

C.

resume

D.

submit

Question # 131

existSpark, assuminglinesIs anDStreamobject,filterStatements can be filtered out80%data for the following two

The correct statement is:

X: lines.filter(…).groupByKey(…)

Y: lines.groupByKey(…).filter(…)

A.

XCompareYhigher performance

B.

XCompareYlower performance

C.

XandYsame as performance

D.

UncertaintyXandYdifference in performance

Question # 132

HBaseFilters can set column names or column values as filter conditions, and support the use of multiple filters at the same time.

A.

True

B.

False

Question # 133

What is wrong?

A.

available fromHDFSdirectly into thehivesurface

B.

available fromHiveServerThe local hard disk of the node is directly loaded into thehivesurface

C.

It can be directly loaded from the local hard disk of the node where the client is located.hivesurface

D.

HiveThe data loading process will not parse the specific file content, mainly the process of file transfer

Question # 134

Fusioninsight HDmedium, applicablestreamingSubmit by command lineexample.jarmiddle

om.huawei.example.wordcounttask, task name iswcTest, is the following execution command correct?

A.

Storm.jar example.jar wcTest om.huawei..example.wordcount

B.

Storm.jar example.jar wcTest wordcount

C.

Storm.jar example.jar om.huawei..example.wordcount wcTest

D.

Storm.jar example.jar wordcount wcTest

Question # 135

when aMapReduceWhen the application is executed, which of the following actions ismapoccurred before the stage?

A.

split

B.

combine

C.

partition

D.

sort

Question # 136

existFusionInsight HDmiddle,SparkBy defaultYARNAs a cluster resource management system.

A.

True

B.

False

Question # 137

when aMapReduceWhen the application is executed, which of the following actions ismapoccurred before the stage?

A.

split

B.

combine

C.

partition

D.

sort

Question # 138

existMapReduceIn the development framework,InputFormatWhat is the function of the class?

A.

mergemapThe result of the stage has the samekeyvalue key-value pair

B.

put allmapThe results of the stages are merged to get the final result

C.

rightmapThe results of the stages are sorted and merged

D.

Split the input data into individualsplit, and willsplitfurther split intoyes, then asmapletter input of numbers

Question # 139

FusionInsight HD in real-time processing scenarios, what computing frameworks are available? (multiple choice)

A.

Spark Streaming

B.

Streaming

C.

MapReduce

D.

HDFS

Question # 140

.existFusionInsight HDin, useSparkSQL, which of the following methods (or tools) can be used to performSQLstatement?

(multiple choice)

A.

JDB, C

B.

spark-beeline

C.

spark-sql

Question # 141

existSpark, the accumulator can realize high-speed parallel counter and variable summation; inSparkduring application development,

only inDriverGet the value of this counter on .

A.

True

B.

False

Question # 142

aboutStreamingdisaster recovery capability, which of the following statements is correct? (multiple choice)

A.

SupervisorAfter the process exits, it can beNimbusAutomatically identify and pull up without affecting running business.

B.

WorkerCan be used after abnormal exitSupervisorPulls up automatically without manual intervention.

C.

When a node fails, the tasks on the node will be reassigned to other normal nodes without manual intervention.

D.

NimbusAfter the failure, the backupNimbusIt will take over automatically without affecting the running business.

Question # 143

aboutFlumeWhich of the following descriptions is correct?

A.

(1) data fromsourcearrivechannel;(2)sinkfromchannelfetch data

B.

(1) data fromchannelarrivesource;(2)sinkfromsourcefetch data

C.

(1) data fromsinkarrivechannel;(2)sourcefromchannelfetch data

D.

(1) data fromchannelarrivesource;(2)sinkfromsourcefetch data

Question # 144

FusionInsight HD a completeStreaming CQLWhich of the following parts does the application contain at least? (multiple choice)

A.

Create an input stream

B.

Create output stream

C.

Business action statements (such asSELECT,INSERT,JIONWait)

D.

Apply commit statement

Question # 145

existStreamingin application development,BoltUse which of the following interfaces to sendTuple?

A.

to emit

B.

execute

C.

open

D.

nextTuple

Question # 146

SparkWhat kind of parameter setting methods does the application have? (multiple choice)

A.

in the application configuration filespark-defaults.confconfigure in

B.

When submitting your application, pass—confmake settings

C.

In the application code, passSparkContextofsetPropertymethod to set

D.

In the application code, passSparkConfobject to set

Question # 147

FusionInsight HDmiddle,HiveWhich of the following parameters of the client is used to controlreduceConcurrency? (multiple choice)

A.

hive.exec.reducers.bytes.per.reducer

B.

hive.exec-ducers.max

C.

hive.exec.max.dynamic.partitions

D.

hive.exec.max.dynamic.partitions.pernode

Question # 148

existFusionInsight HDin, aboutHivepartition (partition) function, which is wrong as described below?

A.

Partition fields are defined when the table is created

B.

There can only be one partition field, and multi-level partitions cannot be created

C.

Using partitions can reduce the data scan range of some queries, thereby improving query efficiency

D.

Partition fields can be used aswherecondition of the sentence

Question # 149

HDFSThe system time of the node where the client is located is the same as theFusionInsight HDThe system time of the cluster should be consistent. If there is a time difference, So the time difference should be less than a few minutes?

A.

5

B.

10

C.

20

D.

30

Question # 150

FusionInsight HDsystem, aboutHiveWhich of the provided file formats is not a columnar file?

A.

CRC

B.

Parquet

C.

RCFile

D.

TextFile

Question # 151

existFusionInsight HDin the cluster,FlumeWhich service does not support writing collected data to the cluster?

A.

HDFS

B.

HBase

C.

Kafka

D.

Redis

Question # 152

Which of the following measures can improveHBasequery performance? (multiple choice)

A.

pre-scoreregion,MakeregionEvenly distributed, improve concurrency

B.

Query frequent attributes are placed inrowkeyThe front part

C.

Use secondary indexes to adapt to more query scenarios

D.

Data read at the same time are stored in the samecellmiddle

Question # 153

HBaseFilters can set column names or column values as filter conditions, and support the use of multiple filters at the same time.

A.

True

B.

False

Question # 154

existKafkamiddle,ConsumerIt can support offline message and online message consumption at the same time, but consume data sequentially

of.

A.

True

B.

False

Question # 155

A project requires Internet access to a certain area? ?Save it, and search the full text of these Internet records to see if there is any? ?information, with

to prevent crime in the region.

In this scenario, which of the following options is the best?

A.

existSolrCreate an index and save the data, and return all the data during full-text search.

B.

existHBasestore data on theHBaseThe filtering characteristics of , satisfy fuzzy matching query.

C.

When storing data, inSolrindex on theHBaseStore complete data? ?when, throughSolrFull-text search to obtain and record

record key information, through the key information inHBasefor the full record.

Question # 156

existFlumeDuring cascaded transfers, you can usefail overmode transfer, so that if the next hop isFlumenode failure or

When the data is received abnormally, it can automatically switch to another way to continue transmission.

A.

True

B.

False

Question # 157

FusionInsight HDmiddle,OozieBefore submitting the job, you need to upload the configuration files andjarpackage toHDFS

A.

True

B.

False

H13-723_V2.0 PDF

$33

$109.99

3 Months Free Update

  • Printable Format
  • Value of Money
  • 100% Pass Assurance
  • Verified Answers
  • Researched by Industry Experts
  • Based on Real Exams Scenarios
  • 100% Real Questions

H13-723_V2.0 PDF + Testing Engine

$52.8

$175.99

3 Months Free Update

  • Exam Name: HCIP-Big Data Developer V2.0
  • Last Update: Dec 14, 2025
  • Questions and Answers: 526
  • Free Real Questions Demo
  • Recommended by Industry Experts
  • Best Economical Package
  • Immediate Access

H13-723_V2.0 Engine

$39.6

$131.99

3 Months Free Update

  • Best Testing Engine
  • One Click installation
  • Recommended by Teachers
  • Easy to use
  • 3 Modes of Learning
  • State of Art Technology
  • 100% Real Questions included