Weekend Sale - 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: spcl70

H13-711_V3.0 PDF

$33

$109.99

3 Months Free Update

  • Printable Format
  • Value of Money
  • 100% Pass Assurance
  • Verified Answers
  • Researched by Industry Experts
  • Based on Real Exams Scenarios
  • 100% Real Questions

H13-711_V3.0 PDF + Testing Engine

$52.8

$175.99

3 Months Free Update

  • Exam Name: HCIA-Big Data V3.0
  • Last Update: Jul 18, 2025
  • Questions and Answers: 649
  • Free Real Questions Demo
  • Recommended by Industry Experts
  • Best Economical Package
  • Immediate Access

H13-711_V3.0 Engine

$39.6

$131.99

3 Months Free Update

  • Best Testing Engine
  • One Click installation
  • Recommended by Teachers
  • Easy to use
  • 3 Modes of Learning
  • State of Art Technology
  • 100% Real Questions included

H13-711_V3.0 Practice Exam Questions with Answers HCIA-Big Data V3.0 Certification

Question # 6

In many small file scenarios, Spark will start many tasks. When there is a Shuffle operation in the SQL logic, the number of hash buckets will be greatly increased, which will seriously affect the performance. In Fusioninsight, scenarios for small files usually use the( )Operator to merge partitioni generated by small files in Tabler, reduce the number of partitions, avoid generating too many hash buckets during shuffle, and improve performance?

A.

group by

B.

coalosce

C.

onnect

D.

join

Full Access
Question # 7

Hardware failure is considered to be the norm, in order to solve this problem.HDFS has designed a copy mechanism. By default, a file, HDFS will save( )share?

A.

1

B.

2

C.

3

D.

4

Full Access
Question # 8

Which of the following HDFS commands can be used to check the integrity of data blocks?

A.

HDFS fsck

B.

HDFS fsck-delete

C.

HDFS dfsadmin -report

D.

HDES balancer -threshold 1

Full Access
Question # 9

Which of the following parts does the structure of the unified certification management system include?

A.

Unified Authentication Server

B.

Unified authentication management module

C.

Identity information storage server

D.

Unified Session Management Module

Full Access
Question # 10

YarnWhen doing resource scheduling, maptaak and reduceTask are run in( )middle.

Full Access
Question # 11

The correct installation process for installing Fusioninsight HD is

A.

Install Manager->Execute preinstall->LLD tool for configuration->Install cluster->Check after installation->Configure after installation

B.

LLD tool to configure -> execute preinstall-> install Managers-> install cluster-> post-installation check-> post-installation configuration

C.

Install Manager->LLD tool for configuration->execute preinstall->install cluster->post-installation check->post-installation configuration

D.

LLD tool to configure -> execute preinstalls-> install cluster-> install Manager-> post-installation check-> post-installation configuration

Full Access
Question # 12

Which of the following descriptions about HBase. Secondary Index is correct

A.

The secondary index associates the column to be searched with the rowkey into an index table

B.

At this point, it is listed as a new rowkey; the original rowkey becomes the value

C.

The secondary index is queried twice

D.

all of the above

Full Access
Question # 13

F1ink in( )interface for streaming data processing,( )interface for batch processing?

A.

Datastream API, DataSet API

B.

Data batch API.DataStream API

C.

Stream API.Batch API

D.

Batch API, Stream API

Full Access
Question # 14

Which of the following sub-products are included in the Fusioninsight family

A.

Fusioninsight Miner

B.

Fusioninsight Farmer

C.

Fusioninsight HD

D.

GaussDB 200

Full Access
Question # 15

Regarding the basic operation of Hive table building, the correct description is

A.

Once the table is created, the table name cannot be changed.

B.

Once the table is built. No new columns can be added

C.

The external keyword needs to be specified when creating an external table

D.

Once the table is created, the column names cannot be changed

Full Access
Question # 16

In the FusionInsight cluster, which of the following components does Spark mainly interact with?

A.

Hive

B.

YARN

C.

HDFS

D.

Zookeeper

Full Access
Question # 17

What processes are included in the HBase service of Fusioninsight HD?

A.

HMaster

B.

Slave

C.

HRegionServer

D.

Data Node

Full Access
Question # 18

What are the successful cases of Huawei Fusioninsight HD in the industry?

A.

digital government

B.

Smart Park

C.

smart transportation

D.

finance

Full Access
Question # 19

About Spark SQL&Hive difference and connection, which of the following statements is correct?

A.

Spark SQL is compatible with most Hive syntax and functions

B.

Spark SQL cannot use Hive's custom functions

C.

The execution engine of Spark SQL is Spark core, HiveThe default execution engine is MapReduce

D.

Spark SQL relies on Hive metadata

Full Access
Question # 20

Which of the following components must depend on Zookeeper to run?

A.

HDFS

B.

HBase

C.

Spark

D.

YARN

Full Access
Question # 21

The Fusionlnsight HD cluster contains many kinds of services, and each service consists of thousands of roles. Which of the following are the roles of the service?( )

A.

HDFS

B.

NameNode

C.

DataNode

D.

Hbase

Full Access
Question # 22

A Fusioninsight HD cluster contains multiple services, and each service consists of several roles. Which of the following are the roles of the service?

A.

HDFS

B.

NameNode

C.

DataNode

D.

HBase

Full Access
Question # 23

In the Fusioninsight product, which of the following descriptions are correct about the topic of creating Kafka?

A.

When creating TopicE of Kafkal, the number of Partitions must be set

B.

When creating TopicE of Kafkal, the number of Partition copies must be set

C.

Setting up multiple replicas can enhance the disaster tolerance of Kafka services

D.

all of the aboveA. True

Full Access
Question # 24

The following figure shows the computational model of Structured Streaming. By observation, it can be concluded that the final calculation result of 3 is

A.

Dog 1, owl 1

B.

Cat 2, dog 4, owl 2

C.

Cat 2, dog 3, owl 1

D.

Cat 1, cat 1, dog 2, dog 2, owl 2

Full Access
Question # 25

When the F1ume process is cascaded, which of the following sink types are used to receive the messages sent by the previous hop Flume?

A.

Avro sink

B.

Thrift sink

C.

Hive sink

D.

Null sink

Full Access
Question # 26

Which parts of the data need to be read to execute the HBase data reading business?

A.

HLog

B.

MemStore

C.

HFile

D.

HMaster

Full Access
Question # 27

Kafka Cluster Mirroring. Which of the following functions can be achieved by the tool?

A.

Kafka cross-cluster data synchronization scheme

B.

Kafka data backup within a single cluster

C.

Kafkat but intra-cluster data recovery

D.

None of the aboveA. True

Full Access
Question # 28

Which of the following scenarios is Sparki suitable for?

A.

graph computation

B.

Interactive query

C.

batch processing

D.

real-time stream processing

Full Access
Question # 29

Which operations in Hive can be merged?

A.

UNION ALL

B.

GROUP BY

C.

SELECT

D.

JOIN

Full Access
Question # 30

In Hive, which of the following descriptions about bucketing is correct?

A.

Data can put different data into different buckets according to the way of buckets

B.

Unsortable in bucket

C.

The advantage of bucketing is that it can achieve higher query processing efficiency and make sampling more efficient

D.

You can specify the number of buckets when creating a table

Full Access
Question # 31

Which service process manages the Region of HBasel?

A.

DataNode

B.

ZooKeeper

C.

HMaster

D.

HRegionServer

Full Access
Question # 32

What services can Huawei MRS provide to customers?

A.

Multi-node deployment

B.

Statistical analysis and data mining

C.

Security Control Based on Kerberosi Certificate

D.

Offline and real-time data processing

Full Access
Question # 33

Which of the following commands are of type set?

A.

scard

B.

sunion

C.

zcount

D.

hexists

Full Access
Question # 34

The ZKFC process is deployed on the following node in HDFS?

A.

Active NameNode

B.

Standby NameNode

C.

DataNode

D.

All of the above are wrong

Full Access
Question # 35

In the process of using Flume to transmit data, in order to prevent data loss due to the restart of the Flume process. Which of the following Channel types can be used

A.

Memory Channel

B.

JDBC Channel

C.

File Channel

D.

HDES Channel

Full Access
Question # 36

Which of the following options cannot be achieved through big data technology?

A.

business model discovery

B.

value for evaluation

C.

Products Featured

D.

Operational Analysis

Full Access
Question # 37

Through the unified user management system in the big data platform, the unified management of users, roles and organizations of various open source component application systems in the platform can be realized. Cross-domain single sign-on, log-out and unification between various application systems can be realized. identity authentication function.

A.

True

B.

False

Full Access
Question # 38

Fusioninsight Manger supports REST interface, SNMP interface and SYSLOG interface externally

A.

True

B.

False

Full Access
Question # 39

The following are the characteristics of Huawei Kunpeng processor

A.

High-performance computing, ARM-compatible high-performance Kunpeng processors and x86-architecture servers and solutions

B.

Safe and reliable, creating high quality as stable as Mount Tai

C.

Open ecology, support mainstream software and hardware in the industry, and work with developers, partners and industrial organizations to create a new base for intelligent computing

D.

everything abovecorrect

Full Access
Question # 40

Which of the following types of data is not semi-structured data?

A.

HTML

B.

XML

C.

two-dimensional table

D.

JSON

Full Access
Question # 41

Regarding DataSet, which of the following statements is incorrect?

A.

A DataSet is a strongly typed collection of domain-specific objects

B.

DataSet can perform most operations without deserialization

C.

DataSet needs to be deserialized to perform operations such as sort, filter, shuffle, etC.

D.

DataSet - highly similar to RDD. Better performance than RDD

Full Access
Question # 42

Huawei Fusioninsight HD is the first big data platform in China that complies with national financial and other garbage protection. What are the following aspects of its security?

A.

system security

B.

Authority authentication

C.

Data Security

D.

everything aboveA. True

Full Access
Question # 43

Which of the following functions can the Kafka Cluster Mirroring tool achieve?

A.

Kafka cross-cluster data synchronization method

B.

Kafka data backup within a single cluster

C.

Kafka data recovery within a single cluster

D.

None of the aboveA. True

Full Access
Question # 44

What is the core module of spark?

A.

spark streaming

B.

spark core

C.

mapreduce

D.

spark sql

Full Access
Question # 45

Regarding the relationship between Hive and other components of Hadoop. Which of the following descriptions is wrong?

A.

Hive finally stores data in HDFS

B.

Hive is a data warehouse tool for the Hadoop platform

C.

HQL can execute tasks through MapReduce

D.

Hive has a strong dependence on HBase

Full Access
Question # 46

RDD has Transformation and Action operators. Which of the following belongs to the Action operator?

A.

reduceByKey

B.

filter

C.

map

D.

saveAsTextFile

Full Access
Question # 47

Which of the following scenarios is not applicable to Hive?

A.

Real-time online data analysis

B.

Non-real-time analysis, such as log analysis, statistical analysis

C.

Data mining, such as user behavior analysis, interest division, regional display

D.

Data aggregation, such as daily, weekly user clicks,Click to rank

Full Access
Question # 48

In the Fusioninsight product, which statement is correct about the Kafka component?

A.

When creating a topic, the number of replicas must not be greater than the number of currently surviving Broker instances, otherwise the topic creation will fail

B.

When the Producer of Kafkal sends a message, it can specify which Consumer consumes the message

C.

Kafka will store metadata information in Zookeeper for

D.

After Kafka is installed, the sensitive data storage directory cannot be configured.

Full Access
Question # 49

Which of the following are the functions that Spark can provide?

A.

Distributed memory computing engine

B.

Distributed file system

C.

Unified scheduling of cluster resources

D.

Stream processing capabilities

Full Access
Question # 50

Is the description of the role of the standby NameNode correct in the HDFS system?

A.

Hot Standby for Primary NameNodel

B.

Standby NameNode) has no memory requirements

C.

Help the main NameNodet merge edit logs and reduce the startup time of the main NameNodel

D.

The standby NameNode should be deployed to the same node as the primary NameNode

Full Access
Question # 51

Which of the following options belong to the advantages of HUAWEI CLOUD MRS?

A.

Extensibility

B.

High reliability

C.

high performance

D.

Ease of use

Full Access
Question # 52

In Kafka HA, when the leader corresponding to the partition is down, a new leader needs to be elected from the followers. Which of the following roles should be executed?

A.

Follower

B.

Controller

C.

Brocker

D.

Leader

Full Access
Question # 53

The HDFS data reading process includes the following steps, please choose the correct order. (Drag title) Order: ECADB

A.

After obtaining this input stream, the client calls the read method to read the data. The input stream selects the nearest DataNode to establish a connection and read data.

B.

The client calls close. to close the input stream.

C.

The location where the data block corresponding to this file in NameNodel is obtained by calling NameNode remotely through RPC

D.

If the end of the data block has been reached. Then close the connection with this DataNodel, and then re-find the next data block. until all data is read.

E.

The client calls the open method of the FileSystem instance to obtain the input stream corresponding to the file.

Full Access
Question # 54

HDFS of HadoopE is a distributed file system. Which of the following application scenarios is suitable for data storage and management?

A.

Lots of small file storage

B.

High capacity and high leaf swallowing capacity

C.

low latency read

D.

Streaming data access

Full Access
Question # 55

SELECT aa.salarybB. address FROM employee aa JoiN SELECT adress FROM employee info where provine='zhejiang') What types of operations does bb ONaa.nanme=bB. name contain?

A.

create table

B.

Import Data

C.

subquery

D.

JOIN Cha Xun

Full Access
Question # 56

What are the performance bottlenecks of traditional data processing?

A.

High cost of data storage

B.

Insufficient streaming data processing performance

C.

Limited scalability

D.

Batch data processing is missing

Full Access
Question # 57

Which of the following scenarios in HBase will trigger the F1ush operation?

A.

HBasePeriod refreshMemstore, silentthink period is 1Hour

B.

When the number of files in WALs reaches a threshold

C.

The total size of the MemStore in the Region has reached the preset Flush Size threshold

D.

The ratio of the total memory occupied by the MemStore to the total memory of the RegionServer exceeds the preset threshold size

Full Access
Question # 58

The HDFS data reading process includes the following steps, please choose the correct order. (Drag picture title, sort question)

A.

After obtaining this input stream, the client calls the read method to read the data. The input stream selects the nearest DataNode to establish a connection and read data.

B.

The client calls close. to close the input stream.

C.

The location where the data block corresponding to this file in NameNodel is obtained by calling NameNode remotely through RPC

D.

If the end of the data block has been reached. Then close the connection with this DataNodel, and then re-find E. a data block. until all data is read.

E.

The client calls the open method of the FileSystem instance to obtain the input stream corresponding to the file.

Full Access
Question # 59

In the MRS interface, Loader can specify a variety of different data sources, configuration data cleaning and conversion steps, and configure cluster storage systems, etC. .

A.

TRUE

B.

FALSE

Full Access
Question # 60

Channel supports transactions and provides weaker order guarantees. Any number of Sources and Sinks can be connected.

A.

True

B.

False

Full Access
Question # 61

When a Spark application is running, if a certain task fails to run, the entire app fails to run.

A.

True

B.

False

Full Access
Question # 62

In the Output stage, Structured Streaming can define different data writing methods, including which of the following methods?

AAppend Mode

B. Update Mode

C. General Mode

D. Ccomplete Mode

Full Access
Question # 63

In the HDFS federated environment, which of the following contents are included in the NameSpace

A.

content

B.

document

C.

piece

D.

None of the above

Full Access
Question # 64

Which of the following statements about Huawei's big data solution is correct?

A.

Farmer is a data service framework

B.

GaussDB is an open source database product

C.

Fusioninsight Manager is a distributed system management framework, administrators can control distributed clusters through multiple access points

D.

Fusioninsight HD is an enhanced version based on the open source big data software Hadoopl

Full Access
Question # 65

In the era of big data, which of the following challenges are faced by enterprises?

A.

Data is scattered among various departments of the enterprise, and the same data is stored in different formats within each department.

B.

Diversified data structures.

C.

technological advancements of competitors.

D.

Scattered data has problems such as noise, missing, and non-standard storage types, which requires a lot of data preprocessing.

Full Access
Question # 66

Which of the following options is an important role for Spark

A.

DateNode

B.

Nodemanager

C.

Driver

D.

ResourceManager

Full Access
Question # 67

In order to improve the fault tolerance of Kafka, Kafka supports the replication strategy of partition. Which of the following descriptions about Leader partition and Follower partition is wrong?

A.

It is impossible for each node of a kafka cluster to be l with each othereader and flower

B.

If the leader fails, other followers will take over (become the new leader)

C.

Because the leader server carries all the request pressure. Therefore, from the overall consideration of the cluster, kafka will distribute the leader evenly on each instance to ensure the overall performance is stable

D.

Kafka needs to select a leader for partition replication, and the leader is responsible for reading and writing partitionsD. operation, other replica nodes are only responsible for data synchronization

Full Access
Question # 68

The most basic unit of HBase's distributed storage is Region.

A.

True

B.

False

Full Access
Question # 69

What types of data sources does F1ink stream processing include?

A.

Socket streams

B.

JDBC

C.

Files

D.

Ccollections

Full Access
Question # 70

The size of the memory allocated by YARN to Containerb in the Hadoop system, which can be set by the parameter yarn.app.mapreduceam.resource.mb

A.

True

B.

False

Full Access
Question # 71

F1ink not only provides real-time computing that supports both high throughput and exact-once semantics, but also provides batch data processing.

A.

True

B.

False

Full Access
Question # 72

Assuming that HDFS only saves 2 copies when writing data. Then during the writing process, HDFS Client first writes data to DataNodel1 and then writes data to DataNode2.0

A.

True

B.

False

Full Access
Question # 73

After submitting the topology using the Streaming client shell command in the Fusioninsight HD system, use Strom The UI view shows that the topology has not processed data for a long time. What are the possible reasons?

A.

Supervisor is the component that receives data in topology and then performs processing

B.

There is a logic error in the topology business, and it cannot run normally after submission

C.

The topology is too complex or the number of concurrent users is too large, resulting in workerThe startup time is too long, exceeding the waiting time of Supervisort

D.

The supervisor's slots resources are exhausted, and after the topology is submitted, the slots cannot be allocated to start the worker process.

Full Access
Question # 74

Which of the following extensions belong to ElasticSearch?

A.

hadoop

B.

head

C.

bigdesk

D.

IKAnalyzer

Full Access
Question # 75

Which of the following options belong to Fusioninsight data security?

A.

Operating system security hardening

B.

Component data encryption

C.

data integrity check

D.

User authority authentication management

Full Access
Question # 76

Hive in Fusioninsight contains two roles, HiveServer and MetaStore.

A.

True

B.

False

Full Access
Question # 77

The index data of ElasticSearch can only be stored in the HDFS system.

A.

True

B.

False

Full Access
Question # 78

The UNION ALL operator in Hive is used to combine the result sets of two more SELECT statements. Duplicate values are not allowed in the result set.

A.

True

B.

False

Full Access
Question # 79

In the Fusioninsight HD platform, HBase does not currently support secondary indexes

A.

True

B.

False

Full Access
Question # 80

Elastic SearchofSowleadCanbystoragein a variety ofstorage classtype,andthe followingwherekind of storagekindtypebranchhold?

A.

Shared file system

B.

Object Storage

C.

HDFS

D.

Local file system

Full Access
Question # 81

Hive is a data warehouse infrastructure built on Hadoop. It provides a set of tools that can be used to perform extract-transform-load (ETL), a mechanism for storing, querying, and analyzing large-scale data stored in Hadoop.

A.

True

B.

False

Full Access
Question # 82

ApplicationMasters apply for and receive resources from ResourceManagerl through the RPC protocol in a polling manner.

A.

True

B.

False

Full Access
Question # 83

In the Mapreducei process, by default, a shard is a block and a mapTask.

A.

True

B.

False

Full Access
Question # 84

Fusionin, a graphical health inspection tool The sight Tool consists of FusionCare and SysCheckerp.

A.

True

B.

False

Full Access
Question # 85

What is the default resource scheduler in YARN?

Full Access
Question # 86

About RDDs. Which of the following statements is false?

A.

RDD is a read-only, partitionable distributed dataset

B.

RDD is Spark's abstraction of underlying data

C.

RDDs have a lineage mechanism (Lineed)

D.

RDDs are stored on disk by default

Full Access
Question # 87

When a certain task of MapReduce fails, the task can be recalculated through the retry mechanism.

A.

TRUE

B.

FALSE

Full Access
Question # 88

Which of the following options are Huawei's data center solutions?

A.

Data Lake Governance Center DGC

B.

Cloud operating system FusionSphere Openstack

C.

AI development platform ModelArts

D.

Video Analysis VAS

Full Access
Question # 89

Commands in Redis are case-sensitive.

A.

True

B.

False

Full Access
Question # 90

In Fusioninsight HD, which of the following is not a flow control feature of Hive

A.

Supports threshold control for the total number of established connections

B.

Supports threshold control for the number of connections established by each user

C.

Supports threshold control on the number of connections established by a specific user

D.

Supports threshold control of the number of connections established per unit time

Full Access
Question # 91

What are the types of znodes in Zookeeper?

A.

sem1-persistent

B.

ephemeral

C.

temporary

D.

persistent

Full Access
Question # 92

What information does a Key Value format in the HBase data file HFiler contain?

A.

Key

B.

Value

C.

Timestamp

D.

KeyType

Full Access
Question # 93

Which of the following statements about Fusioninsight HBasel visual modeling are correct?

A.

Visual modeling helps DBAs in modeling design and lowers the threshold for using HBase

B.

Qualifier HBase column: each column represents an attribute of business data

C.

Realize the division of labor: DBAs focus on data table modeling, developers focus on user table names and columns used

D.

Column user table column: each column represents a KeyValue

Full Access
Question # 94

What information is included in calling the Zookeeper client command?

A.

The port number

B.

IP address

C.

server nickname

D.

username

Full Access
Question # 95

What are the storage formats supported by Hive in the Fusioninsight HD system?

A.

HFile

B.

TextFile

C.

SequenceFile

D.

RCFile

Full Access
Question # 96

Which components in the Fusioninsight HD platform support table and column encryption?

A.

Flink

B.

HBase

C.

Hive

D.

HDFS

Full Access
Question # 97

What is the default resource scheduler in YARN?( )

A.

FIFO scheduler

B.

capacity scheduler

C.

Fair scheduler none of the

Full Access
Question # 98

What kinds of Kafka message transmission guarantees are usually provided?

A.

At Most Three Times

B.

Exactly Once

C.

At least once (At Lease once)

D.

At Most once

Full Access
Question # 99

Which of the following statements about CarbonData in Fusioninsight is correct?

A.

cArbon is also a high-performance analytics engine that integrates data sources with spark.

B.

cArbon uses a combination of lightweight compression and heavyweight compression to compress data, which can reduce data storage space by 60%-80% and greatly save hardware storage costs.

C.

arbon is a new Apache Hadoop native file format that uses advanced columnar storage, indexing, compression, and encoding techniques to improve computational efficiency to help accelerate data queries over petabytes of magnitude, and can be used for faster interactive queries.

D.

The purpose of using carbon is to provide ultra-fast responses to ad-hoc queries on big data.

Full Access
Question # 100

Which of the following are the optimization methods of Redis?

A.

Reduced key-values

B.

Turn off persistence

C.

Limit Redis memory size

D.

slowlog configuration

Full Access
Question # 101

Which of the following are the characteristics of Streamingl?

A data is stored first and then calculated

B. event driven

C. low latency

D. Can do continuous query

Full Access
Question # 102

Which of the following descriptions about the functions of HMaster in HBase are correct?

A.

Region load balancing, Region splitting and Region allocation after splitting

B.

Responsible for creating tables, modifying tables, deleting tables

C.

Responsible for load balancing of RegionServer

D.

Region after RegionServer failsmigrate

Full Access
Question # 103

What services can Huawei DWS provide to customers?

A.

Support GDS tools to speed up data storage

B.

Ensure high reliability of data and systems

C.

Trillions of data correlation analysis seconds response

D.

Unified management console

Full Access
Question # 104

From the point of view of the life cycle, what stages does data mainly go through?

A.

data collection

B.

data storage

C.

data management

D.

data analysis

E.

data presentation

Full Access
Question # 105

What are the correct understandings and descriptions of the main features of big data?

A.

Many data sources and formats

B.

Fast data growth and fast processing

C.

Large amount of data, large amount of calculation

D.

Low data value density, high value

Full Access
Question # 106

What steps are included in the preparation for Fusioninsight HD installation?

A.

Complete the hardware installation

B.

Complete the node host OS installation

C.

Prepare tools and software. Such as Putty, LLD, Fusioninsight HD software installation package, etC.

D.

Prepare planning data. such as network parameters and role deployment locations

Full Access
Question # 107

In Flink( )Interface for streaming data processing.( )interface for batch processing

A.

Stream API, Batch API

B.

Data Stream APl. DataSet AP

C.

DataBatch AP1.DataStreamAPIi

D.

BatchAP1, Stream APi

Full Access
Question # 108

In the Fusioninsight HD system, it fails to view the topology or submit the topology using the Shelle command of the Streaming client. Which of the following positioning methods are correct?

A.

Check the client's exception stack to determine whether the client is using problems

B.

Check the running log of the main Nimbus to determine whether the Nimbus server is abnormal

C.

Check the Supervisori running log to determine whether the Supervisor is abnormal

D.

View Worker running log

Full Access
Question # 109

To enable the log aggregation function of the Yam component in the Hadoop platform, which parameter needs to be configured?

A.

yarn.nodemanager.local-dirs

B.

yarn.nodemanager.log-dirs

C.

yarn.acl.enable

D.

yarn.log-aggregation-enable

Full Access
Question # 110

In Huawei's big data solution, which of the following components are included in the hadoop layer?

A.

Miner

B.

Spark

C.

Hive

D.

Flink

Full Access
Question # 111

Which statement is correct about worker (worker process), Executor (thread) and task (task)?

A.

Each Executor (thread) can run a number of tasks (tasks)

B.

Each Executor (thread) can run different components (spout or bolt) of the task (task)

C.

Each worker can run multiple Executors (threads)

D.

Each worker can only run Executors (threads) for one topology

Full Access
Question # 112

Which of the following scenarios is not suitable for HDFS?

A streaming data access

B. Lots of small file storage

C. Large file storage and access

D. random write

Full Access
Question # 113

Which of the following belongs to the shufle mechanism in the MapReduce process?

A.

partition

B.

sort/merge

C.

????

D.

ccombine

Full Access
Question # 114

F1ink is a unified computing framework that combines batch processing and stream processing. Its core is a stream data processing engine for data distribution and parallel computing.

A.

True

B.

False

Full Access
Question # 115

In HBasel, when the size of a Region becomes larger, it may be pruned.

A.

True

B.

False

Full Access
Question # 116

If using Redis pairAn ordered collection is sorted, which data type?

A.

string

B.

Sat

C.

Hash

D.

sorted set

Full Access
Question # 117

HDFS supports large file storage, and supports multiple users' write operations to the same file, as well as modification at any position of the file.

A.

True

B.

False

Full Access
Question # 118

The following offin Kafka messagesWhich one of the speed transmission methods is still correct?

A.

Postingonesubscriptioninformationsystem, the sameNumber of barsData can be consumed by multiple consumers. data isConsumptionnot laterdelete immediately

B.

Distributed Messaginghand overThere are two mainwantofmessage passing pattern,peer to peertransferpattern, haircloth-subscription model

C.

point-to-pointinformation systemmedium, cancanhave multiple consumptionsat the same timeremovefeedata, becauseThis does not guarantee the order in which data is processed.

D.

In a point-to-point messaging system, when a messagefeeByremovefeeteamone of the columnsdataAfter that, thedata rulefromdelete from message queue

Full Access
Question # 119

What query types does ElasticSearch have? (multiple choice)

A.

full-text search

B.

Term-based search

C.

Search based on score

D.

Retrieval based on metadata

Full Access
Question # 120

In the Fusioninsight HD system, if dirty data is generated while the Loader job is running, the status of the Loader job execution result must be failed.

A.

True

B.

False

Full Access
Question # 121

In the HDFS mechanism, NameNodet is responsible for managing metadata. Each read request on the Clienty side needs to read metadata information from the metadata disk of NameNode, so as to obtain the location of the read file in DataNodel.

A.

True

B.

False

Full Access
Question # 122

If there are only Default, QueueA and QueueB sub-queues in the YARNU group, then their capacities are allowed to be set to 60%, 25%, and 22% respectively.

A.

True

B.

False

Full Access
Question # 123

The following offon HDFSarity ofholding a spoontransformedProcess description,pleaseArrange from top to bottom in the correct order.

A.

Saccn? ?node periodically fromMa? ?download?and?files

B.

in HDFS Sectionformat onceRear,?will generate fs?me and?twodelivery

C.

to replace?file, and will?coincident name?make up new?and and.

D.

to avoid?Increasing,?will be merged periodically?become new?.

E.

put frimage?combine into new? ?document.

Full Access
Question # 124

Which fields can ElasticSearchl's balancing algorithm be applied to?

A.

Import Data

B.

export data

C.

volume reduction

D.

Expansion

Full Access
Question # 125

LdapServer's Group (group) is a unified group management for users. If a user is added to the group, the member's dn record will be added to the nember attribute of the group.

A.

True

B.

False

Full Access
Question # 126

Data in FlumeCompression characteristics are mainlyYesFor which of the following purposes?

A.

lower diskI0

B.

Enhanced security

C.

to mentionHigh reliability

D.

lower gridI0

Full Access
Question # 127

Which of the following descriptions about Hive are correct?

A.

supportTez, Spark and other computing engines

B.

Can query and manage petabytes of distributed data

C.

E for dataTL process automation

D.

Direct access to HDFS files and HBase

Full Access
Question # 128

The main difference between YARN-client and YARN-cluster is the difference between the Application Master process.

A.

True

B.

False

Full Access
Question # 129

Flume's properties.properties configuration file can configure multiple channels to transmit data.

A.

True

B.

False

Full Access
Question # 130

Colocation (identical distribution) The same distribution at the file level realizes fast access to files. It avoids a lot of network overhead caused by data relocation.

A.

True

B.

False

Full Access
Question # 131

Fusioninsight is Huawei's unified platform for enterprise-level big data storage, query, and analysis. It can help enterprises quickly build massive data information processing systems, and discover new value points and business opportunities through real-time and non-real-time analysis and mining of massive information data.

A.

True

B.

False

Full Access
Question # 132

In a point-to-point messaging system, data in the queue can be consumed by one or more consumers, but a message can only be consumedoneSecond-rate.

A.

True

B.

False

Full Access
Question # 133

When creating a Loaderf job, in which of the following steps can the filter type be set?

Full Access
Question # 134

When Solr creates CollectionE, it is recommended to select the routing algorithm as compositld Router, then the Collection can expand shard.

A.

True

B.

False

Full Access
Question # 135

What is the default resource scheduler for queues in YARN?

Full Access
Question # 136

The emergence of HFS solves the need to store a large number of small files (below 10MB) in HDFS. At the same time, it is necessary to store some mixed scenes of large files (above 10MB)

A.

True

B.

False

Full Access
Question # 137

By configuring which of the following parameters can the logs generated in Kafkal be cleaned up?

A.

log.retention.bytes

B.

server.properties

C.

log.cleanup.policy

D.

log.retention.hours

Full Access
Question # 138

The overall process of Kafka Produceri reading data is that the Producer connects to any surviving Broker, requests the leader metadata information of the specified topic and partition, and then directly connects with the corresponding Brokerl to publish the data.

A.

True

B.

False

Full Access
Question # 139

ElasticSearch can be used as a relational database similar to MySQL.

A.

True

B.

False

Full Access
Question # 140

Redis adopts a non-central self-organizing structure. Nodes use the Gossip protocol to exchange node status information.

A.

TRUE

B.

FALSE

Full Access
Question # 141

ElWhat processing capabilities does asticSearch have for structured, semi-structured, and unstructured data?

A.

to enterA series of operations such as line cleaning, word segmentation, and establishment of an inverted index

B.

Provides the ability to search full text, conditions can include words or phrases

C.

The written data can be checked in real timeSow

D.

numberOptional rewrite when data is writtendeleteand compression function

Full Access
Question # 142

Fusioninsight tool is a set of health detection tools provided for technical support engineers and maintenance engineers. It can check the health status of cluster-related nodes and services, discover potential problems in the cluster in advance, and generate health check reports. It is convenient for technical support engineers and maintenance engineers to quickly understand the health status of the system.

A.

True

B.

False

Full Access
Question # 143

Select which of the following conversion rules are supported by Loader jobs? (multiple choice)

A.

Modulo conversion

B.

Null conversion

C.

splice conversion

D.

Add constant field

Full Access
Question # 144

Hadoop's NameNode is used to store the metadata of the file system.

A.

True

B.

False

Full Access
Question # 145

In the Fusioninsight HD product, a typical kafka cluster contains several producers, thousands of consumers and a zookeeper cluster?

A.

True

B.

False

Full Access
Question # 146

HMaster needs to be connected when using HBase for data reading service in Fusioninsight HD

A.

True

B.

False

Full Access
Question # 147

Which of the following scenarios are not good at F1ink components?

A.

to foldGeneration computing

B.

stream processing

C.

datastorage

D.

batch processing

Full Access
Question # 148

When using Loaderi for data import and export, data processing must go through the Reducel stage

A.

True

B.

False

Full Access
Question # 149

Spark is a memory-based computing engine. All Sparki program data in the running process can only be stored in memory

A.

True

B.

False

Full Access
Question # 150

The processing logic of topology is in bolt.

A.

True

B.

False

Full Access
Question # 151

Spark Streaming has higher real-time performance than Storm.

A.

True

B.

False

Full Access
Question # 152

Which of the following descriptions about the features of Zookeeper is wrong?

A.

Updates sent by the client are applied in the order in which they were sent

B.

A message is to be received by more than half-respected servers,he will be able to successfully write to disk

C.

A message update can only succeed or fail. There is no intermediate state

D.

The number of Zookeeper nodes must be odd

Full Access
Question # 153

Which nodes are required to communicate with external data sources before and after the Fusioninsight HD Loader job?

A.

Loader service master node

B.

The node on which the YARN service job is running

C.

Both of the first two are required

D.

Neither of the first two are needed

Full Access
Question # 154

Which of the following descriptions about Zookeeper features is wrong?

A.

The number of Zookeeper nodes must be odd.

B.

Updates sent by the client are applied in the order in which they were sent.

C.

Message updates can only succeed or fail, with no intermediate states.

D.

A message needs to be received by more than half of the servers.it will be able to successfully write to disk

Full Access
Question # 155

Regarding the basic operation of Hive table building, which is the correct description?

A.

When creating an external table, you need to specify the external keyword

B.

Once the table is created, the table name cannot be changed

C.

Once the table is created, the column names cannot be changed

D.

Once the table is created, no new columns can be added

Full Access
Question # 156

When a Regioni in HBaser performs the Split operation, what stage occurs in the process of actually dividing an HFile file into two Regions?

A.

During Spliti

B.

During Flush

C.

ompactionj process

D.

HFile separation process

Full Access
Question # 157

Which module is responsible for Fusioninsight Manager user data storage?

A.

CAS

B.

AOS

C.

Kerberos

D.

LDAP

Full Access
Question # 158

Which configuration is not supported by Fusioninsight Manager user rights management?

A.

Assign roles to users

B.

Configure permissions for roles

C.

Assign roles to user groups

D.

Configure permissions for user groups

Full Access
Question # 159

When the number of nodes in the Zookeeper cluster is 5 nodes, how many nodes are the disaster recovery capabilities of the cluster equivalent to?

A.

3

B.

4

C.

6

D.

none of the above

Full Access
Question # 160

What kind of computing tasks are the MapReduces components in Hadoop good at?

A.

Iterative calculation

B.

Offline computing

C.

real-time interactive computing

D.

Streaming Computing

Full Access
Question # 161

Which of the following descriptions about Hive features is incorrect?

A.

Flexible and convenient ETL

B.

Only supports MapReduce computing engine

C.

Direct access to HDFS files and HBase

D.

Easy to use and easy to program

Full Access
Question # 162

Kafka cluster during runtime,Directly depend on the following components?

A.

Zookeeper

B.

HDFS

C.

Spark

D.

HBase

Full Access
Question # 163

Which of the following scenarios does Hive not apply to?

A.

Non-real-time analysis. Such as log analysis, statistical analysis

B.

data mining. Such as user behavior analysis, interest analysis, regional display

C.

Data summary. such as clicks per user per day,Click to rank

D.

Real-time online data analysis

Full Access
Question # 164

Which of the following descriptions about Fusioninsight CTBase is incorrect?

ACTBase's read and write data interface. It uniformly encapsulates the interface defined by the line, and automatically merges and parses cold fields without merging and interpretation in the application.

B. CTBase is a cluster table development framework based on HBasel

C. CTBase provides a set of WebUI for metadata definition,Provides a medical-only watch design tool to reduce the difficulty of watch design

D. CTBase's java API provides a set of interfaces for HBasej connection pool management. Internal connection sharing is performed to reduce the difficulty of client application development.

Full Access
Question # 165

When the Loader of MRS creates a job, what is the role of the connector?

A.

Configure how jobs connect to external data sources

B.

Configure how jobs connect to internal data sources

C.

Provide optimization parameters to improve data import and export performance

D.

Make sure there are conversion steps

Full Access
Question # 166

Which of the following statements about Flink barriers is wrong

A.

Barriers are periodically inserted into the data flow and flow with it as part of the data flow

B.

Barriers are at the heart of Flink snapshots

C.

A barrier separates the snapshot data of the current cycle from the snapshot data of the next cycle

D.

When the barrier is inserted, it will temporarily block the data flow

Full Access
Question # 167

How many shards does an index library of ElasticSearchl have by default?

A.

5

B.

6

C.

3

D.

4

Full Access
Question # 168

In the Fusioninsight HD system, which component does the flume data flow not need to pass through in the node?

A.

sink

B.

topic

C.

Source

D.

Channel

Full Access
Question # 169

What is wrong about the architecture description of Hive in Fusionlnsight HD?

A.

As long as one HiveServer is unavailable, the entire HiveEcluster is unavailable

B.

HiveServert is responsible for accepting client requests, parsing, executing HQL commands and returning query results

C.

MetaStore is used to provide raw data services and depends onDBServer

D.

At the same time, only one HiveServer is in Active state, and the other is in Standby state

Full Access
Question # 170

Which of the following descriptions about Hive log collection on the Fusioninsight Manager interface is incorrect?

A.

You can specify a specific user for log collection, for example, only download logs generated by UserA.

B.

You can specify a time period for log collection, for example, only collect logs from 2016-1-1 to 2016-1-10.

C.

You can specify an instance for log collection, for example, specify to collect metstore logs.

D.

The node IP can be specified for log collection, for example, only the logs of a certain IP can be downloaded.

Full Access
Question # 171

When the loader in Fusioninsight HD imports files from the SFTP server, which of the following file types does not require encoding conversion and data conversion and is the fastest?

A.

sequence_file

B.

text_file

Cbinary_file

C.

graph_file

Full Access
Question # 172

Fusioninsight Manager's statement about the configuration function of the service is incorrect?

A.

Service level configuration can take effect for all instances

B.

Instance-level configuration takes effect only for this instance

C.

Instance-level configuration also takes effect on other instances

D.

After the configuration is saved, you need to restart the service to take effect

Full Access
Question # 173

Streaming:Event listening is mainly implemented through which of the following services provided by Zookeeper?

A.

Distributed detection mechanism

B.

ACK

C.

Watcher

D.

Checkpoint

Full Access
Question # 174

The figure below shows the data transmission architecture of flume. What is the component at the "?" in the figure?

A.

Interceptor

B.

Channel processor

C.

Channel selector

D.

None of the aboveA. True

Full Access
Question # 175

Regarding RDD, which of the following statements is wrong?

A.

RDD has a lineage mechanism (Lineage)

B.

RDDs are stored on disk by default

C.

RDD is a read-only, partitionable distributed dataset

D.

RDD is Spark's abstraction of underlying data

Full Access
Question # 176

In the F1ink technical architecture,( )is a computing engine for stream processing and batch processing

A.

Standalone

B.

Runtime

C.

DataStream

D.

FlinkCore

Full Access
Question # 177

Which of the following commands deletes files?

A.

dfs-clear

B.

dfs -del

C.

dfs -rm

D.

dfs -Is

Full Access
Question # 178

Which module in Hadoop is responsible for data storage in HDFS?

A.

NameNode

B.

Data Node

C.

ZooKeeper

D.

JobTaoker

Full Access
Question # 179

Which instance must the Loader instance be deployed with in FusionlnsightHD?

A.

DataNode

B.

RegionServer

C.

ResourceManager

D.

NodeManager

Full Access
Question # 180

Which of the following is not a characteristic of the MapReduce component in Hadoopl?

A.

easy to program

B.

good scalability

C.

real-time computing

D.

High fault tolerance

Full Access
Question # 181

The following descriptions about Kafkaf are wrong( )

A.

Used as the basis for activity streams and operational data processing pipelines

B.

Developed by Apache Hadoop and open sourced in 2011

C.

It has the characteristics of information persistence, high throughput, real-time, etC.

D.

Implemented using Scala, Java language

Full Access
Question # 182

Regarding the comparison between Hive and traditional data warehouse, which of the following descriptions is wrong?

A.

Hive metadata storage is independent of data storage, thereby decoupling metadata and data. High flexibility, while traditional data warehouse data application is single, low flexibility

B.

Hive is based on HDFS storage. Theoretically, the storage capacity can be expanded infinitely. The storage capacity of traditional data warehouses will have an upper limit

C.

Since Hive data is stored in HDFS, it can ensure high fault tolerance and high reliability of data

D.

Since Hive is based on a big data platform, query efficiency is faster than traditional data warehouses

Full Access
Question # 183

What is the physical storage unit of Region in HBasel

A.

Region

B.

ColumnFamily

C.

olumn

D.

Row

Full Access
Question # 184

What does Fusioninsight HD HBase use by default as its underlying file storage system?

A.

HDFS

B.

Hadoop

C.

Memory

D.

MapReduce

Full Access
Question # 185

Which description about HIVE is incorrect?

A.

The best use case for Hive is batch jobs with large datasets

B.

Hive can realize low-latency and fast query on large-scale data sets

C.

Hive is built on top of Hadoop based on static batch processing, Hadoop usually has high latency andC. A lot of overhead is required for committing and scheduling

D.

The Hive query operation process strictly follows the function execution model of Hadoop MapReduce. Hive converts the user's HiveQL statement into Map through the interpreter Reduce on Hadoop cluster

Full Access
Question # 186

In an MRS cluster, which of the following components does Spark mainly interact with?

A.

Zookeeper

B.

Yarin

C.

Hive

D.

HDFS

Full Access
Question # 187

Which statement about the DataNodel of HDFS in Huawei Fusioninsight HD system is correct?

A.

Does not check the validity of the data

B.

Periodically send the block-related information of this node to the NameNode

C.

Blocks stored in different DataNodes must be different

D.

Blocks on a DataNode. can be the same

Full Access
Question # 188

What is the module used to manage the active and standby status of the Loader Server process in Loader?

A.

Job Scheduler

B.

HA Manager

C.

Job Manager

D.

Resource Manager

Full Access
Question # 189

Which of the following links is the data conversion operation of F1ink completed?

A.

soure

B.

Transformation

C.

Sink

D.

Channel

Full Access
Question # 190

Which way is incorrect to load data into Hive table?

A.

Load the file of the local path directly into the Hive table

B.

Load the files on HDFS into the Hive table

C.

Hive supports insert into! Single record method, so you can insert a single record directly on the command line

D.

Insert result sets from other tables into Hive tables

Full Access
Question # 191

When planning and deploying a Fusionlnsight cluster, it is recommended that the management node be best deployed( ), the control node needs to be deployed at least( )Piece,Data nodes need to be deployed at least( )Piece.

A.

1.2, 2

B.

1, 3, 2

C.

2, 3, 1

D.

2, 3, 3

Full Access
Question # 192

Which of the following statements about ZKFC is wrong?

A.

ZKFC (ZKFailoverController) is used as a Zookeeper cluster client to monitor the status information of NameNodel

B.

The ZKFC process needs to be deployed in NameNodel's node and Zookeeper's Leader node

C.

Standby NameNodej perceives the status of Active NameNodel through Zookeeper. Once the Active NameNode is down, the Standby NameNode will perform the main upgrade operation.

D.

The ZKFC of HDFS NameNodel connects to Zookeeper, and saves the host name and other information in Zookeeper

Full Access
Question # 193

Which of the supervisor descriptions of Fusioninsight HD Streaming is correct?

A.

Supervisor is responsible for resource allocation and task scheduling

B.

Supervisort is responsible for accepting tasks assigned by Nimbus, starting and stopping Worker processes that belong to its own management

C.

Supervisor is a process that runs specific processing logic

D.

Supervisor is a component that receives data in Topology and then performs processing

Full Access
Question # 194

The order in which the YARN scheduler allocates resources, which one of the following descriptions is correct?

A.

Any machine -> same rack -> local resources

B.

Any machine -> local resources -> same rack

C.

Local resources -> same rack-?Any machine

D.

Same rack->any machine->local resources

Full Access