3 Months Free Update
In scenarios with many small files, Spark starts many tasks. When the SQL logic contains a shuffle operation, the number of hash buckets increases greatly, which seriously degrades performance. In FusionInsight, small-file scenarios usually use the ( ) operator to merge the partitions generated by small files in a table, reducing the number of partitions, avoiding too many hash buckets during shuffle, and improving performance.
Hardware failure is considered the norm. To address this, HDFS is designed with a replica mechanism. By default, HDFS saves ( ) copies of a file.
Which of the following HDFS commands can be used to check the integrity of data blocks?
Which of the following parts does the structure of the unified certification management system include?
When YARN performs resource scheduling, MapTask and ReduceTask run in ( ).
Which of the following descriptions about the HBase secondary index is correct?
In Flink, the ( ) interface is used for streaming data processing and the ( ) interface for batch processing.
Which of the following sub-products are included in the FusionInsight family?
Regarding the basic operations of creating tables in Hive, which description is correct?
In the FusionInsight cluster, which of the following components does Spark mainly interact with?
Regarding the differences and connections between Spark SQL and Hive, which of the following statements is correct?
A FusionInsight HD cluster contains many kinds of services, and each service consists of several roles. Which of the following are service roles? ( )
A FusionInsight HD cluster contains multiple services, and each service consists of several roles. Which of the following are the roles of the service?
In the FusionInsight product, which of the following descriptions of creating a Kafka topic are correct?
The following figure shows the computational model of Structured Streaming. By observation, it can be concluded that the final calculation result of 3 is
When Flume processes are cascaded, which of the following sink types are used to receive the messages sent by the previous-hop Flume?
Which parts of the data need to be read to execute the HBase data reading business?
Kafka Cluster Mirroring: which of the following functions can be achieved by the tool?
When Flume is used to transmit data, which of the following Channel types can be used to prevent data loss when the Flume process restarts?
Which of the following options cannot be achieved through big data technology?
Through the unified user management system in the big data platform, the users, roles, and organizations of the various open source component application systems in the platform can be managed in a unified way. Cross-domain single sign-on, single log-out, and unified identity authentication between the application systems can also be realized.
FusionInsight Manager supports REST, SNMP, and Syslog interfaces externally.
Huawei FusionInsight HD is the first big data platform in China that complies with the national classified protection requirements for the financial sector. Which of the following aspects does its security cover?
Which of the following functions can the Kafka Cluster Mirroring tool achieve?
Regarding the relationship between Hive and other components of Hadoop, which of the following descriptions is wrong?
RDD has Transformation and Action operators. Which of the following belongs to the Action operator?
In the FusionInsight product, which statement is correct about the Kafka component?
Which description of the role of the standby NameNode in the HDFS system is correct?
Which of the following options belong to the advantages of HUAWEI CLOUD MRS?
In Kafka HA, when the leader of a partition goes down, a new leader must be elected from the followers. Which of the following roles performs the election?
The HDFS data reading process includes the following steps, please choose the correct order. (Drag title) Order: ECADB
HDFS in Hadoop is a distributed file system. Which of the following application scenarios is it suitable for in terms of data storage and management?
SELECT aa.salary, bb.address FROM employee aa JOIN (SELECT address FROM employee_info WHERE province='zhejiang') bb ON aa.name = bb.name. What types of operations does this statement contain?
Which of the following scenarios in HBase will trigger the Flush operation?
The HDFS data reading process includes the following steps, please choose the correct order. (Drag picture title, sort question)
In the MRS interface, Loader can specify a variety of data sources, configure data cleaning and conversion steps, configure cluster storage systems, etc.
A Channel supports transactions and provides weak ordering guarantees. It can be connected to any number of Sources and Sinks.
When a Spark application is running, if a certain task fails to run, the entire app fails to run.
In the Output stage, Structured Streaming can define different data writing methods, including which of the following methods?
A. Append Mode
B. Update Mode
C. General Mode
D. Complete Mode
In the HDFS federated environment, which of the following contents are included in the NameSpace?
Which of the following statements about Huawei's big data solution is correct?
In the era of big data, which of the following challenges are faced by enterprises?
In order to improve the fault tolerance of Kafka, Kafka supports the replication strategy of partition. Which of the following descriptions about Leader partition and Follower partition is wrong?
In the Hadoop system, the amount of memory that YARN allocates to a Container can be set with the parameter yarn.app.mapreduce.am.resource.mb.
Flink not only provides real-time computing that supports both high throughput and exactly-once semantics, but also provides batch data processing.
Assume that HDFS saves only 2 copies when writing data. During the writing process, the HDFS Client first writes data to DataNode1 and then writes data to DataNode2.
After a topology is submitted with the Streaming client shell command in the FusionInsight HD system, the Storm UI shows that the topology has not processed data for a long time. What are the possible reasons?
The UNION ALL operator in Hive is used to combine the result sets of two or more SELECT statements. Duplicate values are not allowed in the result set.
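As a rough illustration in plain Python rather than Hive itself, the two SELECT results below are modeled as lists of rows. The helper names `union_all` and `union_distinct` and the sample rows are hypothetical and exist only for this sketch. They mirror standard SQL semantics, in which UNION ALL keeps duplicate rows and UNION (DISTINCT) removes them.

```python
def union_all(*result_sets):
    """Concatenate result sets and keep duplicates (UNION ALL semantics)."""
    combined = []
    for rows in result_sets:
        combined.extend(rows)
    return combined


def union_distinct(*result_sets):
    """Concatenate result sets and drop duplicates, keeping first-seen order."""
    seen = set()
    distinct = []
    for row in union_all(*result_sets):
        if row not in seen:
            seen.add(row)
            distinct.append(row)
    return distinct


# Two made-up result sets that share the row ("bob",).
first = [("alice",), ("bob",)]
second = [("bob",), ("carol",)]

all_rows = union_all(first, second)            # the shared row appears twice
distinct_rows = union_distinct(first, second)  # the shared row appears once
```

Comparing `all_rows` with `distinct_rows` shows which operator preserves the repeated row.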
In the FusionInsight HD platform, HBase does not currently support secondary indexes.
Elasticsearch indexes can be stored in a variety of storage types. Which of the following storage types are supported?
Hive is a data warehouse infrastructure built on Hadoop. It provides a set of tools that can be used to perform extract-transform-load (ETL), a mechanism for storing, querying, and analyzing large-scale data stored in Hadoop.
ApplicationMasters apply for and receive resources from the ResourceManager through the RPC protocol in a polling manner.
FusionInsight Tool, a graphical health inspection tool, consists of FusionCare and SysChecker.
When a certain task of MapReduce fails, the task can be recalculated through the retry mechanism.
In FusionInsight HD, which of the following is not a flow control feature of Hive?
What information does a KeyValue entry in the HBase data file (HFile) contain?
Which of the following statements about FusionInsight HBase visual modeling are correct?
What are the storage formats supported by Hive in the FusionInsight HD system?
Which components in the FusionInsight HD platform support table and column encryption?
Which of the following statements about CarbonData in FusionInsight is correct?
Which of the following are the characteristics of Streaming?
A. Data is stored first and then calculated
B. Event-driven
C. Low latency
D. Supports continuous queries
Which of the following descriptions about the functions of HMaster in HBase are correct?
From the point of view of the life cycle, what stages does data mainly go through?
What are the correct understandings and descriptions of the main features of big data?
What steps are included in the preparation for FusionInsight HD installation?
In Flink, the ( ) interface is used for streaming data processing and the ( ) interface for batch processing.
In the FusionInsight HD system, viewing or submitting a topology with the shell command of the Streaming client fails. Which of the following troubleshooting methods are correct?
To enable the log aggregation function of the YARN component in the Hadoop platform, which parameter needs to be configured?
In Huawei's big data solution, which of the following components are included in the Hadoop layer?
Which statement is correct about worker (worker process), Executor (thread) and task (task)?
Which of the following scenarios is not suitable for HDFS?
A. Streaming data access
B. Lots of small file storage
C. Large file storage and access
D. Random writes
Which of the following belongs to the shuffle mechanism in the MapReduce process?
Flink is a unified computing framework that combines batch processing and stream processing. Its core is a stream data processing engine for data distribution and parallel computing.
HDFS supports large file storage, and supports multiple users' write operations to the same file, as well as modification at any position of the file.
Which of the following statements about Kafka message delivery methods is correct?
In the FusionInsight HD system, if dirty data is generated while a Loader job is running, the status of the Loader job execution result must be failed.
In the HDFS mechanism, the NameNode is responsible for managing metadata. Each read request on the Client side needs to read metadata information from the metadata disk of the NameNode, so as to obtain the location of the read file in the DataNode.
If there are only the Default, QueueA, and QueueB sub-queues in YARN, then their capacities are allowed to be set to 60%, 25%, and 22% respectively.
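The arithmetic behind this statement can be checked with a short sketch. It assumes the Capacity Scheduler rule that the capacities of sibling queues under one parent must add up to 100%. The function name `capacities_are_valid` is hypothetical and is not part of any YARN API.

```python
def capacities_are_valid(capacities, tolerance=1e-6):
    """Return True if sibling queue capacities (in percent) sum to 100."""
    return abs(sum(capacities.values()) - 100.0) <= tolerance


# Capacities proposed in the statement above, in percent.
proposed = {"Default": 60.0, "QueueA": 25.0, "QueueB": 22.0}

total = sum(proposed.values())
is_valid = capacities_are_valid(proposed)
```

Here `total` exceeds 100, so the check rejects the proposed split under the stated assumption.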
The following describes the HDFS metadata persistence process. Please arrange the steps from top to bottom in the correct order.
An LdapServer Group provides unified group management for users. When a user is added to a group, the user's DN record is added to the member attribute of the group.
What is the main purpose of the data compression feature in Flume?
The main difference between YARN-client and YARN-cluster is the difference between the Application Master process.
Flume's properties.properties configuration file can configure multiple channels to transmit data.
Colocation (co-located distribution) at the file level enables fast access to files and avoids the heavy network overhead caused by data relocation.
FusionInsight is Huawei's unified platform for enterprise-level big data storage, query, and analysis. It can help enterprises quickly build massive data information processing systems, and discover new value points and business opportunities through real-time and non-real-time analysis and mining of massive information data.
In a point-to-point messaging system, data in the queue can be consumed by one or more consumers, but each message can be consumed only once.
When creating a Loader job, in which of the following steps can the filter type be set?
When creating a Collection in Solr, it is recommended to select the compositeId router as the routing algorithm so that the Collection's shards can be expanded.
HFS addresses the mixed scenario in which a large number of small files (below 10 MB) must be stored in HDFS together with some large files (above 10 MB).
Which of the following parameters can be configured to clean up the logs generated in Kafka?
The overall process of Kafka Producer reading data is that the Producer connects to any surviving Broker, requests the leader metadata information of the specified topic and partition, and then directly connects with the corresponding Broker to publish the data.
Redis adopts a non-central self-organizing structure. Nodes use the Gossip protocol to exchange node status information.
What processing capabilities does Elasticsearch have for structured, semi-structured, and unstructured data?
FusionInsight Tool is a set of health detection tools provided for technical support engineers and maintenance engineers. It can check the health status of cluster-related nodes and services, discover potential problems in the cluster in advance, and generate health check reports, so that technical support engineers and maintenance engineers can quickly understand the health status of the system.
Which of the following conversion rules are supported by Loader jobs? (multiple choice)
In the FusionInsight HD product, a typical Kafka cluster contains several producers, several consumers, and a ZooKeeper cluster.
The HMaster needs to be connected when using HBase for data reading service in FusionInsight HD.
When Loader is used for data import and export, data processing must go through the Reduce stage.
Spark is a memory-based computing engine. All data of a Spark program can only be stored in memory while it runs.
Which of the following descriptions about the features of Zookeeper is wrong?
Which nodes must communicate with external data sources before and after a FusionInsight HD Loader job runs?
Regarding the basic operation of Hive table building, which is the correct description?
When a Region in HBase performs a Split operation, at which stage is an HFile actually divided between the two Regions?
Which configuration is not supported by FusionInsight Manager user rights management?
When a ZooKeeper cluster has 5 nodes, its disaster recovery capability is equivalent to that of a cluster with how many nodes?
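ZooKeeper stays available only while a strict majority of its servers is running, so the number of failures an ensemble tolerates can be worked out as below. This is a minimal sketch of that majority rule. The helpers `quorum_size` and `tolerated_failures` are hypothetical names, not part of any ZooKeeper API.

```python
def quorum_size(ensemble_size):
    """Smallest number of servers that forms a strict majority."""
    return ensemble_size // 2 + 1


def tolerated_failures(ensemble_size):
    """Number of servers that can fail while a majority is still running."""
    return ensemble_size - quorum_size(ensemble_size)


# Failure tolerance for ensembles of 3 to 7 servers.
tolerance_by_size = {n: tolerated_failures(n) for n in range(3, 8)}
```

Under this rule an even-sized ensemble tolerates no more failures than the odd-sized ensemble one server smaller, which is why odd sizes are normally chosen.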
What kind of computing tasks is the MapReduce component in Hadoop good at?
Which of the following components does a Kafka cluster directly depend on at runtime?
Which of the following descriptions about FusionInsight CTBase is incorrect?
A. CTBase's data read/write interface uniformly encapsulates the defined interfaces and automatically merges and parses cold fields, so the application does not need to merge or parse them.
B. CTBase is a cluster table development framework based on HBase
C. CTBase provides a set of WebUI for metadata definition and a dedicated table design tool to reduce the difficulty of table design
D. CTBase's Java API provides a set of interfaces for HBase connection pool management. Connections are shared internally to reduce the difficulty of client application development.
In the FusionInsight HD system, which component does the Flume data flow not need to pass through within a node?
Which description of the Hive architecture in FusionInsight HD is wrong?
Which of the following descriptions about Hive log collection on the FusionInsight Manager interface is incorrect?
When Loader in FusionInsight HD imports files from the SFTP server, which of the following file types does not require encoding conversion and data conversion and is the fastest?
Which statement about FusionInsight Manager's service configuration function is incorrect?
Streaming event listening is mainly implemented through which of the following services provided by ZooKeeper?
The figure below shows the data transmission architecture of Flume. What is the component at the "?" in the figure?
In the Flink technical architecture, ( ) is a computing engine for stream processing and batch processing.
Which instance must the Loader instance be deployed with in FusionInsight HD?
Which of the following is not a characteristic of the MapReduce component in Hadoop?
Regarding the comparison between Hive and traditional data warehouse, which of the following descriptions is wrong?
What does FusionInsight HD HBase use by default as its underlying file storage system?
In an MRS cluster, which of the following components does Spark mainly interact with?
Which statement about the DataNode of HDFS in the Huawei FusionInsight HD system is correct?
What is the module used to manage the active and standby status of the Loader Server process in Loader?
In which of the following stages is Flink's data transformation operation completed?
When planning and deploying a FusionInsight cluster, it is recommended to deploy ( ) management nodes, at least ( ) control nodes, and at least ( ) data nodes.
Which of the following descriptions of the Supervisor in FusionInsight HD Streaming is correct?
Which of the following descriptions of the order in which the YARN scheduler allocates resources is correct?