New Year Special Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: 70special

Huawei H13-711_V3.0 HCIA-Big Data V3.0 Exam Practice Test

Page: 1 / 65
Total 649 questions

HCIA-Big Data V3.0 Questions and Answers

Testing Engine

  • Product Type: Testing Engine
$37.5  $124.99

PDF Study Guide

  • Product Type: PDF Study Guide
$33  $109.99
Question 1

What is the basis for Stage division when Spark applications are running?

Options:

A.

task

B.

taskSet

C.

action

D.

shuffle

Question 2

Which of the following scenarios in HBase will trigger the F1ush operation?

Options:

A.

HBasePeriod refreshMemstore, silentthink period is 1Hour

B.

When the number of files in WALs reaches a threshold

C.

The total size of the MemStore in the Region has reached the preset Flush Size threshold

D.

The ratio of the total memory occupied by the MemStore to the total memory of the RegionServer exceeds the preset threshold size

Question 3

What are the performance bottlenecks of traditional data processing?

Options:

A.

High cost of data storage

B.

Insufficient streaming data processing performance

C.

Limited scalability

D.

Batch data processing is missing

Question 4

Which of the following descriptions about the HBase storage model is correct?

Options:

A.

Each Key Value has a Qualifier identifier

B.

The same key value key is associated with multiple values

C.

KeyValuel has key information such as timestamp and type

D.

Even if there are multiple Key Values with the same key value and the same Qualifier, there may be more than one. In this case, the time stamp is used to distinguish

Question 5

In Hadoop, if yarn.scheduler.capacity.root.QueueA. minimum-user-limitpercent is set to 50, which of the following statements is wrong?

Options:

A.

A user submits a task and can use 100% of QueueA's resources.

B.

QueueA must ensure that each user gets at least 50% of the resources.

C.

If there are already 2 user tasks running in QueueA, then the task submitted by the third user needs to wait for the release of resources.

D.

Each user in QueueA can only get up to 50% of the resources.

Question 6

Which of the following descriptions about the characteristics of Kafka Partition replicas is correct?

Options:

A.

Follower synchronizes data from Leader by pulling

B.

The master copy is called Leader, and the slave copy is called Follower

C.

Both consumers and producers read and write data from the Leader, and can also interact directly with the Follower

D.

Replicas are in units of partitions. Each partition has its own slave replica of the master

Question 7

Which of the following indicators belong to flume data monitoring?

Options:

A.

The amount of data received by the source

B.

The amount of data written by the sink

C.

Number of DataNodes

D.

The amount of cached data in the channel

Question 8

Which of the following statements about Fusioninsight HBasel visual modeling are correct?

Options:

A.

Visual modeling helps DBAs in modeling design and lowers the threshold for using HBase

B.

Qualifier HBase column: each column represents an attribute of business data

C.

Realize the division of labor: DBAs focus on data table modeling, developers focus on user table names and columns used

D.

Column user table column: each column represents a KeyValue

Question 9

Hive is a data warehouse software based on hadoopE, which can query and manage PB-level distributed data. Which of the following descriptions about Hive features is incorrect?( )

Options:

A.

Only supports MapReduce computing engine

B.

Easy to use and easy to program

C.

Direct access to HDFS files and HBase

D.

Flexible and convenient ETL (extract/transform/load)

Question 10

Options:

A.

redistributing stream

B.

one-to-one

C.

one-to-many stream

D.

distributingi flow

Question 11

NodeManager memory andCThe number of PUs is configured by which of the following options?

Options:

A.

YARN.scheduler.capacity.root.QueueA. maximum-capacity

B.

YARN.nodemanager.resource.cpu-vcore

C.

YARN.nodemanager.vmem-pmom-ratio

D.

YARN.modemanager.resource.memory-mb

Question 12

Which of the following options are Huawei Kunpeng industrial products?

Options:

A.

Intelligent management chip

B.

Smart network card chip

C.

Smart SSD Controller Chip

D.

Huawei Kunpeng processor

Question 13

Which operations in Hive can be merged?

Options:

A.

UNION ALL

B.

GROUP BY

C.

SELECT

D.

JOIN

Question 14

What are the storage formats supported by Hive in the Fusioninsight HD system?

Options:

A.

HFile

B.

TextFile

C.

SequenceFile

D.

RCFile

Question 15

Which service process manages the Region of HBasel?

Options:

A.

DataNode

B.

ZooKeeper

C.

HMaster

D.

HRegionServer

Question 16

Which of the following options belong to Fusioninsight data security?

Options:

A.

Operating system security hardening

B.

Component data encryption

C.

data integrity check

D.

User authority authentication management

Question 17

What are the storage formats supported by Hive?

Options:

A.

TextFile

B.

RCFile

C.

Parquet

D.

SequenceFile

Question 18

What services can Huawei DWS provide to customers?

Options:

A.

Support GDS tools to speed up data storage

B.

Ensure high reliability of data and systems

C.

Trillions of data correlation analysis seconds response

D.

Unified management console

Question 19

Which components in FusioninsightHD can Zookeeper provide distributed management support for?

Options:

A.

HBase

B.

Loader

C.

Hive

D.

Spark

Question 20

What are the key features of Streaming in Huawei's big data product Fusioninsight HD?

Options:

A.

flexibility

B.

Scalability

C.

Disaster recovery capability

D.

message reliability

Question 21

When Fusionlnsight HD products deploy Kerberos and LDAP services, which of the following descriptions are correct?

Options:

A.

Before deploying Kerberos services, LD must be deployedAP service

B.

The LDAP service must be deployed on the same node as the Kerberos service

C.

Deploying the same node for Kerberos service and LDAP service facilitates data access and improves performance

D.

LDAP services can be shared by multiple clusters

Question 22

Which of the following sub-products are included in the Fusioninsight family

Options:

A.

Fusioninsight Miner

B.

Fusioninsight Farmer

C.

Fusioninsight HD

D.

GaussDB 200

Question 23

In Huawei's big data solution, which of the following components are included in the hadoop layer?

Options:

A.

Miner

B.

Spark

C.

Hive

D.

Flink

Question 24

What are the following main functions of Fusioninsight Manager?

Options:

A.

safety management

B.

data integration

C.

System pipe burial

D.

Service Governance

Question 25

The following statements about Kafka Partitioni are correct

Options:

A.

The introduction of the Partition mechanism ensures the high throughput of Kafkal

B.

Each Partition is an ordered and immutable message queue

C.

The number of Partitions determines the maximum number of concurrent consumers in each consumer group

D.

Each Partition corresponds to a log file at the storage level

Question 26

What information does a Key Value format in the HBase data file HFiler contain?

Options:

A.

Key

B.

Value

C.

Timestamp

D.

KeyType

Question 27

Fusionlnsight HD cluster upgrade, which of the following descriptions are correct?

Options:

A.

During the upgrade process, you cannot manually perform the active/standby OMS switchover.

B.

The root account password of all hosts in the cluster must be the same

C.

Keep the network unblocked. Avoid abnormal upgrades due to network problems

D.

No expansion can be done during the observation period

Question 28

In the Fusioninsight product, which statement is correct about the Kafka component?

Options:

A.

When creating a topic, the number of replicas must not be greater than the number of currently surviving Broker instances, otherwise the topic creation will fail

B.

When the Producer of Kafkal sends a message, it can specify which Consumer consumes the message

C.

Kafka will store metadata information in Zookeeper for

D.

After Kafka is installed, the sensitive data storage directory cannot be configured.

Question 29

In Kafka, the following statement about the data sent by the Producer is wrong

Options:

A.

The Producer is responsible for producing data, and the Consumer is responsible for consuming data. A connection needs to be established between the Producer and the Consumer.

B.

Proucer can send data to both Broker and Consumer

C.

As a message producer, Proucer can directly write data to Zookeeper

D.

Proucer can generate data by connecting to any normal Broker instance

Question 30

What steps are included in the preparation for Fusioninsight HD installation?

Options:

A.

Complete the hardware installation

B.

Complete the node host OS installation

C.

Prepare tools and software. Such as Putty, LLD, Fusioninsight HD software installation package, etC.

D.

Prepare planning data. such as network parameters and role deployment locations

Question 31

Which of the following options is a key feature of the HDFS architecture?

Options:

A.

robust mechanism

B.

multiple access mechanism

C.

HA high reliability

D.

Metadata Persistence Mechanism

Question 32

What can the unified certification management system of mainstream manufacturers consist of?

Options:

A.

Unified Authentication Server

B.

Unified authentication management module

C.

Unified Safety Management System

D.

Identity information storage server

Question 33

What are the following main functions of FusionInsightManager?

Options:

A.

data integration

B.

System Management

C.

safety management

D.

Service Governance

Question 34

A user information table meg is saved in HBaser of Fusioninsight HD_table, Rowkey is the user id, one of which is the name of the user, now write three KeyValues to this column in sequence:001:Li, 001:Mary001:Lily, may I ask scanmeg_table', (VERSION=>2) which pieces of data will be returned?

Options:

A.

001:Lily

B.

001:Li

C.

001:Mary, 001:Lily

D.

001:Li, 001:Mary, 001:Lily

Question 35

In the WebHCat architecture, which of the following operations can users perform over the secure HTTPS protocol

Options:

A.

Perform Hive DDL operations

B.

Run MapReduce tasks

C.

Run Hive HQL task

D.

everything aboveA. True

Question 36

What is the module used to manage the active and standby status of the Loader Server process in Loader?

Options:

A.

Job Scheduler

B.

HA Manager

C.

Job Manager

D.

Resource Manager

Question 37

When installing the Streaming component of FusionlnsightHD, the Nimbus role requires several nodes to be installed

Options:

A.

3

B.

2

C.

4

D.

1

Question 38

What is the data scale unit of traditional data processing?

Options:

A.

TB

B.

EB

C.

PB

D.

GB

Question 39

The picture below shows Sparke&MapReduce performance comparison data, it can be concluded that compared with MapReducei computing, Spark uses( )resources, get( )double the performance?

Options:

A.

1/8, 3

B.

1/10, 3

C.

1/10, 4

D.

1/8, 4

Question 40

In the Fusioninsight HD system, which of the following methods cannot view the execution result of the Loader job?

Options:

A.

Richard Suga via YARN tasks

B.

Check through the Loader UI interface

C.

View through the alarm of Managerb

D.

View through NodeManageri

Question 41

Which of the following HDFS commands can be used to check the integrity of data blocks?

Options:

A.

HDFS fsck

B.

HDFS fsck-delete

C.

HDFS dfsadmin -report

D.

HDES balancer -threshold 1

Question 42

Which of the following is incorrect about the BloomFilter feature of HBase?

Options:

A.

used to filter data

B.

Used to optimize the performance of random reads

C.

will increase storage consumption

D.

Can accurately determine that a certain data does not exist

Question 43

RDD has Transformation and Action operators. Which of the following belongs to the Action operator?

Options:

A.

reduceByKey

B.

filter

C.

map

D.

saveAsTextFile

Question 44

As shown in the figure, the following description of the message read by the Kafka message consumer Consumeri is wrong?

Options:

A.

The blue in the picture is a topic of Kafkal, which can be understood as a queue, and each grid represents a message.

B.

The messages generated by the producer are placed at the end of the topic one by one.

C.

Consumers read messages sequentially from right to left.

D.

Consumert uses offset to record the position of the read

Question 45

In Fusionlnsight HD cluster planning, manage nodes&control node&What kind of scenarios is the unified data node deployment solution suitable for?

Options:

A.

Below 30 nodes

B.

More than 100 nodes

C.

100-200 nodes

D.

More than 200 nodes

Question 46

Which module in Hadoop is responsible for data storage in HDFS?

Options:

A.

NameNode

B.

Data Node

C.

ZooKeeper

D.

JobTaoker

Question 47

What application scenarios is HBase not suitable for?

Options:

A.

High throughput application scenarios

B.

Application scenarios requiring full ACD characteristics

C.

Semi-structured data application scenarios

D.

Mass data (TB, PB) application scenarios

Question 48

Which of the following options is correct about the meaning of Zookeeper reliability?

Options:

A.

Reliability is achieved through active and standby deployment

B.

Reliability means that the update can only succeed or fail, and there is no intermediate state

C.

Reliability means that no matter which server displays the same view to the outside world

D.

Reliability means that a message is received by one server, and it will be accepted by all servers

Question 49

Fusioninsight HD Manager interface Hive log collection. Which option is incorrect?

Options:

A.

You can specify an instance for log collection, such as specifying a separate collection of MetaStorel logs

B.

You can specify a time period for log collection, for example, only collect logs from 2016-1-1 to 2016-140

C.

The node IP can be specified for log collection, for example, only the logs of a certain IP can be downloaded

D.

A specific user can be specified for log collection, for example, only the logs generated by userA can be downloaded

Question 50

Which of the following is not included in the schemai authentication method of Zookeeper?

Options:

A.

sasl

B.

auth

C.

digest

D.

world

Question 51

Which of the following descriptions about the key features of F1ink is incorrect?

Options:

A.

SparkStreaming - lower latency compared to Flink

B.

The Flink stream processing engine can provide functions to support both stream processing and batch processing applications

C.

Flink has higher throughput compared to Streaming in Fusionlnght HD

D.

Checkpoint implements Flink's fault tolerance

Question 52

In the MRS platform, which component does the F1ume data flow not need to pass through in the node?

Options:

A.

Sink

B.

Channel

C.

LTopic

D.

Source

Question 53

The following descriptions about Kafkaf are wrong( )

Options:

A.

Used as the basis for activity streams and operational data processing pipelines

B.

Developed by Apache Hadoop and open sourced in 2011

C.

It has the characteristics of information persistence, high throughput, real-time, etC.

D.

Implemented using Scala, Java language

Question 54

F1ink in( )interface for streaming data processing,( )interface for batch processing?

Options:

A.

Datastream API, DataSet API

B.

Data batch API.DataStream API

C.

Stream API.Batch API

D.

Batch API, Stream API

Question 55

When creating a Loaderf job, in which of the following steps can the filter type be set?

Options:

A.

input settings

B.

convert

C.

output

D.

Basic Information

Question 56

What data types of application scenarios is HBase of Hadoop not suitable for?

Options:

A.

Large file application scenarios

B.

Mass data application scenarios

C.

High throughput application scenarios

D.

Semi-structured data application scenarios

Question 57

Which of the following types of data is not semi-structured data?

Options:

A.

HTML

B.

XML

C.

two-dimensional table

D.

JSON

Question 58

Which of the following programming languages is Spark implemented in?

AC

B.C++

C. JAVA

D. Scala

Options:

Question 59

When a Regioni in HBaser performs the Split operation, what stage occurs in the process of actually dividing an HFile file into two Regions?

Options:

A.

During Spliti

B.

During Flush

C.

ompactionj process

D.

HFile separation process

Question 60

Which of the following options are suitable for MapReduce?

Options:

A.

Offline computing

B.

real-time interactive computing

C.

Iterative calculation

D.

Streaming Computing

Question 61

Regarding RDD, which of the following statements is wrong?

Options:

A.

RDD has a lineage mechanism (Lineage)

B.

RDDs are stored on disk by default

C.

RDD is a read-only, partitionable distributed dataset

D.

RDD is Spark's abstraction of underlying data

Question 62

In many small file scenarios, Spark will start many tasks. When there is a Shuffle operation in the SQL logic, the number of hash buckets will be greatly increased, which will seriously affect the performance. In Fusioninsight, scenarios for small files usually use the( )Operator to merge partitioni generated by small files in Tabler, reduce the number of partitions, avoid generating too many hash buckets during shuffle, and improve performance?

Options:

A.

group by

B.

coalosce

C.

onnect

D.

join

Question 63

What is the core module of spark?

Options:

A.

spark streaming

B.

spark core

C.

mapreduce

D.

spark sql

Question 64

In the Fusioninsight HD product, which statement about Kafka is incorrect?

Options:

A.

Kafka strongly depends on Zookeeper

B.

The number of instances deployed by Kafka must not be less than 2

C.

Kafkal server can generate messages

D.

Consumer consumes messages as the client role of Kafkal

Question 65

Which nodes are required to communicate with external data sources before and after the Fusioninsight HD Loader job?

Options:

A.

Loader service master node

B.

The node on which the YARN service job is running

C.

Both of the first two are required

D.

Neither of the first two are needed

Question 66

What is wrong about the architecture description of Hive in Fusionlnsight HD?

Options:

A.

As long as one HiveServer is unavailable, the entire HiveEcluster is unavailable

B.

HiveServert is responsible for accepting client requests, parsing, executing HQL commands and returning query results

C.

MetaStore is used to provide raw data services and depends onDBServer

D.

At the same time, only one HiveServer is in Active state, and the other is in Standby state

Question 67

A Kafka cluster contains one or more service instances, which are called( )

Options:

Question 68

Hadoop's NameNode is used to store the metadata of the file system.

Options:

A.

True

B.

False

Question 69

The following is for the LdapSearer organization?Type statement is correct

Options:

A.

Each node in the LdapSearer directory is called?day, and?has its own uniquely distinguishable name?IN (Distingarl shed Mama)

B.

Specific objects cannot be defined under organizational units.

C.

LdapSearer directory information is based on? ?structure to carry out?woven and stored.

D.

The tree of the LdapSearer directory tree generally defines the domain name dc (Dcmain Compen?)

Question 70

When the client of HDFS writes a file, the writing position of the first copy of the data is determined by the NameNode. The writing position of other copies is determined by the Data Nodet.

Options:

A.

True

B.

False

Question 71

As an authentication server center, Kerberos1 can provide unified authentication services to all services in the cluster and secondary development applications of customers.

Options:

A.

True

B.

False

Question 72

The processing logic of topology is in bolt.

Options:

A.

True

B.

False

Question 73

In MapReduce programming, code not only describes what to do, but also how to do it.

Options:

A.

True

B.

False

Question 74

ElWhat processing capabilities does asticSearch have for structured, semi-structured, and unstructured data?

Options:

A.

to enterA series of operations such as line cleaning, word segmentation, and establishment of an inverted index

B.

Provides the ability to search full text, conditions can include words or phrases

C.

The written data can be checked in real timeSow

D.

numberOptional rewrite when data is writtendeleteand compression function

Question 75

Options:

A.

True

B.

False

Question 76

ElasticSeArch expansion fieldKing Packageincluding whichsome?

Options:

A.

The amount of cluster data is reduced

B.

The index data of a single instance of ElasticSearch is too large

C.

The node needs to reinstall the operating system

D.

Excessive consumption of physical resources

Question 77

In the MRS service, the unavailability of the Zookeeper service will result in the unavailability of the kafka service.

Options:

A.

True

B.

False

Question 78

The client writes data to any Zookeeper node. Zookeeper needs to fully synchronize the data to other nodes before returning to the client to write successfully.

Options:

A.

True

B.

False

Question 79

Fl?Me is akind of distributed. highreliable and highavailableofClothesservice. useto be effectivecompare?,polymerization?Move a lot of logs? ?.

Options:

A.

TRUE

B.

FALSE

Question 80

Mapme?program by? ?consists of two parts,? ?program, which has?Piece?Task. Please program the most? ?banknotes? ?

Options:

A.

2

B.

3

C.

5

D.

4

Question 81

The following offin Kafka messagesWhich one of the speed transmission methods is still correct?

Options:

A.

Postingonesubscriptioninformationsystem, the sameNumber of barsData can be consumed by multiple consumers. data isConsumptionnot laterdelete immediately

B.

Distributed Messaginghand overThere are two mainwantofmessage passing pattern,peer to peertransferpattern, haircloth-subscription model

C.

point-to-pointinformation systemmedium, cancanhave multiple consumptionsat the same timeremovefeedata, becauseThis does not guarantee the order in which data is processed.

D.

In a point-to-point messaging system, when a messagefeeByremovefeeteamone of the columnsdataAfter that, thedata rulefromdelete from message queue

Question 82

Which of the following options are action operators?

Options:

A.

mapPartiion

B.

collect

C.

reduceByKe

D.

reduce

Question 83

Redis multi-server nodedeploy, the client sends a requestbeggingtake officemeaning? ?,if? ?check itself is not key placementnode, thenwill requestto itshe?.

Options:

A.

TRUE

B.

FALSE

Question 84

When deploying Solr on Fusioninsight HD, if you choose to store the index on the local disk, it is recommended to mount a separate disk for the data directory of each SolrServer instance, and configure the disk as RAID0 or RAID5.

Options:

A.

True

B.

False

Question 85

According to the way of data acquisition in Flue, Source can be divided into driven Sourcet and( )Source.

Options:

Question 86

The following optionsitem, offAtZooKeeper is reliableSexual inclusionWhich is the correct expression?

Options:

A.

reliability passhostprepareDeployment mode implementation

B.

Reliability means that no matter which servar, the external display is the same? ?

C.

Reliability means that a message is received by a?Ver received, it will be received by all? ?accept

D.

Reliability means more?Can only succeed or fail, no intermediate states

Question 87

In Task Scheduling at YRN. oneAppliedMasterapply to? ?After that, the corresponding? ?communication,wantask it to enlightenmove.

Options:

A.

TRUE

B.

FALSE

Question 88

Spark on YARN-clienti is suitable for production environment because it can see the output of APP faster.

Options:

A.

True

B.

False

Question 89

Which of the following reasons may be the cause of the failure of the Loader job execution?

Options:

A.

Job execution takes too long

B.

Does not comply with data conversion rules

C.

No data compression format specified

D.

No source directory or file specified

Question 90

Redis adopts a non-central self-organizing structure. Nodes use the Gossip protocol to exchange node status information.

Options:

A.

TRUE

B.

FALSE

Question 91

In the Mapreducei process, by default, a shard is a block and a mapTask.

Options:

A.

True

B.

False

Question 92

Sparki, like Hadoop-, is not suitable for generational computing.

Options:

A.

True

B.

False

Question 93

The following aboutHiveComponent capabilities in the architecture. Which is the correct description?

Options:

A.

ThriftServer for thriftcatchmouth, as JDBCservices and willHive and other applicationsintegrated

B.

Compiler pressAccording tomissionaccording toDependent relationship is executed separatelyMap/Raduce task

C.

Executor is responsible for editingTranslate voQLandwillwhich translates into a series of mutualrelyofMap/ReduceTask

D.

Ooptimizer is an optimizer, divided into logicalJiyouoptimizer and physical optimizer.HiveQLGenerated execution planMapRadceTaskEnterrow optimization

Question 94

All messages in Kafka will be persisted to the hard disk. At the same time, Kafka ensures data reliability by setting Replication to Topic Partitio.

Options:

A.

True

B.

False

Question 95

KerbarasAs a certification service center,were ableSecondary development applications to all services in the cluster and customerscarryforsystemA certification service.

Options:

A.

TRUE

B.

FALSE

Question 96

In big data computing tasks, aboutI0Which of the following is an incorrect description of an intensive task?

Options:

A.

CHigh PU consumption

B.

I0During intensive task execution, most of the time is spent in I0deal with

C.

By improving network transmission efficiency and read and write efficiency, performance can be greatly improved

D.

The more tasks, the more efficient the CPU

Question 97

P? ?need? ?receive? ?Confirmation issued after? ?signal, when setting the MC?numbered?How many times means at least used? ?write?Le?, but does not require writing? ?write?

Options:

A.

acka?1

B.

an?1

C.

an?2

D.

an?0

Page: 1 / 65
Total 649 questions