Hadoop MCQ Questions And Answers - Flume
Hadoop Flume MCQs: This section focuses on "Flume" in Hadoop. Practice these multiple choice questions (MCQs) to improve the Hadoop skills required for interviews (campus, walk-in, and company interviews), placements, entrance exams, and other competitive examinations.
1. What is true about Apache Flume?
A. Apache Flume is a reliable and distributed system for collecting, aggregating and moving massive quantities of log data.
B. It has a simple yet flexible architecture based on streaming data flows
C. Apache Flume is used to collect log data from log files on web servers and aggregate it into HDFS for analysis.
D. All of the above
Ans : D
Explanation: Apache Flume is a reliable and distributed system for collecting, aggregating and moving massive quantities of log data. It has a simple yet flexible architecture based on streaming data flows. It is commonly used to collect log data from log files on web servers and aggregate it into HDFS for analysis.
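To make the web-server-logs-to-HDFS use case concrete, the fragment below sketches how one agent might follow a log file and write its lines to HDFS. It is only a sketch under stated assumptions: the agent and component names (a1, r1, c1, k1), the log file path, and the HDFS directory are placeholders chosen for illustration, not values from any real deployment.

```properties
# Declare the components of agent a1 (all names are illustrative)
a1.sources = r1
a1.channels = c1
a1.sinks = k1

# Exec source: follow a web server log (placeholder path)
a1.sources.r1.type = exec
a1.sources.r1.command = tail -F /var/log/httpd/access_log
a1.sources.r1.channels = c1

# Memory channel: buffers events between source and sink
a1.channels.c1.type = memory

# HDFS sink: write events into a date-based directory (placeholder path)
a1.sinks.k1.type = hdfs
a1.sinks.k1.hdfs.path = /flume/weblogs/%Y-%m-%d
a1.sinks.k1.hdfs.fileType = DataStream
a1.sinks.k1.hdfs.useLocalTimeStamp = true
a1.sinks.k1.channel = c1
```

The memory channel keeps the sketch short but loses buffered events if the agent stops; a file channel is the usual choice when durability matters.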
2. A Flume agent is a JVM process that has how many components?
A. 3 components
B. 4 components
C. 5 components
D. 6 components
Ans : A
Explanation: A Flume agent is a JVM process that has 3 components: a Flume Source, a Flume Channel and a Flume Sink.
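As an illustration of the three components, here is a minimal single-agent configuration sketch in Flume's properties format. The agent name a1 and the component names r1, c1 and k1 are arbitrary labels chosen for this example, and the netcat source and logger sink are used only because they need no external systems.

```properties
# Name the source, channel and sink of agent a1 (names are illustrative)
a1.sources = r1
a1.channels = c1
a1.sinks = k1

# Source: listen for lines of text on a local TCP port
a1.sources.r1.type = netcat
a1.sources.r1.bind = localhost
a1.sources.r1.port = 44444

# Channel: buffer events in memory between source and sink
a1.channels.c1.type = memory
a1.channels.c1.capacity = 1000
a1.channels.c1.transactionCapacity = 100

# Sink: log each event, which is useful for testing
a1.sinks.k1.type = logger

# Wiring: a source can feed several channels, a sink reads from exactly one
a1.sources.r1.channels = c1
a1.sinks.k1.channel = c1
```

Assuming the file is saved as example.conf, an agent like this is typically started with: bin/flume-ng agent --conf conf --conf-file example.conf --name a1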
3. Which levels of reliability does Flume offer?
A. best-effort delivery
B. end-to-end delivery
C. Both A and B
D. None of the above
Ans : C
Explanation: Flume offers different levels of reliability, including best-effort delivery and end-to-end delivery.
4. Flume carries data between?
A. sources and decorator
B. sources and sinks
C. start and decorator
D. decorator and sinks
Ans : B
Explanation: Flume carries data between sources and sinks.
5. In Flume, the gathering of data can be?
A. scheduled
B. event-driven
C. user-defined
D. Both A and B
Ans : D
Explanation: This gathering of data can either be scheduled or event-driven. Flume has its own query processing engine which makes it easy to transform each new batch of data before it is moved to the intended sink.
6. Flume can also be used to transport event data including, but not limited to, network traffic data, data generated by social media websites, and email messages.
A. TRUE
B. FALSE
C. Can be true or false
D. Can not say
Ans : A
Explanation: True. Flume can also be used to transport event data including, but not limited to, network traffic data, data generated by social media websites, and email messages.
7. Apache Flume 1.3.0 is the fourth release under the auspices of Apache of the so-called ________ codeline.
A. NF
B. NE
C. NG
D. NP
Ans : C
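Explanation: The Flume 1.x releases make up the "NG" (next generation) codeline, which replaced the original Flume 0.9.x design. Flume is used for efficiently collecting, aggregating, and moving large amounts of streaming event data.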
8. Point out the correct statement.
A. Flume is a distributed, reliable, and available service
B. Version 1.5.2 is the eighth Flume release as an Apache top-level project
C. Flume 1.5.2 is production-ready software for integration with hadoop
D. All of the above
Ans : A
Explanation: Flume 1.3.0 has been put through many stress and regression tests, is stable, production-ready software, and is backwards-compatible with Flume 1.2.0.
9. A ____________ is an operation on the stream that can transform the stream.
A. Decorator
B. Source
C. Sink
D. All of the above
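Ans : A
Explanation: A decorator is an operation on the stream that can transform it, for example by compressing data or by adding or removing pieces of information. A source, by contrast, is where the data originates; it can be any data source, and Flume has many predefined source adapters.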
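Decorators belong to the original (pre-1.x) Flume design. In the 1.x "NG" codeline, the comparable way to modify events in flight is an interceptor attached to a source. The fragment below is a sketch of that idea, assuming an agent a1 with a source r1 is already defined elsewhere in the configuration; the interceptor names i1 and i2 and the header key and value are arbitrary labels for this example.

```properties
# Attach two interceptors to source r1; they run in the order listed
a1.sources.r1.interceptors = i1 i2

# i1: add a timestamp header to every event
a1.sources.r1.interceptors.i1.type = timestamp

# i2: add a fixed header (key and value are illustrative)
a1.sources.r1.interceptors.i2.type = static
a1.sources.r1.interceptors.i2.key = datacenter
a1.sources.r1.interceptors.i2.value = dc1
```

Each event leaving r1 then carries both headers before it reaches the channel, where a sink or a channel selector can use them.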
10. Point out the wrong statement.
A. Version 1.4.0 is the fourth Flume release as an Apache top-level project
B. Apache Flume 1.5.2 is a security and maintenance release that disables SSLv3 on all components in Flume that support SSL/TLS
C. Flume is backwards-compatible with previous versions of the Flume 1.x codeline
D. Flume deploys as one or more agents, each contained within its own instance of chunks
Ans : D
Explanation: Flume deploys as one or more agents, each contained within its own instance of the Java Virtual Machine (JVM).