Hadoop Lucene MCQs

This section focuses on "Lucene" in Hadoop. Practice these Multiple Choice Questions (MCQs) to improve the Hadoop skills required for interviews (campus, walk-in, and company interviews), placements, entrance exams, and other competitive examinations.

1.  ___________ provides Java-based indexing and search technology.

A. Solr
B. Lucene Core
C. Lucene
D. None of the above

2. Point out the correct statement.

A. Building PyLucene requires GNU Make, a recent version of Ant capable of building Java Lucene, and a C++ compiler
B. PyLucene is supported on Mac OS X, Linux, Solaris and Windows
C. Use of setuptools is recommended for Lucene
D. All of the above

3. ____________ is a subproject with the aim of collecting and distributing free materials.


4. Lucene provides scalable, high-performance indexing at over ______ per hour on modern hardware.

A. 1GB
B. 150GB
C. 1TB
D. 150TB

5. Lucene index size is roughly _______ times the size of the text indexed.

A. 0.1
B. 0.15
C. 0.2
D. 0.25

6. All file access uses Java's __________ APIs, which give Lucene stronger index safety.

A. NIO.1
B. NIO.2
C. NIO.3
D. NIO.4

7. Heap usage during IndexWriter merging is also much lower with the new _________.

A. LucCodec
B. Lucene50Codec
C. Lucene20Cod
D. All of the above

8. PostingsFormat now uses a __________ API when writing postings, just like doc values.

A. read
B. write
C. push
D. pull

9. SolrJ now has first-class support for the __________ API.

A. Compactions
B. Collection
C. Distribution
D. All of the above

10. ____________ can be used to generate stats over the results of arbitrary numeric functions.

A. stats.field
B. sta.field
C. stats.value
D. stat.value

