Hadoop interview questions part 33

Hadoop interview questions part 33

Take as many assements as you can to improve your validate your skill rating

Total Questions: 20

1. During merging, __________ now always checks the incoming segments for corruption before merging.

Correct Answer is : IndexWriter

2. Heap usage during IndexWriter merging is also much lower with the new :

Correct Answer is : Lucene50Codec

3. Point out the wrong statement :

Correct Answer is : ConcurScheduler detects whether the index is on SSD or not

4. PostingsFormat now uses a __________ API when writing postings, just like doc values.

Correct Answer is : pull

5. New ____________ type enables Indexing and searching of date ranges, particularly multi-valued ones.

Correct Answer is : DateRangeField

6. SolrJ now has first class support for __________ API

Correct Answer is : Collections

7. ____________ Collection API allows for even distribution of custom replica properties.

Correct Answer is : BALANCESHARDUNIQUE

8. ____________ can be used to generate stats over the results of arbitrary numeric functions.

Correct Answer is : stats.field

9. Mahout provides ____________ libraries for common and primitive Java collections.

Correct Answer is : Java

10. Point out the correct statement :

Correct Answer is : None of the mentioned

11. _________ does not restrict contributions to Hadoop based implementations.

Correct Answer is : Mahout

12. Mahout provides an implementation of a ______________ identification algorithm which scores collocations using log-likelihood ratio.

Correct Answer is : collocation

13. Point out the wrong statement :

Correct Answer is : None of the mentioned

14. The tokens are passed through a Lucene ____________ to produce NGrams of the desired length.

Correct Answer is : ShingleFilter

15. The _________ collocation identifier is integrated into the process that is used to create vectors from sequence files of text keys and values.

Correct Answer is : llr

16. ____________ generates NGrams and counts frequencies for ngrams, head and tail subgrams.

Correct Answer is : CollocDriver

17. A key of type ___________ is generated which is used later to join ngrams with their heads and tails in the reducer phase.

Correct Answer is : GramKey

18. ________ phase merges the counts for unique ngrams or ngram fragments across multiple documents.

Correct Answer is : CollocCombiner

19. _______ can change the maximum number of cells of a column family.

Correct Answer is : alter

20. Point out the correct statement :

Correct Answer is : You can add a column family to a table using the method addColumn()

Similar Interview Questions

    Search for latest jobs

    Icon
    Icon