Snaprecruit.com

Interview questions based on skill:

Take as many assessments as you can to improve and validate your skill rating.

Total Questions: 20

1. Which of the following Hadoop streaming command option parameters is required?

Correct Answer is : All of the mentioned

2. To set an environment variable in a streaming command use:

Correct Answer is : -cmdenv EXAMPLE_DIR=/home/example/dictionaries/
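On the receiving side, a streaming mapper or reducer sees a variable passed with -cmdenv as an ordinary environment variable of its process. A minimal sketch in Python, assuming the script runs as the streaming command; the function name `dictionary_dir` and the fallback path are hypothetical, and only the variable name EXAMPLE_DIR comes from the answer above:

```python
import os

def dictionary_dir(default="/tmp/dictionaries/"):
    """Return the directory passed via -cmdenv EXAMPLE_DIR=..., or a fallback.

    The fallback path is a made-up placeholder for running the script
    outside a streaming job.
    """
    return os.environ.get("EXAMPLE_DIR", default)
```

Reading the value through a small helper like this keeps the script usable in local tests, where the variable is usually not set.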

3. Point out the wrong statement :

Correct Answer is : CombineFileInputFormat does not compromise the speed at which it can process the input in a typical MapReduce job

4. The ________ option allows you to copy jars locally to the current working directory of tasks and automatically unjar the files.

Correct Answer is : archives

5. ______________ class allows the Map/Reduce framework to partition the map outputs based on certain key fields, not the whole keys.

Correct Answer is : KeyFieldBasedPartitioner
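The idea of partitioning on a key prefix rather than the whole key can be sketched in a few lines of Python. This is an illustration of the concept only, not Hadoop's implementation: the function name, the "." separator, and the use of `zlib.crc32` as the hash are all assumptions made for the example.

```python
import zlib

def partition_for(key, num_partitions, num_key_fields=2, separator="."):
    """Pick a partition from the first num_key_fields fields of the key.

    zlib.crc32 is a stand-in hash chosen because it is deterministic;
    it is not the hash function Hadoop uses.
    """
    fields = key.split(separator)
    prefix = separator.join(fields[:num_key_fields])
    return zlib.crc32(prefix.encode("utf-8")) % num_partitions
```

With this sketch, the keys "11.12.1.2" and "11.12.1.1" share the prefix "11.12" and therefore land in the same partition, even though the full keys differ.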

6. Which of the following classes provides a subset of the features provided by the Unix/GNU sort?

Correct Answer is : KeyFieldBasedComparator
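The kind of ordering meant here resembles `sort -k2,2nr`: compare records on one key field, numerically, in reverse. A rough Python sketch of that behaviour follows; the function name, its parameters, and the "." separator are hypothetical and do not mirror the Hadoop class's API.

```python
def sort_by_field(lines, field_index=1, separator=".", numeric=True, reverse=True):
    """Sort lines on a single separator-delimited field.

    Defaults imitate `sort -t. -k2,2nr`: second field, numeric, descending.
    """
    def sort_key(line):
        value = line.split(separator)[field_index]
        return int(value) if numeric else value

    return sorted(lines, key=sort_key, reverse=reverse)
```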

7. Which of the following classes is provided by the Aggregate package?

Correct Answer is : Reducer

8. Hadoop has a library class, org.apache.hadoop.mapred.lib.FieldSelectionMapReduce, that effectively allows you to process text data like the Unix ______ utility.

Correct Answer is : cut
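For readers unfamiliar with cut, the field-selection idea can be sketched in Python as below. The function name and zero-based index list are assumptions for the example, and unlike `cut -f` this sketch emits fields in the order requested rather than in input order.

```python
def select_fields(line, indexes, separator="\t"):
    """Keep only the fields at the given zero-based positions."""
    fields = line.split(separator)
    return separator.join(fields[i] for i in indexes)
```

For instance, selecting positions 0 and 2 from a four-column tab-separated line keeps the first and third columns.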

9. ___________ takes node and rack locality into account when deciding which blocks to place in the same split

Correct Answer is : CombineFileInputFormat

10. Point out the correct statement :

Correct Answer is : All of the mentioned

11. The key, a ____________ is the byte offset within the file of the beginning of the line.

Correct Answer is : LongWritable
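The key/value pairing described here, as produced by Hadoop's TextInputFormat, can be imitated in plain Python to see what the offsets look like. This is a sketch under simplifying assumptions (the whole input is held in memory and decoded as UTF-8); the function name is hypothetical.

```python
def lines_with_offsets(data):
    """Return (byte_offset, line_text) pairs for a bytes object.

    The offset is the position of the first byte of each line,
    counted from the start of the data.
    """
    records = []
    offset = 0
    for raw in data.splitlines(keepends=True):
        records.append((offset, raw.rstrip(b"\r\n").decode("utf-8")))
        offset += len(raw)  # advance past the line and its terminator
    return records
```

Note that the key is a byte offset, not a line number, so it grows by the length of each line including its terminator.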

12. _________ is the output produced by TextOutputFormat, Hadoop's default OutputFormat.

Correct Answer is : KeyValueTextOutputFormat

13. Point out the wrong statement :

Correct Answer is : SequenceFileAsTextInputFormat is a variant of SequenceFileInputFormat that retrieves the sequence file’s keys and values as opaque binary objects.

14. __________ is a variant of SequenceFileInputFormat that converts the sequence file’s keys and values to Text objects

Correct Answer is : SequenceFileAsTextInputFormat

15. __________ class allows you to specify the InputFormat and Mapper to use on a per-path basis.

Correct Answer is : MultipleInputs

16. ___________ is an input format for reading data from a relational database, using JDBC.

Correct Answer is : DBInputFormat

17. Which of the following is the default output format?

Correct Answer is : TextOutputFormat

18. Which of the following writes MapFiles as output?

Correct Answer is : MapFileOutputFormat

19. __________ storage is a solution to decouple growing storage capacity from compute capacity.

Correct Answer is : Archival

20. Point out the correct statement :

Correct Answer is : When there is enough space, block replicas are stored according to the storage type list