| Snaprecruit.com

| Snaprecruit.com

Interview question based on skill :

Take as many assements as you can to improve your validate your skill rating

Total Questions: 20

1. On a tasktracker, the map task passes the split to the createRecordReader() method on InputFormat to obtain a _________ for that split.

Correct Answer is : RecordReader

2. The default InputFormat is __________ which treats each value of input a new value and the associated key is byte offset.

Correct Answer is : TextInputFormat

3. __________ controls the partitioning of the keys of the intermediate map-outputs.

Correct Answer is : Partitioner

4. Output of the mapper is first written on the local disk for sorting and _________ process.

Correct Answer is : shuffling

5. The HDFS client software implements __________ checking on the contents of HDFS files.

Correct Answer is : checksum

6. Point out the correct statement :

Correct Answer is : The HDFS architecture is compatible with data rebalancing schemes

7. The ___________ machine is a single point of failure for an HDFS cluster.

Correct Answer is : NameNode

8. The ____________ and the EditLog are central data structures of HDFS.

Correct Answer is : FsImage

9. Point out the wrong statement :

Correct Answer is : HDFS is designed to support small files only

10. __________ support storing a copy of data at a particular instant of time.

Correct Answer is : Snapshots

11. Automatic restart and ____________ of the NameNode software to another machine is not supported.

Correct Answer is : failover

12. HDFS, by default, replicates each data block _____ times on different nodes and on at least ____ racks.

Correct Answer is : 3,2

13. _________ stores its metadata on multiple disks that typically include a non-local file server.

Correct Answer is : NameNode

14. The HDFS file system is temporarily unavailable whenever the HDFS ________ is down.

Correct Answer is : NameNode

15. Which of the following is shortcut for DUMP operator ?

Correct Answer is : \d alias

16. Point out the correct statement:

Correct Answer is : Both the run and exec commands are useful for debugging because you can modify a Pig script in an editor

17. Which of the following command is used to show values to keys used in Pig ?

Correct Answer is : set

18. Use the __________ command to run a Pig script that can interact with the Grunt shell (interactive mode).

Correct Answer is : run

19. Point out the wrong statement:

Correct Answer is : DECLARE defines a Pig macro

20. Which of the following command can be used for debugging ?

Correct Answer is : exec