1. On a tasktracker, the map task passes the split to the createRecordReader() method on InputFormat to obtain a _________ for that split.
Correct Answer is : RecordReader
2. The default InputFormat is __________, which treats each line of input as a new value; the associated key is the byte offset of that line.
Correct Answer is : TextInputFormat
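
The key/value shape described in question 2 can be illustrated with a small stand-alone sketch. It does not use Hadoop; the function name text_records is hypothetical, and it only mimics the idea that each line becomes a value keyed by the byte offset at which the line starts.

```python
def text_records(data: bytes):
    """Yield (byte_offset, line_text) pairs, one per line of input.

    Hypothetical helper: it imitates the (offset, line) record shape,
    not Hadoop's actual TextInputFormat implementation.
    """
    offset = 0
    for raw in data.splitlines(keepends=True):
        # The key is the offset of the first byte of the line;
        # the value is the line without its trailing newline.
        yield offset, raw.rstrip(b"\r\n").decode("utf-8")
        offset += len(raw)


records = list(text_records(b"alpha\nbeta\ngamma\n"))
for key, value in records:
    print(key, value)
```

Note that the keys are byte offsets, not line numbers, so they depend on the length of every preceding line.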
3. __________ controls the partitioning of the keys of the intermediate map-outputs.
Correct Answer is : Partitioner
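
As a rough illustration of question 3, partitioning can be sketched as a function from an intermediate key to a reducer index. Hadoop's default HashPartitioner derives the index from the key's Java hashCode(); the sketch below swaps in CRC32 as an assumption so that the result is deterministic in Python, and the names partition and num_reducers are hypothetical.

```python
import zlib


def partition(key: str, num_reducers: int) -> int:
    """Map a key to a reducer index in the range [0, num_reducers).

    Assumption: CRC32 stands in for the key's hash; masking with
    0x7FFFFFFF keeps the value non-negative before the modulo.
    """
    return (zlib.crc32(key.encode("utf-8")) & 0x7FFFFFFF) % num_reducers


keys = ["apple", "banana", "cherry", "apple"]
assignments = [(k, partition(k, 4)) for k in keys]
print(assignments)
```

The property that matters is that equal keys always map to the same index, so all values for one key reach the same reducer.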
4. The output of the mapper is first written to the local disk for the sorting and _________ process.
Correct Answer is : shuffling
5. The HDFS client software implements __________ checking on the contents of HDFS files.
Correct Answer is : checksum
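
The idea behind question 5 can be sketched as follows: checksums are computed over fixed-size chunks when data is written, stored alongside it, and recomputed on read to detect corruption. This is a simplified stand-alone model, not the HDFS client code; the 512-byte chunk size and the names checksums and verify are assumptions chosen for illustration.

```python
import zlib

CHUNK = 512  # assumed chunk size in bytes, for illustration only


def checksums(data: bytes, chunk: int = CHUNK) -> list:
    """Compute one CRC32 value per fixed-size chunk of the data."""
    return [zlib.crc32(data[i:i + chunk]) for i in range(0, len(data), chunk)]


def verify(data: bytes, expected: list, chunk: int = CHUNK) -> bool:
    """Recompute the checksums on read and compare with the stored ones."""
    return checksums(data, chunk) == expected


original = b"x" * 1300
stored = checksums(original)  # saved at write time

# Flip a single byte to simulate corruption in the second chunk.
corrupted = original[:700] + b"y" + original[701:]

print(verify(original, stored))
print(verify(corrupted, stored))
```

In this model a failed verification only detects the problem; choosing another copy of the data to read from is outside the sketch.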
6. Point out the correct statement:
Correct Answer is : The HDFS architecture is compatible with data rebalancing schemes
7. The ___________ machine is a single point of failure for an HDFS cluster.
Correct Answer is : NameNode
8. The ____________ and the EditLog are central data structures of HDFS.
Correct Answer is : FsImage
9. Point out the wrong statement:
Correct Answer is : HDFS is designed to support small files only
10. __________ support storing a copy of data at a particular instant of time.
Correct Answer is : Snapshots
11. Automatic restart and ____________ of the NameNode software to another machine is not supported.
Correct Answer is : failover
12. HDFS, by default, replicates each data block _____ times on different nodes and on at least ____ racks.
13. _________ stores its metadata on multiple disks that typically include a non-local file server.
Correct Answer is : NameNode
14. The HDFS file system is temporarily unavailable whenever the HDFS ________ is down.
Correct Answer is : NameNode
15. Which of the following is a shortcut for the DUMP operator?
Correct Answer is : \d alias
16. Point out the correct statement:
Correct Answer is : Both the run and exec commands are useful for debugging because you can modify a Pig script in an editor
17. Which of the following commands is used to show the values assigned to keys in Pig?
18. Use the __________ command to run a Pig script that can interact with the Grunt shell (interactive mode).
19. Point out the wrong statement:
Correct Answer is : DECLARE defines a Pig macro
20. Which of the following commands can be used for debugging?