Hadoop Interview Questions - 11

1. ____________ is used when you want the sink to be the input source for another operation.

A. Collector Tier Event

B. Agent Tier Event

C. Basic

D. All of the mentioned

Correct Answer is : Agent Tier Event

2. ___________ is where you would land a flow (or possibly multiple flows joined together) into an HDFS-formatted file system.

A. Collector Tier Event

B. Agent Tier Event

C. Basic

D. All of the mentioned

Correct Answer is : Collector Tier Event

3. ____________ sink can be a text file, the console display, a simple HDFS path, or a null bucket where the data is simply deleted.

A. Collector Tier Event

B. Agent Tier Event

C. Basic

D. None of the mentioned

Correct Answer is : Basic

4. Flume deploys as one or more agents, each contained within its own instance of :

A. JVM

B. Channels

C. Chunks

D. None of the mentioned

Correct Answer is : JVM

5. _________ is the name of the archive you would like to create.

A. archive

B. archiveName

C. name

D. none of the mentioned

Correct Answer is : archiveName

6. Point out the correct statement :

A. A Hadoop archive maps to a file system directory

B. Hadoop archives are special format archives

C. A Hadoop archive always has a *.har extension

D. All of the mentioned

Correct Answer is : All of the mentioned

7. Using Hadoop Archives in __________ is as easy as specifying a different input filesystem than the default file system.

A. Hive

B. Pig

C. MapReduce

D. All of the mentioned

Correct Answer is : MapReduce
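To make the "different input filesystem" idea concrete, here is a minimal Python sketch that builds the kind of har:// URI a MapReduce job would be given as its input path. The helper name har_uri and the sample paths are hypothetical; the URI layout follows the form described in the Hadoop Archives guide (har://scheme-hostname:port/archivepath/fileinarchive, or har:///archivepath/fileinarchive to use the default file system).

```python
from typing import Optional


def har_uri(archive_path: str, inner_path: str = "",
            scheme: Optional[str] = None, host: Optional[str] = None,
            port: Optional[int] = None) -> str:
    """Build a har:// URI (hypothetical helper, for illustration only)."""
    if not archive_path.startswith("/") or not archive_path.endswith(".har"):
        raise ValueError("archive_path must be absolute and end with .har")
    # An empty authority (har:///...) means "use the default file system".
    authority = ""
    if scheme and host:
        authority = f"{scheme}-{host}"
        if port is not None:
            authority += f":{port}"
    uri = f"har://{authority}{archive_path}"
    if inner_path:
        uri += "/" + inner_path.strip("/")
    return uri


# Sample paths are assumptions chosen for the example.
print(har_uri("/user/zoo/foo.har", "dir1"))
print(har_uri("/user/zoo/foo.har", "dir1", "hdfs", "namenode", 8020))
```

The first call yields a URI on the default file system; the second names the underlying file system explicitly.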

8. The __________ guarantees that excess resources taken from a queue will be restored to it within N minutes of its need for them.

A. capacitor

B. scheduler

C. datanode

D. none of the mentioned

Correct Answer is : scheduler

9. Point out the wrong statement :

A. The Hadoop archive exposes itself as a file system layer

B. Hadoop archives are immutable

C. Archive rename, deletes and creates return an error

D. None of the mentioned

Correct Answer is : None of the mentioned

10. _________ is a pluggable Map/Reduce scheduler for Hadoop which provides a way to share large clusters.

A. Flow Scheduler

B. Data Scheduler

C. Capacity Scheduler

D. None of the mentioned

Correct Answer is : Capacity Scheduler

11. Which of the following parameters describes the destination directory that will contain the archive ?

A. -archiveName

B.

C.

D. none of the mentioned

Correct Answer is :

12. _________ identifies filesystem path names which work as usual with regular expressions.

A. -archiveName

B.

C.

D. none of the mentioned

Correct Answer is : none of the mentioned

13. __________ is the parent argument used to specify the relative path to which the files should be archived.

A. -archiveName

B. -p

C.

D.

Correct Answer is : -p
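The archive arguments covered in the questions above fit together in one command line. As a rough sketch, the Python helper below assembles it in the order documented in the Hadoop Archives guide: hadoop archive -archiveName name -p parent src* dest. The function name build_archive_command and the sample paths are hypothetical, and the snippet only builds the argument list; it does not run Hadoop.

```python
import shlex
from typing import List


def build_archive_command(archive_name: str, parent: str,
                          sources: List[str], dest: str) -> List[str]:
    """Assemble the argv for `hadoop archive` (hypothetical helper)."""
    if not archive_name.endswith(".har"):
        raise ValueError("archive name should end with .har")
    if not sources:
        raise ValueError("at least one source path is required")
    # Source paths are given relative to the -p parent directory;
    # the destination directory comes last.
    return ["hadoop", "archive", "-archiveName", archive_name,
            "-p", parent, *sources, dest]


# Sample values are assumptions chosen for the example.
cmd = build_archive_command("foo.har", "/user/hadoop", ["dir1", "dir2"], "/user/zoo")
print(shlex.join(cmd))
```

With these sample values, /user/hadoop/dir1 and /user/hadoop/dir2 would be archived into /user/zoo/foo.har.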

14. Mapper implementations are passed the JobConf for the job via the ________ method.

A. JobConfigure.configure

B. JobConfigurable.configure

C. JobConfigurable.configureable

D. None of the mentioned

Correct Answer is : JobConfigurable.configure

15. Input to the _______ is the sorted output of the mappers.

A. Reducer

B. Mapper

C. Shuffle

D. All of the mentioned

Correct Answer is : Reducer

16. The right number of reduces seems to be :

A. 0.9

B. 0.8

C. 0.36

D. 0.95

Correct Answer is : 0.95
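The figure 0.95 is a multiplier, not a count: the MapReduce tutorial suggests 0.95 or 1.75 multiplied by (number of nodes * maximum containers per node). A small sketch of that arithmetic follows; the function name, the mode labels, and the choice to round down are assumptions made for this example.

```python
from fractions import Fraction

# Multipliers from the MapReduce tutorial; exact fractions avoid float rounding.
FACTORS = {
    "one_wave": Fraction(95, 100),    # all reduces can launch immediately
    "two_waves": Fraction(175, 100),  # faster nodes run a second wave of reduces
}


def suggested_reduces(nodes: int, max_containers_per_node: int,
                      mode: str = "one_wave") -> int:
    """Rule-of-thumb reduce count (hypothetical helper, rounds down)."""
    total_containers = nodes * max_containers_per_node
    return int(FACTORS[mode] * total_containers)


# A hypothetical 10-node cluster with 4 containers per node.
print(suggested_reduces(10, 4))
print(suggested_reduces(10, 4, "two_waves"))
```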

17. Point out the wrong statement :

A. Reducer has 2 primary phases

B. Increasing the number of reduces increases the framework overhead, but increases load balancing and lowers the cost of failures

C. It is legal to set the number of reduce-tasks to zero if no reduction is desired

D. The framework groups Reducer inputs by keys (since different mappers may have output the same key) in sort stage

Correct Answer is : Reducer has 2 primary phases

18. The output of the _______ is not sorted in the MapReduce framework for Hadoop.

A. Mapper

B. Cascader

C. Scalding

D. None of the mentioned

Correct Answer is : None of the mentioned

19. Which of the following phases occur simultaneously ?

A. Shuffle and Sort

B. Reduce and Sort

C. Shuffle and Map

D. All of the mentioned

Correct Answer is : Shuffle and Sort
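Shuffle and sort overlap because map outputs are merged while they are still being fetched. The following is an in-memory Python simulation of that idea, not Hadoop code: each hypothetical mapper output is already sorted by key, the reduce side lazily merges the streams, and equal keys are grouped before the reducer sees them. All names and sample data are assumptions.

```python
import heapq
from itertools import groupby
from operator import itemgetter

# Key-sorted (key, value) output of three hypothetical mappers for one partition.
map_outputs = [
    [("apple", 1), ("cat", 1)],
    [("apple", 1), ("bat", 1)],
    [("bat", 1), ("cat", 1), ("cat", 1)],
]


def reduce_side(outputs, reducer):
    """Merge sorted map outputs, group by key, and apply the reducer."""
    # heapq.merge interleaves the sorted streams lazily, standing in for
    # merging map outputs as they arrive (shuffle and sort together).
    merged = heapq.merge(*outputs, key=itemgetter(0))
    results = []
    for key, group in groupby(merged, key=itemgetter(0)):
        results.append(reducer(key, (value for _, value in group)))
    return results


def sum_reducer(key, values):
    """Example reducer: total the values seen for one key."""
    return key, sum(values)


print(reduce_side(map_outputs, sum_reducer))
```

The reducer receives each key once, with all of its values, even though the same key was emitted by different mappers.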

20. Mapper and Reducer implementations can use the ________ to report progress or just indicate that they are alive.

A. Partitioner

B. OutputCollector

C. Reporter

D. All of the mentioned

Correct Answer is : Reporter
