Hadoop Interview Questions - 13

1. _________ is useful for iterating the properties when all deprecated properties for currently set properties need to be present.

A. addResource

B. setDeprecatedProperties

C. addDefaultResource

D. none of the mentioned

Correct Answer is : setDeprecatedProperties

2. Which of the following adds a configuration resource ?

A. addResource

B. setDeprecatedProperties

C. addDefaultResource

D. addResource

Correct Answer is : addResource

3. ________ is a platform for constructing data flows for extract, transform, and load (ETL) processing and analysis of large datasets.

A. Pig Latin

B. Oozie

C. Pig

D. Hive

Correct Answer is : Pig

4. Point out the correct statement :

A. Hive is not a relational database, but a query engine that supports the parts of SQL specific to querying data

B. Hive is a relational database with SQL support

C. Pig is a relational database with SQL support

D. All of the mentioned

Correct Answer is : Hive is not a relational database, but a query engine that supports the parts of SQL specific to querying data

5. _________ hides the limitations of Java behind a powerful and concise Clojure API for Cascading.

A. Scalding

B. HCatalog

C. Cascalog

D. All of the mentioned

Correct Answer is : Cascalog

6. Hive also support custom extensions written in :

A. C#

B. Java

C. C

D. C++

Correct Answer is : Java

7. Point out the wrong statement :

A. Elastic MapReduce (EMR) is Facebook’s packaged Hadoop offering

B. Amazon Web Service Elastic MapReduce (EMR) is Amazon’s packaged Hadoop offering

C. Scalding is a Scala API on top of Cascading that removes most Java boilerplate

D. All of the mentioned

Correct Answer is : Elastic MapReduce (EMR) is Facebook’s packaged Hadoop offering

8. ________ is the most popular high-level Java API in Hadoop Ecosystem

A. Scalding

B. HCatalog

C. Cascalog

D. Cascading

Correct Answer is : Cascading

9. ___________ is general-purpose computing model and runtime system for distributed data analytics.

A. Mapreduce

B. Drill

C. Oozie

D. None of the mentioned

Correct Answer is : Mapreduce

10. The Pig Latin scripting language is not only a higher-level data flow language but also has operators similar to :

A. SQL

B. JSON

C. XML

D. All of the mentioned

Correct Answer is : SQL

11. _______ jobs are optimized for scalability but not latency.

A. Mapreduce

B. Drill

C. Oozie

D. Hive

Correct Answer is : Hive

12. ______ is a framework for performing remote procedure calls and data serialization.

A. Drill

B. BigTop

C. Avro

D. Chukwa

Correct Answer is : Avro

13. Hadoop I/O Hadoop comes with a set of ________ for data I/O.

A. methods

B. commands

C. classes

D. none of the mentioned

Correct Answer is : none of the mentioned

14. Point out the correct statement :

A. The sequence file also can contain a “secondary” key-value list that can be used as file Metadata

B. SequenceFile formats share a header that contains some information which allows the reader to recognize is format

C. There’re Key and Value Class Name’s that allow the reader to instantiate those classes, via reflection, for reading

D. All of the mentioned

Correct Answer is : All of the mentioned

15. Apache Hadoop ___________ provides a persistent data structure for binary key-value pairs.

A. GetFile

B. SequenceFile

C. Putfile

D. All of the mentioned

Correct Answer is : SequenceFile

16. How many formats of SequenceFile are present in Hadoop I/O ?

A. 2

B. 3

C. 4

D. 5

Correct Answer is : 3

17. Point out the wrong statement :

A. The data file contains all the key, value records but key N + 1 must be greater than or equal to the key N

B. Sequence file is a kind of hadoop file based data structure

C. Map file type is splittable as it contains a sync point after several records

D. None of the mentioned

Correct Answer is : Map file type is splittable as it contains a sync point after several records

18. Which of the following format is more compression-aggressive ?

A. Partition Compressed

B. Record Compressed

C. Block-Compressed

D. Uncompressed

Correct Answer is : Block-Compressed

19. The __________ is a directory that contains two SequenceFile.

A. ReduceFile

B. MapperFile

C. MapFile

D. None of the mentioned

Correct Answer is : MapFile

20. The ______ file is populated with the key and a LongWritable that contains the starting byte position of the record.

A. Array

B. Index

C. Immutable

D. All of the mentioned

Correct Answer is : Index

Hadoop interview questions part 13

Hadoop interview questions part 13

Take as many assements as you can to improve your validate your skill rating

Total Questions: 20

1. _________ is useful for iterating the properties when all deprecated properties for currently set properties need to be present.

2. Which of the following adds a configuration resource ?

3. ________ is a platform for constructing data flows for extract, transform, and load (ETL) processing and analysis of large datasets.

4. Point out the correct statement :

5. _________ hides the limitations of Java behind a powerful and concise Clojure API for Cascading.

6. Hive also support custom extensions written in :

7. Point out the wrong statement :

8. ________ is the most popular high-level Java API in Hadoop Ecosystem

9. ___________ is general-purpose computing model and runtime system for distributed data analytics.

10. The Pig Latin scripting language is not only a higher-level data flow language but also has operators similar to :

11. _______ jobs are optimized for scalability but not latency.

12. ______ is a framework for performing remote procedure calls and data serialization.

13. Hadoop I/O Hadoop comes with a set of ________ for data I/O.

14. Point out the correct statement :

15. Apache Hadoop ___________ provides a persistent data structure for binary key-value pairs.

16. How many formats of SequenceFile are present in Hadoop I/O ?

17. Point out the wrong statement :

18. Which of the following format is more compression-aggressive ?

19. The __________ is a directory that contains two SequenceFile.

20. The ______ file is populated with the key and a LongWritable that contains the starting byte position of the record.

Similar Interview Questions

Search for latest jobs

For Employers

For Partner

For Jobseekers

Help

Follow Us

snaprecruit