Hadoop interview questions part 13
Total Questions: 20
1. _________ is useful for iterating the properties when all deprecated properties for currently set properties need to be present.
A. addResource
B. setDeprecatedProperties
C. addDefaultResource
D. none of the mentioned
Correct Answer is:
setDeprecatedProperties
2. Which of the following adds a configuration resource?
A. addResource
B. setDeprecatedProperties
C. addDefaultResource
D. none of the mentioned
Correct Answer is:
addResource
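Note on questions 1 and 2: in Hadoop's Configuration class, a resource added later overrides properties from resources added earlier (unless a property is marked final), and setDeprecatedProperties fills in deprecated keys whose replacement key is already set. The sketch below is a conceptual Python model of that behaviour only, not the Hadoop API; the function names and the namenode address are hypothetical, and final properties are not modelled.

```python
# Conceptual model only: this is NOT the Hadoop Configuration API.

def load_resources(resources):
    """Merge resources in the order they were added; later ones win."""
    merged = {}
    for resource in resources:
        merged.update(resource)
    return merged


def set_deprecated_properties(props, deprecations):
    """Copy a value to each deprecated key that is unset while its
    replacement key is set, so iteration sees both names."""
    result = dict(props)
    for old_key, new_key in deprecations.items():
        if old_key not in result and new_key in result:
            result[old_key] = result[new_key]
    return result


# Hypothetical default and site resources.
defaults = {"fs.defaultFS": "file:///", "io.file.buffer.size": "4096"}
site = {"fs.defaultFS": "hdfs://namenode:8020"}

conf = load_resources([defaults, site])
# fs.default.name is the deprecated name for fs.defaultFS.
full = set_deprecated_properties(conf, {"fs.default.name": "fs.defaultFS"})
print(full["fs.default.name"])  # hdfs://namenode:8020
```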
3. ________ is a platform for constructing data flows for extract, transform, and load (ETL) processing and analysis of large datasets.
A. Pig Latin
B. Oozie
C. Pig
D. Hive
Correct Answer is:
Pig
4. Point out the correct statement:
A. Hive is not a relational database, but a query engine that supports the parts of SQL specific to querying data
B. Hive is a relational database with SQL support
C. Pig is a relational database with SQL support
D. All of the mentioned
Correct Answer is:
Hive is not a relational database, but a query engine that supports the parts of SQL specific to querying data
5. _________ hides the limitations of Java behind a powerful and concise Clojure API for Cascading.
A. Scalding
B. HCatalog
C. Cascalog
D. All of the mentioned
Correct Answer is:
Cascalog
6. Hive also supports custom extensions written in:
A. C#
B. Java
C. C
D. C++
Correct Answer is:
Java
7. Point out the wrong statement:
A. Elastic MapReduce (EMR) is Facebook’s packaged Hadoop offering
B. Amazon Web Service Elastic MapReduce (EMR) is Amazon’s packaged Hadoop offering
C. Scalding is a Scala API on top of Cascading that removes most Java boilerplate
D. All of the mentioned
Correct Answer is:
Elastic MapReduce (EMR) is Facebook’s packaged Hadoop offering
8. ________ is the most popular high-level Java API in the Hadoop ecosystem.
A. Scalding
B. HCatalog
C. Cascalog
D. Cascading
Correct Answer is:
Cascading
9. ___________ is a general-purpose computing model and runtime system for distributed data analytics.
A. MapReduce
B. Drill
C. Oozie
D. None of the mentioned
Correct Answer is:
MapReduce
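Note on question 9: the MapReduce model splits a job into a map step that emits key-value pairs, a shuffle that groups values by key, and a reduce step that aggregates each group. The following is a minimal single-process Python sketch of that idea using word count; all function names are hypothetical, and it does not show the distributed runtime that Hadoop provides.

```python
from collections import defaultdict


def map_phase(line):
    """Emit a (word, 1) pair for every word in one input line."""
    for word in line.split():
        yield word.lower(), 1


def shuffle(pairs):
    """Group all emitted values by key, as the framework does between phases."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Aggregate the values collected for one key."""
    return key, sum(values)


def word_count(lines):
    pairs = (pair for line in lines for pair in map_phase(line))
    return dict(reduce_phase(k, vs) for k, vs in shuffle(pairs).items())


print(word_count(["Pig Hive Pig", "hive Avro"]))
```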
10. The Pig Latin scripting language is not only a higher-level data flow language but also has operators similar to:
A. SQL
B. JSON
C. XML
D. All of the mentioned
Correct Answer is:
SQL
11. _______ jobs are optimized for scalability but not latency.
A. MapReduce
B. Drill
C. Oozie
D. Hive
Correct Answer is:
Hive
12. ______ is a framework for performing remote procedure calls and data serialization.
A. Drill
B. BigTop
C. Avro
D. Chukwa
Correct Answer is:
Avro
13. Hadoop comes with a set of ________ for data I/O.
A. methods
B. commands
C. classes
D. none of the mentioned
Correct Answer is:
none of the mentioned
14. Point out the correct statement:
A. The sequence file can also contain a “secondary” key-value list that can be used as file metadata
B. SequenceFile formats share a header that contains information which allows the reader to recognize its format
C. There are key and value class names that allow the reader to instantiate those classes, via reflection, for reading
D. All of the mentioned
Correct Answer is:
All of the mentioned
15. Apache Hadoop ___________ provides a persistent data structure for binary key-value pairs.
A. GetFile
B. SequenceFile
C. Putfile
D. All of the mentioned
Correct Answer is:
SequenceFile
16. How many formats of SequenceFile are present in Hadoop I/O?
A. 2
B. 3
C. 4
D. 5
Correct Answer is:
3
17. Point out the wrong statement:
A. The data file contains all the key-value records, but key N + 1 must be greater than or equal to key N
B. Sequence file is a kind of Hadoop file-based data structure
C. Map file type is splittable as it contains a sync point after several records
D. None of the mentioned
Correct Answer is:
Map file type is splittable as it contains a sync point after several records
18. Which of the following formats is more compression-aggressive?
A. Partition Compressed
B. Record Compressed
C. Block-Compressed
D. Uncompressed
Correct Answer is:
Block-Compressed
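Note on question 18: block compression tends to compress better because many records are compressed together, so redundancy across records can be exploited, whereas record compression handles each record on its own. The Python sketch below illustrates the effect with zlib on made-up records. It is a simplification under stated assumptions: the record contents, the function names, and the block size of 100 records are hypothetical, and the real SequenceFile formats treat keys and values separately.

```python
import zlib

# Hypothetical, highly repetitive records standing in for SequenceFile values.
records = [f"user={i % 5};status=ok;region=us-east".encode() for i in range(1000)]


def record_compressed_size(recs):
    """Compress every record individually and sum the sizes."""
    return sum(len(zlib.compress(r)) for r in recs)


def block_compressed_size(recs, block_records=100):
    """Collect records into blocks and compress each block as a unit."""
    total = 0
    for start in range(0, len(recs), block_records):
        block = b"\n".join(recs[start:start + block_records])
        total += len(zlib.compress(block))
    return total


print("record-compressed bytes:", record_compressed_size(records))
print("block-compressed bytes: ", block_compressed_size(records))
```

On data like this, the block-compressed total should come out far smaller than the record-compressed total, since each tiny record pays the compressor's fixed overhead when compressed alone.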
19. The __________ is a directory that contains two SequenceFiles.
A. ReduceFile
B. MapperFile
C. MapFile
D. None of the mentioned
Correct Answer is:
MapFile
20. The ______ file is populated with the key and a LongWritable that contains the starting byte position of the record.
A. Array
B. Index
C. Immutable
D. All of the mentioned
Correct Answer is:
Index
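Note on questions 19 and 20: a MapFile pairs a sorted data file with a smaller index file that maps some of the keys to the byte position where their record starts, so a lookup can seek near the target and scan forward. The Python sketch below models that layout in memory. It is an illustration only: the record encoding, the function names, and the index interval of 2 are assumptions made for the example (Hadoop's default index interval is 128), and it is not the Hadoop on-disk format.

```python
import bisect
import struct


def write_map_file(sorted_items, index_interval=2):
    """Build a 'data file' of length-prefixed records plus an 'index'
    of (key, starting byte offset) for every index_interval-th record."""
    data = bytearray()
    index = []
    for i, (key, value) in enumerate(sorted_items):
        if i % index_interval == 0:
            index.append((key, len(data)))
        k, v = key.encode(), value.encode()
        data += struct.pack(">II", len(k), len(v)) + k + v
    return bytes(data), index


def lookup(data, index, key):
    """Find the last indexed key <= key, seek there, then scan forward."""
    keys = [k for k, _ in index]
    pos = bisect.bisect_right(keys, key) - 1
    if pos < 0:
        return None  # key sorts before the first record
    offset = index[pos][1]
    while offset < len(data):
        klen, vlen = struct.unpack_from(">II", data, offset)
        offset += 8
        k = data[offset:offset + klen].decode()
        offset += klen
        v = data[offset:offset + vlen].decode()
        offset += vlen
        if k == key:
            return v
        if k > key:
            return None  # passed the spot where the key would be
    return None


items = [("apple", "1"), ("banana", "2"), ("cherry", "3"), ("date", "4"), ("fig", "5")]
data, index = write_map_file(items)
print(lookup(data, index, "date"))  # 4
```

Because the keys must be written in sorted order, the scan can stop as soon as it passes the requested key.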