1. _________ is useful for iterating the properties when all deprecated properties for currently set properties need to be present.
Correct Answer is : setDeprecatedProperties
2. Which of the following adds a configuration resource ?
Correct Answer is : addResource
3. ________ is a platform for constructing data flows for extract, transform, and load (ETL) processing and analysis of large datasets.
4. Point out the correct statement :
Correct Answer is : Hive is not a relational database, but a query engine that supports the parts of SQL specific to querying data
5. _________ hides the limitations of Java behind a powerful and concise Clojure API for Cascading.
Correct Answer is : Cascalog
6. Hive also support custom extensions written in :
7. Point out the wrong statement :
Correct Answer is : Elastic MapReduce (EMR) is Facebook’s packaged Hadoop offering
8. ________ is the most popular high-level Java API in Hadoop Ecosystem
Correct Answer is : Cascading
9. ___________ is general-purpose computing model and runtime system for distributed data analytics.
Correct Answer is : Mapreduce
10. The Pig Latin scripting language is not only a higher-level data flow language but also has operators similar to :
11. _______ jobs are optimized for scalability but not latency.
12. ______ is a framework for performing remote procedure calls and data serialization.
13. Hadoop I/O Hadoop comes with a set of ________ for data I/O.
Correct Answer is : none of the mentioned
14. Point out the correct statement :
Correct Answer is : All of the mentioned
15. Apache Hadoop ___________ provides a persistent data structure for binary key-value pairs.
Correct Answer is : SequenceFile
16. How many formats of SequenceFile are present in Hadoop I/O ?
17. Point out the wrong statement :
Correct Answer is : Map file type is splittable as it contains a sync point after several records
18. Which of the following format is more compression-aggressive ?
Correct Answer is : Block-Compressed
19. The __________ is a directory that contains two SequenceFile.
Correct Answer is : MapFile
20. The ______ file is populated with the key and a LongWritable that contains the starting byte position of the record.
Correct Answer is : Index