| Snaprecruit.com

Interview questions based on skill :

Take as many assessments as you can to improve and validate your skill rating

Total Questions: 20

1. _________ is useful for iterating the properties when all deprecated properties for currently set properties need to be present.

Correct Answer is : setDeprecatedProperties

2. Which of the following adds a configuration resource ?

Correct Answer is : addResource

3. ________ is a platform for constructing data flows for extract, transform, and load (ETL) processing and analysis of large datasets.

Correct Answer is : Pig

4. Point out the correct statement :

Correct Answer is : Hive is not a relational database, but a query engine that supports the parts of SQL specific to querying data

5. _________ hides the limitations of Java behind a powerful and concise Clojure API for Cascading.

Correct Answer is : Cascalog

6. Hive also supports custom extensions written in :

Correct Answer is : Java

7. Point out the wrong statement :

Correct Answer is : Elastic MapReduce (EMR) is Facebook’s packaged Hadoop offering

8. ________ is the most popular high-level Java API in the Hadoop ecosystem.

Correct Answer is : Cascading

9. ___________ is a general-purpose computing model and runtime system for distributed data analytics.

Correct Answer is : MapReduce
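
To illustrate the computing model named in question 9, here is a minimal single-process sketch of the map, shuffle, and reduce steps applied to word counting. It is plain Python and does not use Hadoop; the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are hypothetical and chosen only for this example.

```python
from collections import defaultdict


def map_phase(line):
    # Emit one (word, 1) pair per word in the input line.
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    # Group all values by key, as the framework does between map and reduce.
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    # Collapse the grouped values for one key into a single result.
    return key, sum(values)


def word_count(lines):
    # Run the three steps in sequence on an in-memory list of lines.
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())
```

In a real cluster the map and reduce calls run in parallel on different nodes and the shuffle moves data over the network; the sketch only shows the data flow.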

10. The Pig Latin scripting language is not only a higher-level data flow language but also has operators similar to :

Correct Answer is : SQL

11. _______ jobs are optimized for scalability but not latency.

Correct Answer is : Hive

12. ______ is a framework for performing remote procedure calls and data serialization.

Correct Answer is : Avro

13. Hadoop comes with a set of ________ for data I/O.

Correct Answer is : none of the mentioned

14. Point out the correct statement :

Correct Answer is : All of the mentioned

15. Apache Hadoop ___________ provides a persistent data structure for binary key-value pairs.

Correct Answer is : SequenceFile

16. How many formats of SequenceFile are present in Hadoop I/O ?

Correct Answer is : 3

17. Point out the wrong statement :

Correct Answer is : Map file type is splittable as it contains a sync point after several records

18. Which of the following formats is more compression-aggressive ?

Correct Answer is : Block-Compressed
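
The answer to question 18 can be made concrete with a rough sketch that compares compressing each record on its own against compressing many records together as one block. It uses Python's `zlib` as a stand-in codec and made-up sample records. This is a simplification: an actual SequenceFile uses its configured codec and, in the block-compressed format, collects keys and values into separate blocks.

```python
import zlib


def record_compressed_size(records):
    # Compress each record on its own: one zlib stream per record.
    return sum(len(zlib.compress(r)) for r in records)


def block_compressed_size(records):
    # Concatenate the records into one block and compress the block once.
    return len(zlib.compress(b"".join(records)))


# Hypothetical sample data: many short records that share most of their bytes.
records = [b"user=%d;status=active;region=us-east" % i for i in range(200)]

per_record = record_compressed_size(records)
per_block = block_compressed_size(records)
print(per_record, per_block)
```

Compressing a block lets the codec exploit redundancy across records, which is why the block total comes out smaller for data like this.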

19. The __________ is a directory that contains two SequenceFiles.

Correct Answer is : MapFile

20. The ______ file is populated with the key and a LongWritable that contains the starting byte position of the record.

Correct Answer is : Index
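
Questions 19 and 20 describe a sorted data file paired with an index of keys and starting byte positions. The sketch below imitates that layout in memory with plain Python. It is not the Hadoop `MapFile` API; the record encoding, the `interval` parameter, and the function names are assumptions made for illustration.

```python
import bisect
import io
import struct


def write_map_file(records, interval=2):
    # records must already be sorted by key.
    data = io.BytesIO()
    index = []  # (key, starting byte position of the record) pairs
    for i, (key, value) in enumerate(records):
        if i % interval == 0:
            # Index only every `interval`-th key to keep the index small.
            index.append((key, data.tell()))
        k, v = key.encode(), value.encode()
        # Each record: two 4-byte big-endian lengths, then key and value bytes.
        data.write(struct.pack(">II", len(k), len(v)) + k + v)
    return data.getvalue(), index


def lookup(data, index, key):
    # Find the last indexed key <= the target, then scan forward from there.
    pos = bisect.bisect_right([k for k, _ in index], key) - 1
    if pos < 0:
        return None
    stream = io.BytesIO(data)
    stream.seek(index[pos][1])
    while True:
        header = stream.read(8)
        if len(header) < 8:
            return None  # reached the end of the data without a match
        klen, vlen = struct.unpack(">II", header)
        k = stream.read(klen).decode()
        v = stream.read(vlen).decode()
        if k == key:
            return v
        if k > key:
            return None  # keys are sorted, so the target is absent
```

For example, `write_map_file([("a", "1"), ("b", "2"), ("c", "3"), ("d", "4")])` yields an index holding only the keys `a` and `c`, and `lookup` for `d` seeks to the record for `c` and reads forward from there.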