Hadoop Interview Questions - 7

1. In Chukwa, data analytics scripts are written in ____________ .

A. Hive

B. CQL

C. PigLatin

D. Java

Correct Answer is : PigLatin

2. If demux is successful within ___ attempts, Chukwa archives the completed files.

A. one

B. two

C. three

D. all of the mentioned

Correct Answer is : three

3. Chukwa is an ___________ data collection system for managing large distributed systems.

A. open source

B. proprietary

C. service based

D. none of the mentioned

Correct Answer is : open source

4. Collectors write chunks to logs/*.chukwa files until a ___ MB chunk size is reached.

A. 64

B. 108

C. 256

D. 1024

Correct Answer is : 64

5. The _________ codec from Google provides modest compression ratios.

A. Snapcheck

B. Snappy

C. FileCompress

D. None of the mentioned

Correct Answer is : Snappy

6. Point out the correct statement :

A. Snappy is licensed under the GNU Public License (GPL)

B. BgCIK needs to create an index when it compresses a file

C. The Snappy codec is integrated into Hadoop Common, a set of common utilities that supports other Hadoop subprojects

D. None of the mentioned

Correct Answer is : The Snappy codec is integrated into Hadoop Common, a set of common utilities that supports other Hadoop subprojects

7. Which of the following compression codecs is most similar to Snappy ?

A. LZO

B. Bzip2

C. Gzip

D. All of the mentioned

Correct Answer is : LZO

8. Which of the following supports splittable compression ?

A. LZO

B. Bzip2

C. Gzip

D. All of the mentioned

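Correct Answer is : LZO (splittable once the file is indexed; note that Bzip2 is also splittable in Hadoop, while Gzip is not)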

9. Point out the wrong statement :

A. From a usability standpoint, LZO and Gzip are similar.

B. Bzip2 generates a better compression ratio than does Gzip, but it’s much slower

C. Gzip is a compression utility that was adopted by the GNU project

D. None of the mentioned

Correct Answer is : From a usability standpoint, LZO and Gzip are similar.

10. Which of the following is the slowest compression technique ?

A. LZO

B. Bzip2

C. Gzip

D. All of the mentioned

Correct Answer is : Bzip2
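
As a rough illustration of the trade-off behind this answer, the sketch below compresses the same buffer with Python's standard-library gzip and bz2 modules and reports the compressed size and elapsed time for each. The sample data and the helper name `measure` are assumptions made for illustration only. Timings and sizes depend on the machine and the input, so no particular result is claimed here.

```python
import bz2
import gzip
import time


def measure(name, compress, data):
    """Compress data once and return (name, compressed size, seconds)."""
    start = time.perf_counter()
    packed = compress(data)
    elapsed = time.perf_counter() - start
    return name, len(packed), elapsed


# Hypothetical sample input: repetitive, log-like text (about 2 MB).
data = b"2014-01-01 INFO request served in 12 ms\n" * 50_000

results = [
    measure("gzip", gzip.compress, data),
    measure("bzip2", bz2.compress, data),
]

for name, size, seconds in results:
    print(f"{name:6s} {size:>8d} bytes  {seconds:.4f} s")
```

Running it on data that resembles your own files gives a better feel for the speed and ratio differences than any fixed rule of thumb.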

11. Gzip (short for GNU zip) generates compressed files that have a _________ extension.

A. .gzip

B. .gz

C. .gzp

D. .g

Correct Answer is : .gz

12. Which of the following is based on the DEFLATE algorithm ?

A. LZO

B. Bzip2

C. Gzip

D. All of the mentioned

Correct Answer is : Gzip
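
The link between Gzip and DEFLATE can be checked with a short sketch using Python's standard library. It assumes nothing about Hadoop: it inspects the header of a gzip stream, where the third byte records the compression method (8 means DEFLATE, per RFC 1952), and then decodes the same stream with zlib, which implements DEFLATE. The sample data is made up for illustration.

```python
import gzip
import zlib

# Hypothetical sample input.
data = b"hadoop " * 1000

packed = gzip.compress(data)

# A gzip stream starts with the magic bytes 0x1f 0x8b; the third byte is the
# compression method, where 8 means DEFLATE (RFC 1952).
magic = packed[:2]
method = packed[2]

# zlib implements DEFLATE; wbits=31 tells it to expect a gzip header and trailer.
restored = zlib.decompress(packed, wbits=31)

print(magic.hex(), method, restored == data)
```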

13. __________ typically compresses files to within 10% to 15% of the best available techniques.

A. LZO

B. Bzip2

C. Gzip

D. All of the mentioned

Correct Answer is : Bzip2

14. The LZO compression format is composed of many smaller blocks of compressed data, each approximately __________ in size.

A. 128k

B. 256k

C. 24k

D. 36k

Correct Answer is : 256k
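
A block-oriented layout is what makes an index useful for splitting: if each block is compressed independently and its offset is recorded, a reader can start at any block without decompressing the ones before it. The sketch below illustrates that idea only. It uses zlib from the Python standard library as a stand-in codec (not LZO), and the function names, the index layout, and the block size constant are hypothetical choices for this example, not the actual LZO file or index format.

```python
import zlib

BLOCK_SIZE = 256 * 1024  # uncompressed bytes per block (illustrative value)


def compress_blocks(data, block_size=BLOCK_SIZE):
    """Compress data as independent blocks; return (payload, index).

    index[i] is the (offset, length) of compressed block i inside payload.
    """
    payload = bytearray()
    index = []
    for start in range(0, len(data), block_size):
        block = zlib.compress(data[start:start + block_size])
        index.append((len(payload), len(block)))
        payload += block
    return bytes(payload), index


def read_block(payload, index, i):
    """Decompress only block i, without touching the blocks before it."""
    offset, length = index[i]
    return zlib.decompress(payload[offset:offset + length])


# 1 MiB of sample input, which yields four 256 KiB blocks.
data = bytes(range(256)) * 4096
payload, index = compress_blocks(data)

# Jump straight to the third block using the index.
third = read_block(payload, index, 2)
print(len(index), len(third))
```

Without the index, a reader would have to scan from the start of the file to find where a block begins, which is why an unindexed file of this kind cannot be split across workers.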

15. The Apache Crunch Java library provides a framework for writing, testing, and running ___________ pipelines.

A. MapReduce

B. Pig

C. Hive

D. None of the mentioned

Correct Answer is : MapReduce

16. Point out the correct statement :

A. Scrunch’s Java API is centered around three interfaces that represent distributed datasets

B. All of the other data transformation operations supported by the Crunch APIs are implemented in terms of three primitives

C. A number of common Aggregator implementations are provided in the Aggregators class

D. All of the mentioned

Correct Answer is : A number of common Aggregator implementations are provided in the Aggregators class

17. For Scala users, there is the __________ API, which is built on top of the Java APIs.

A. Prunch

B. Scrunch

C. Hivench

D. All of the mentioned

Correct Answer is : Scrunch

18. The Crunch APIs are modeled after _________ , which is the library that Google uses for building data pipelines on top of their own implementation of MapReduce.

A. FlagJava

B. FlumeJava

C. FlakeJava

D. All of the mentioned

Correct Answer is : FlumeJava

19. Point out the wrong statement :

A. A Crunch pipeline written by the development team sessionizes a set of user logs, which are then processed by a diverse collection of Pig scripts and Hive queries

B. Crunch pipelines provide a thin veneer on top of MapReduce

C. Developers have access to low-level MapReduce APIs

D. None of the mentioned

Correct Answer is : None of the mentioned

20. Crunch was designed for developers who understand __________ and want to use MapReduce effectively.

A. Java

B. Python

C. Scala

D. Javascript

Correct Answer is : Java
