| Snaprecruit.com

| Snaprecruit.com

Interview question based on skill :

Take as many assements as you can to improve your validate your skill rating

Total Questions: 20

1. Which of the following file contains user defined functions (UDFs) ?

Correct Answer is : tutorial.jar

2. Which of the following is correct syntax for parameter substitution using cmd ?

Correct Answer is : pig {-param param_name = param_value | -param_file file_name} [-debug | -dryrun] script

3. You can specify parameter names and parameter values in one of the ways:

Correct Answer is : All of the mentioned

4. _________ are scanned in the order they are specified on the command line.

Correct Answer is : Both parameter files and command line parameters

5. Drill is designed from the ground up to support high-performance analysis on the ____________ data.

Correct Answer is : semi-structured

6. Point out the correct statement :

Correct Answer is : None of the mentioned

7. ___________ includes Apache Drill as part of the Hadoop distribution.

Correct Answer is : MapR

8. MapR __________ Solution Earns Highest Score in Gigaom Research Data Warehouse Interoperability Report

Correct Answer is : SQL-on-Hadoop

9. Point out the wrong statement :

Correct Answer is : Hadoop is a prerequisite for Drill

10. Drill integrates with BI tools using a standard __________ connector.

Correct Answer is : ODBC

11. Drill analyze semi-structured/nested data coming from _________ applications.

Correct Answer is : NoSQL

12. Apache _________ provides direct queries on self-describing and semi-structured data in files.

Correct Answer is : Drill

13. Drill provides a __________ like internal data model to represent and process data.

Correct Answer is : JSON

14. Drill also provides intuitive extensions to SQL to work with _______ data types.

Correct Answer is : nested

15. Apache Flume 1.3.0 is the fourth release under the auspices of Apache of the so-called ________ codeline

Correct Answer is : NG

16. Point out the correct statement :

Correct Answer is : Flume is a distributed, reliable, and available service

17. ___________ was created to allow you to flow data from a source into your Hadoop environment.

Correct Answer is : Flume

18. A ____________ is an operation on the stream that can transform the stream.

Correct Answer is : Source

19. Point out the wrong statement :

Correct Answer is : None of the mentioned

20. A number of ____________ source adapters give you the granular control to grab a specific file.

Correct Answer is : text file