Snaprecruit.com

Interview questions based on skill:

Take as many assessments as you can to improve and validate your skill rating

Total Questions: 20

1. The ________ method in the ModelCountReducer class “reduces” the values the mapper collects into a derived value

Correct Answer is : reduce
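
As a rough illustration of what a reduce step does, here is a minimal sketch in plain Python rather than the Hadoop API. The function names (`map_models`, `shuffle`, `reduce_counts`, `count_models`) and the sample records are hypothetical; only the idea of collapsing the values collected for each key into one derived value comes from the question above.

```python
from collections import defaultdict


def map_models(records):
    """Mapper: emit a (model, 1) pair for every input record."""
    for record in records:
        yield record["model"], 1


def shuffle(pairs):
    """Group mapper output by key, as a framework would between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_counts(key, values):
    """Reducer: collapse all values collected for one key into a single count."""
    return key, sum(values)


def count_models(records):
    """Run the map, shuffle and reduce steps in sequence (hypothetical driver)."""
    grouped = shuffle(map_models(records))
    return dict(reduce_counts(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    sample = [{"model": "A"}, {"model": "B"}, {"model": "A"}]
    print(count_models(sample))
```

In a real job the framework performs the grouping, and the reducer only sees one key with its collected values at a time.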

2. Which of the following works well with Avro ?

Correct Answer is : MapReduce

3. __________ tools are used to generate proxy objects in Java so that the objects are easy to work with.

Correct Answer is : Avro

4. Spark was initially started by ____________ at UC Berkeley AMPLab in 2009.

Correct Answer is : Matei Zaharia

5. Point out the correct statement :

Correct Answer is : For cluster management, Spark supports standalone mode and Hadoop YARN

6. ____________ is a component on top of Spark Core.

Correct Answer is : Spark SQL

7. Spark SQL provides a domain-specific language to manipulate ___________ in Scala, Java, or Python.

Correct Answer is : RDDs (SchemaRDDs in early releases, called DataFrames since Spark 1.3)

8. Point out the wrong statement :

Correct Answer is : All of the mentioned

9. ______________ leverages Spark Core fast scheduling capability to perform streaming analytics.

Correct Answer is : Spark Streaming
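
Spark Streaming handles a stream by cutting it into small batches and running an ordinary batch computation on each one. The sketch below imitates that idea in plain Python, without Spark. The helper names `micro_batches` and `word_counts_per_batch` are hypothetical, and batching by a fixed number of events is a simplification: Spark Streaming itself batches by time interval.

```python
from collections import Counter
from itertools import islice


def micro_batches(stream, batch_size):
    """Cut an iterable of events into consecutive batches of at most batch_size."""
    iterator = iter(stream)
    while True:
        batch = list(islice(iterator, batch_size))
        if not batch:
            return
        yield batch


def word_counts_per_batch(lines, batch_size):
    """Run the same batch computation (a word count) on every micro-batch."""
    return [
        Counter(word for line in batch for word in line.split())
        for batch in micro_batches(lines, batch_size)
    ]


if __name__ == "__main__":
    for counts in word_counts_per_batch(["a b", "a", "c"], batch_size=2):
        print(dict(counts))
```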

10. ____________ is a distributed machine learning framework on top of Spark

Correct Answer is : MLlib

11. ________ is a distributed graph processing framework on top of Spark.

Correct Answer is : GraphX

12. GraphX provides an API for expressing graph computation that can model the __________ abstraction.

Correct Answer is : Pregel
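
The Pregel abstraction is vertex-centric: computation proceeds in supersteps, and in each superstep a vertex reads its incoming messages, updates its own value, and sends messages to its neighbours. The sketch below shows that pattern for single-source shortest paths in plain Python. It does not use the GraphX API; the function name `pregel_sssp` and the adjacency-list input format are assumptions made for illustration, and edge weights are assumed to be non-negative.

```python
import math


def pregel_sssp(edges, source):
    """Single-source shortest paths in a Pregel-like, vertex-centric style.

    edges maps each vertex to a list of (neighbour, weight) pairs.
    Weights are assumed non-negative so that the loop terminates.
    """
    vertices = set(edges)
    for neighbours in edges.values():
        vertices.update(neighbour for neighbour, _ in neighbours)

    dist = {vertex: math.inf for vertex in vertices}
    inbox = {source: [0.0]}  # messages to deliver in the next superstep

    while inbox:  # one loop iteration corresponds to one superstep
        outbox = {}
        for vertex, messages in inbox.items():
            best = min(messages)  # combine incoming messages
            if best < dist[vertex]:  # vertex program: keep the shorter distance
                dist[vertex] = best
                for neighbour, weight in edges.get(vertex, []):
                    outbox.setdefault(neighbour, []).append(best + weight)
        inbox = outbox  # vertices that receive no message stay inactive
    return dist


if __name__ == "__main__":
    graph = {"a": [("b", 1), ("c", 4)], "b": [("c", 2)], "c": []}
    print(pregel_sssp(graph, "a"))
```

The computation stops when a superstep produces no messages, which mirrors the way a Pregel-style job ends once every vertex is inactive.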

13. Spark architecture is ___________ times as fast as Hadoop disk-based Apache Mahout and even scales better than Vowpal Wabbit.

Correct Answer is : 10

14. Users can easily run Spark on top of Amazon’s __________

Correct Answer is : EC2

15. Point out the correct statement :

Correct Answer is : Spark enables Apache Hive users to run their unmodified queries much faster

16. Spark runs on top of ___________, a cluster manager system which provides efficient resource isolation across distributed applications

Correct Answer is : Mesos

17. Which of the following can be used to launch Spark jobs inside MapReduce ?

Correct Answer is : SIMR

18. Point out the wrong statement :

Correct Answer is : Spark is intended to replace the Hadoop stack

19. Which of the following languages is not supported by Spark ?

Correct Answer is : Pascal

20. Spark is packaged with higher level libraries, including support for _________ queries.

Correct Answer is : SQL