Spark SQL Quiz

 


1. Compile-time safety is provided in ____
Answer: DataSet


2. Dataframe was introduced in which Spark release?
Answer: spark 1.3


3. Dataset was introduced in which Spark release?
Answer: spark 1.6


4. The main datatype in Catalyst that contains node objects
Answer: tree


5. performs manipulations on the tree
Answer: rule



6. Complex data types in Spark SQL are
Answer: ArrayType and MapType

7. Numeric data type in Spark SQL is
Answer: bytetype


8. which of the following provide object-oriented programming interface?
Answer: dataset


9. User-defined aggregations in Spark SQL
Answer: both


10. Data types in Spark SQL are
Answer: both


11. Data is represented as rows internally and JVM Objects externally in
Answer: dataset


12. Built-in aggregations like functions designed for DataFrames
Answer: all

13. To implement a custom untyped user-defined aggregation function, the user has to extend the which class
Answer: UserDefinedAggregateFunction

14. ________ is the entry point to interact with underlying Spark functionality
Answer: SparkSession


15. When SQL run from the other programming language the result will be
Answer: either


16. Which of the following the data is organised into named column?
Answer: RDD and DataFrame


17. The Dataset API is accessible in-
Answer: Java and Scala