Spark SQL Quiz
1. Compile-time safety is provided in ____
Answer: DataSet
2. Dataframe was introduced in which Spark release?
Answer: spark 1.3
3. Dataset was introduced in which Spark release?
Answer: spark 1.6
4. The main datatype in Catalyst that contains node objects
Answer: tree
5. performs manipulations on the tree
Answer: rule
6. Complex data types in Spark SQL are
Answer: ArrayType and MapType
7. Numeric data type in Spark SQL is
Answer: bytetype
8. which of the following provide object-oriented programming interface?
Answer: dataset
9. User-defined aggregations in Spark SQL
Answer: both
10. Data types in Spark SQL are
Answer: both
11. Data is represented as rows internally and JVM Objects externally in
Answer: dataset
12. Built-in aggregations like functions designed for DataFrames
Answer: all
13. To implement a custom untyped user-defined aggregation function, the user has to extend the which class
Answer: UserDefinedAggregateFunction
14. ________ is the entry point to interact with underlying Spark functionality
Answer: SparkSession
15. When SQL run from the other programming language the result will be
Answer: either
16. Which of the following the data is organised into named column?
Answer: RDD and DataFrame
17. The Dataset API is accessible in-
Answer: Java and Scala