Hive

  • Hive components and their roles (Driver, Compiler, Metastore, Execution Engine, Optimizer)

  • Schema on read vs schema on write

  • Metastore

  • Beeline vs Hive CLI

  • Partitions and Buckets

  • View vs Temporary table

  • Engine comparison (Tez vs MR vs Spark)

  • UDF (description; classification; ObjectInspector; Python?)

  • SQL vs HQL

  • ACID in Hive? Requirements.

  • Vectorization

  • CBO
