Questions

Introduction To Big Data And Hadoop

Introduction to Big Data

  • Motivation: Scale-up vs scale-out

  • Motivation: Scalability

  • Motivation: Fault tolerance

  • Big Data: Value of Data (Data/Information/Knowledge/Insights)

  • Big Data: Types of Data (Structured and Unstructured)

  • Big Data: Schema on read & schema on write

  • Big Data: Speed of data – from real-time to batch processing

  • Big Data: Data formats

  • Big Data: Compression

DevOps

  • Practices

  • Responsibilities according to the type of Cloud Services

  • CI/CD

  • Tools: Docker; Jenkins; Ansible etc.

Introduction to Hadoop

  • What is Hadoop?

  • Hadoop: Pros & Cons?

  • Hadoop: Ecosystem? Components. Ambari

  • Hadoop: 3 ways to deploy (Pros & Cons)

  • Hadoop: Architectural idea

  • Hadoop: High availability

  • Hadoop: Typical topology

Introduction to Hadoop. YARN.

  • YARN

  • YARN: Components of YARN

  • YARN: Logical and physical projections

  • YARN: Application lifecycle

  • YARN: Scheduler

  • YARN: Execution modes (local + remote)

  • YARN: Writing custom applications

  • YARN: Security

Introduction to Hadoop. Map Reduce.

  • Map Reduce. Concepts.

  • Mapper & Reducer

  • Map Reduce & Python

  • Hadoop vs Spark
