Sunday 18 November 2018

Oozie in Hadoop-1

What is Oozie?

  • Oozie is a workflow scheduler for Hadoop.
  • It was originally designed at Yahoo! for their complex search engine workflows.
  • It is now an open-source Apache project (it started in the Apache Incubator and later became a top-level project).
  • Oozie allows a user to define workflows as Directed Acyclic Graphs (DAGs) of actions, whose steps can run in parallel or sequentially in Hadoop.
  • Oozie can also run plain Java classes and Pig workflows, and interact with HDFS.

  • Oozie can run jobs sequentially (one after the other) and in parallel (multiple at a time).
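
To make the DAG idea above concrete, here is a minimal Python sketch of a scheduler that runs independent actions in parallel and dependent actions in sequence. It is only a conceptual illustration and does not use Oozie's own API or workflow format. The action names (`ingest`, `clean_logs`, `clean_users`, `report`) and the helpers `run_action` and `run_workflow` are hypothetical, invented for this example.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical workflow: each action maps to the actions it depends on.
WORKFLOW = {
    "ingest": [],
    "clean_logs": ["ingest"],
    "clean_users": ["ingest"],
    "report": ["clean_logs", "clean_users"],
}


def run_action(name):
    # Stand-in for real work such as a Pig script or a plain Java class.
    return f"{name} done"


def run_workflow(workflow):
    """Run the DAG in waves; actions within one wave run in parallel."""
    finished = set()
    waves = []
    with ThreadPoolExecutor() as pool:
        while len(finished) < len(workflow):
            # An action is ready once all of its dependencies have finished.
            ready = sorted(
                name
                for name, deps in workflow.items()
                if name not in finished and all(d in finished for d in deps)
            )
            if not ready:
                raise ValueError("cycle detected: the workflow is not a DAG")
            # Consuming the results blocks until the whole wave has finished.
            list(pool.map(run_action, ready))
            finished.update(ready)
            waves.append(ready)
    return waves


waves = run_workflow(WORKFLOW)
print(waves)  # [['ingest'], ['clean_logs', 'clean_users'], ['report']]
```

Here `clean_logs` and `clean_users` run side by side because both depend only on `ingest`, while `report` waits for both of them. This is roughly the shape of a workflow where one step forks into parallel branches that later join.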
