Oozie in Hadoop-1
What is Oozie?
- Oozie is a workflow scheduler for Hadoop.
- It was originally designed at Yahoo! for its complex search engine workflows.
- Now it is an open-source Apache incubator project.
- Oozie lets a user define a workflow as a Directed Acyclic Graph (DAG) of actions, where each action starts only after the actions it depends on have finished.
- Oozie can also run plain Java classes and Pig workflows, and interact with HDFS.
- Oozie can run jobs sequentially (one after the other) and in parallel (multiple at a time).
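
The DAG idea above can be illustrated with a small sketch. This is not Oozie's API (Oozie workflows are written as XML definitions); it is a plain Python model, using the standard-library graphlib module (Python 3.9+), of how a DAG decides which actions may run in parallel and which must run one after the other. The action names and the execution_stages helper are hypothetical and chosen only for illustration.

```python
from graphlib import TopologicalSorter


def execution_stages(dependencies):
    """Group actions into stages.

    Actions in the same stage do not depend on each other, so they could
    run in parallel; the stages themselves run sequentially.
    """
    sorter = TopologicalSorter(dependencies)
    sorter.prepare()  # raises graphlib.CycleError if the graph has a cycle
    stages = []
    while sorter.is_active():
        ready = sorted(sorter.get_ready())  # sorted only for stable output
        stages.append(ready)
        sorter.done(*ready)  # mark the whole stage as finished
    return stages


# Hypothetical workflow: each action maps to the set of actions it waits for.
workflow = {
    "prepare_hdfs_dir": set(),
    "pig_clean": {"prepare_hdfs_dir"},
    "java_index": {"prepare_hdfs_dir"},
    "merge_results": {"pig_clean", "java_index"},
}

for number, stage in enumerate(execution_stages(workflow), start=1):
    print(number, stage)
# 1 ['prepare_hdfs_dir']
# 2 ['java_index', 'pig_clean']
# 3 ['merge_results']
```

Stage 2 holds two actions that share no dependency on each other, which corresponds to the parallel case; stages 1 and 3 correspond to the sequential case. Because the graph must be acyclic, a definition in which two actions wait on each other is rejected rather than run.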