Featured Post

Windows Azure 90-Day Free Trial

You can get a 90 day free trial of Windows Azure. That will give you 750 HRS of Cloud Services: 750 small compute hours, 35 GB Storage with 50M transactions, 1 DU SQL Database with 1 DU of Web Business Edition, and 20 GB Data Transfers, Outbound and unlimited inbound, 10 Web Sites and Mobile Services...

Read More

Apache Tez

Posted by Anahita | Posted in Big Data | Posted on 28-12-2013

Tags: , , ,

0

Apache Tez, part of Stinger Initiative, is a Hadoop framework for near real-time big data processing. As opposed to MapReduce who created bulk data processing capability,  Tez provides a powerful interactive framework for running queries in Apache Hive, and Apache Pig, providing faster response times and throughput.

In  fact Apache Tez is a Hadoop data processing framework utilising DAG (Directed Acyclic Graph) for execution of complex tasks. This means Tez models data processing jobs as a data flow graph. This is similar to PIG Latin scripts, where the edges of the graph represent data flows and the vertices are operators that process data. The logic that modifies or moves the data is represented in vertices. Tez realises the logical graphs into physical at the time of execution on the cluster, applying parallelism at the vertices for scaling to the required data for processing.