Apache: Big Data Europe 2016
Tuesday, November 15 • 13:00 - 13:50
Power Pig with Spark - Liyun Zhang, Intel

Apache Pig is a popular scripting platform for processing and analyzing large data sets in the Hadoop ecosystem. With its open architecture and backend neutrality, Pig scripts can currently run on MapReduce and Tez. Apache Spark is an open-source cluster computing framework for data analytics that has gained significant momentum recently. Besides offering performance advantages, Spark is also a more natural fit for the query plan produced by Pig. Pig on Spark enables improved ETL performance while also supporting users who intend to standardize on Spark as the execution engine.


Liyun Zhang

Software Engineer, Intel
Liyun Zhang is a Software Engineer at Intel. She is one of the main contributors to the Pig on Spark project. Prior to that, she made several contributions to the Intel Distribution for Hadoop.

Tuesday November 15, 2016 13:00 - 13:50 CET
Giralda V