Loading…
This event has ended. View the official site or create your own event → Check it out
This event has ended. Create your own
Apache: Big Data Europe 2016
Click here to Register or for more information 
View analytic
Tuesday, November 15 • 13:00 - 13:50
Massively Parallel Data Warehousing in the Hadoop Stack - Gregory Chase & Roman Shaposhnik, Pivotal

Sign up or log in to save this to your schedule and see who's attending!

Hadoop has been touted as a replacement for data warehouses.  In practice Hadoop has had success offloading ETL/ELT workloads, but still has gaps serving requirements for operational analytics.

Apache Bigtop now includes Greenplum Database in deployment of big data solutions. Greenplum Database is, an open source massively parallel data warehouse  based on PostgreSQL, and is an excellent addition to the Hadoop ecosystem.

In this session we'll cover:
  • Introduction to Greenplum 
  • Bigtop Support for Greenplum
  • External tables in Hadoop by Greenplum
  • Parallel reads and writes to Hadoop by Greenplum
  • Running advanced analytics on structured and unstructured data in both Hadoop and Greenplum via Apache MADlib (incubating)
  • Geospatial and Machine Learning in Greenplum based on HDFS data
  • Storing data from a data lake in Greenplum for high throughput analytical queries

Speakers
GC

Gregory Chase

Director of Big Data Communities, Pivotal Software
Greg Chase is an enterprise software marketing executive more than 20 years experience in marketing, sales, and engineering with software companies. Most recently Greg has been passionately advocating for innovation and transformation of business and IT practices through big data, cloud computing, and business process management in his role as Director of Product Marketing at Pivotal Software. Greg is also a wine maker, dog lover, community... Read More →
avatar for Roman Shaposhnik

Roman Shaposhnik

Director of Open Source, Pivotal Inc.
Roman Shaposhnik is a Director of Open Source at Pivotal Inc. He is a committer on Apache Hadoop, co-creator of Apache Bigtop and contributor to various other Hadoop ecosystem projects. He is also an ASF member and a former Chair of Apache Incubator. In his copious free time he managed to co-author "Practical Graph Analytics with Apache Giraph" and he also posts to twitter as @rhatr. Roman has been involved in Open Source software for more than a... Read More →


Tuesday November 15, 2016 13:00 - 13:50
Nervion/Arenal I

Attendees (25)