Loading…
Apache: Big Data Europe 2016
Click here to Register or for more information 
Back To Schedule
Monday, November 14 • 15:30 - 16:20
Scalable Data Science in R and Apache Spark 2.0 - Felix Cheung, Committer

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

R is a very popular platform for Data Science. Apache Spark is a highly scalable data platform. How could we have the best of both worlds? In this talk we will walkthrough many examples how several new features in Apache Spark 2.0.0 will enable this. We will also look at exciting changes coming next in Apache Spark 2.0.1 and 2.1.0.




Speakers
avatar for Felix Cheung

Felix Cheung

Engineering Manager, Uber
Felix started in the big data space about 5 years ago with the then state-of-the-art MapReduce. Since then, he (re-)built Hadoop cluster from metal more times than he would like, created a Hadoop “distro” from two dozens or so projects into .rpm/.deb, and kicked off clusters in... Read More →



Monday November 14, 2016 15:30 - 16:20 CET
Giralda I/II