Apache: Big Data Europe 2016
Monday, November 14 • 12:00 - 12:50
Data Science with Spark and Case Study with Non-Motorized Travel Social Data for the Public - Yi Fan Zhang, IBM

The collection, documentation, management and analysis of big data on non-motorized travel have not attracted enough attention. This is at odds with the fact that governments strongly advocate cycling, walking and jogging to build low-carbon cities and to improve people's health. This session shares our experience in quantifying and characterizing non-motorized travel by means of spatio-temporal analysis. The data in this case study was captured from a well-known online community where amateur runners share their activities. Around 0.5 million running and cycling records from 0.3 million people in Beijing are analyzed with machine learning and data science methods. Spark ML with the random forest algorithm, together with grid search for parameter selection, was used for prediction based on weather, air quality index (AQI) and time.

Yi Fan Zhang

Software Engineer, IBM
I work in Cloud Data Service, Big Data and Entity Analytics Development at the IBM China Development Lab. Recently, I have been working on the Smart Traffic with People/Vehicle Trajectory Analysis Platform, including building a Spark distributed computing environment, designing and developing Spark applications...

Monday November 14, 2016 12:00 - 12:50 CET
Giralda III/IV