Public transportation project | Readings

I would like to keep my interest in actually finishing the Spark book. However, I also got an idea to set up a bot to gather real-time data on Porto’s bus locations and compare it to the schedule. This would allow to actually to analyze some hypotheses like: 1. Buses that arrive early and don’t wait 2. Suppressed buses (they just vanish?) 3. Understand where buses take longer and where an optimization of the traffic would improve the timings

This sounds a bit daunting to be sincer but at least would allow me to:

  1. Test API knowledge
  2. Test streaming pipelines
  3. Test analytic skills

At the end of the day could start a conservation on how to improve public transportation.

Readings of the week

Sevilha 2022

I'm José Cabeda, a data engineer focused on improving data systems and educating on how to use them. I also do a lot of planning and read as much as I can.