SQL: How to empower your decisions
Who am I?
José Cabeda
Data engineer since 2018
Born and raised in Porto
D&D fan and bookworm
SQL-first developer
AI CoE (Mania team)
WWW.VWDS.PT
Data is everywhere
Daily lives
CSV/Excel Files
JSON files
1000s of online applications
Databases
Why SQL?
SQL = Structured Query Language
First commercial product in 1979, from Relational Software, Inc. (now Oracle)
ANSI standard in 1986
SQL:2023
Introduction of property graph queries
Enhanced JSON support
https://www.coginiti.co/blog/the-evolution-of-sql-from-sql-86-to-sql-2023/
Generations
Relational databases (1st gen)
Introduce ACID
NoSQL databases (2nd gen)
Increase speed at the cost of correctness
Eventual consistency
Introduce JSON-like structures
Data warehouses (3rd gen)
Analytical databases (>TBs)
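To make the ACID bullet concrete, here is a minimal sketch of atomicity using Python's built-in sqlite3 module. The accounts table, the names and the balances are made up for illustration; the point is only that a failed transfer leaves no half-applied change behind.

```python
import sqlite3


def make_demo_db():
    """Create an in-memory database with a hypothetical accounts table."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE accounts ("
        "name TEXT PRIMARY KEY, "
        "balance INTEGER NOT NULL CHECK (balance >= 0))"
    )
    conn.executemany(
        "INSERT INTO accounts VALUES (?, ?)",
        [("alice", 100), ("bob", 50)],  # illustrative data only
    )
    conn.commit()
    return conn


def transfer(conn, src, dst, amount):
    """Move `amount` from src to dst atomically; return True if committed."""
    try:
        # `with conn` commits if the block succeeds and rolls back on error.
        with conn:
            # Credit first, so a failing debit has something to undo.
            conn.execute(
                "UPDATE accounts SET balance = balance + ? WHERE name = ?",
                (amount, dst),
            )
            conn.execute(
                "UPDATE accounts SET balance = balance - ? WHERE name = ?",
                (amount, src),
            )
        return True
    except sqlite3.IntegrityError:
        # The CHECK constraint rejected a negative balance; both updates are undone.
        return False


def balances(conn):
    """Return {name: balance} for every account."""
    return dict(conn.execute("SELECT name, balance FROM accounts ORDER BY name"))
```

Calling transfer(conn, "alice", "bob", 500) returns False and leaves both balances unchanged, even though the credit to bob had already been executed inside the transaction.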
SQL is the language of data
4th generation
DuckDB: understand your files
Steampipe: understand your application
DuckDB
Open source (MIT License)
In-process database (in-memory by default)
Multiple client APIs (CLI, graphical client, Python, JavaScript, etc.)
Extensions (CSV, Parquet, JSON, Iceberg, S3)
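As a rough illustration of querying a file directly, the sketch below uses DuckDB's Python client to aggregate a CSV without loading it into a table first. It assumes the third-party duckdb package is installed (pip install duckdb); the file name, the column names and the sample rows are hypothetical.

```python
import csv
import os


def write_sample_csv(directory):
    """Write a tiny, made-up CSV file and return its path."""
    path = os.path.join(directory, "events.csv")
    with open(path, "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(["kind", "amount"])
        writer.writerows([["a", 1], ["b", 5], ["a", 2]])  # illustrative data only
    return path


def total_amount_by_kind(csv_path):
    """Return [(kind, total), ...] computed by DuckDB straight from the CSV."""
    import duckdb  # third-party dependency, imported lazily

    con = duckdb.connect()  # no argument: a private in-memory database
    safe_path = csv_path.replace("'", "''")  # escape quotes for the SQL literal
    query = f"""
        SELECT kind, SUM(amount) AS total
        FROM read_csv_auto('{safe_path}')
        GROUP BY kind
        ORDER BY kind
    """
    try:
        return con.execute(query).fetchall()
    finally:
        con.close()
```

The same SELECT can be pasted into the DuckDB CLI; only the Python wrapper around it is specific to this sketch.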
Explore Porto
- Real-time data
- Supports buses, metro, etc.
- Real-time and scheduled trips
Explore Porto
- How many bus cancellations are there?
- And delays?
- What can we find out?
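The questions above map naturally onto GROUP BY queries. The sketch below is not the demo from the talk: it uses Python's built-in sqlite3 as a stand-in engine, and the trips table, its columns and its rows are invented purely to show the shape of the SQL.

```python
import sqlite3

# How many cancellations are there per line? (hypothetical schema)
CANCELLATIONS_SQL = """
SELECT line, COUNT(*) AS cancelled_trips
FROM trips
WHERE status = 'cancelled'
GROUP BY line
ORDER BY cancelled_trips DESC, line
"""

# And delays? Average delay of the trips that actually ran.
DELAYS_SQL = """
SELECT line, AVG(delay_minutes) AS avg_delay_minutes
FROM trips
WHERE status <> 'cancelled'
GROUP BY line
ORDER BY line
"""


def load_sample(conn):
    """Create the hypothetical trips table and fill it with made-up rows."""
    conn.execute(
        "CREATE TABLE trips (line TEXT, status TEXT, delay_minutes INTEGER)"
    )
    conn.executemany(
        "INSERT INTO trips VALUES (?, ?, ?)",
        [
            ("A", "completed", 0),
            ("A", "completed", 4),
            ("A", "cancelled", None),
            ("B", "completed", 10),
            ("B", "cancelled", None),
            ("B", "cancelled", None),
        ],
    )


def run(conn, sql):
    """Execute a query and return all rows as a list of tuples."""
    return conn.execute(sql).fetchall()
```

With real feeds the table and column names will differ, but the two queries keep the same structure: filter on trip status, group by line, then count or average.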
Demo Time
Steampipe
Query APIs and codebases
Join between data sources
Steampipe + Slack
steampipe plugin install slack
Check DM users vs open users
Check who has MFA enabled
Steampipe + Google Sheets
steampipe plugin install googlesheets
Same syntax
Join sheets
Join spreadsheets
Steampipe + AWS
steampipe plugin install aws
Check untagged resources
Detect lambdas that require runtime upgrades
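One possible way to script the Lambda runtime check is sketched below. It assumes the Steampipe CLI and the aws plugin are installed and configured, and that the plugin's aws_lambda_function table exposes name, runtime and region columns. The helper names are hypothetical, and the list of runtimes to flag is supplied by the caller (check AWS's current deprecation list rather than relying on the example values).

```python
import json
import subprocess


def build_lambda_runtime_query(runtimes):
    """Build SQL listing Lambda functions whose runtime is in `runtimes`."""
    if not runtimes:
        raise ValueError("at least one runtime is required")
    # Quote each runtime as a SQL string literal, doubling any single quotes.
    quoted = ", ".join("'" + r.replace("'", "''") + "'" for r in runtimes)
    return (
        "select name, runtime, region "
        "from aws_lambda_function "
        f"where runtime in ({quoted}) "
        "order by region, name"
    )


def run_steampipe_query(sql):
    """Run `sql` through the Steampipe CLI and return its parsed JSON output.

    The exact JSON shape depends on the Steampipe version, so it is returned
    as-is instead of being unpacked here.
    """
    completed = subprocess.run(
        ["steampipe", "query", sql, "--output", "json"],
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if the CLI reports a failure
    )
    return json.loads(completed.stdout)
```

For example, run_steampipe_query(build_lambda_runtime_query(["python3.7", "nodejs12.x"])) would list functions still on those two runtimes. The generated SQL can also be pasted directly into an interactive steampipe query session.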
Thank you
Blog: cabeda.dev