José Cabeda
Sevilha 2022

I'm José Cabeda, a data engineer focused on improving data systems and educating on how to use them. I also do a lot of planning and read as much as I can.

Talk, blogging & rust | Readings

27 November 2022 | 1 minutes to read

Bom Jesus Braga, Photo by Angela Compagnone December is turning out to be a challenging one 😬. I’ve published the first article in some…

DuckDB VS Porto buses - A small case for a new OLAP engine

24 November 2022 | 5 minutes to read

A new database, DuckDB, has seen a rise in popularity, in recent years. Other alternatives like Redshift are too expensive for many…

Status update | Readings

20 November 2022 | 1 minutes to read

Casa da música Porto With the rise of Mastodon, I’ve gotten myself into publishing some small thoughts. Not sure how useful this might be as…

Slow weeks happen now and then | Readings

13 November 2022 | 1 minutes to read

Cyberpunk by DALL-E I’ve read quite a bit preparing the presentation. Although I’m still writing, rewriting, and doing the demo, I’ll start…

Layoffs and focusing efforts on talk | Readings

07 November 2022 | 1 minutes to read

This week has been weird. A lot of layoffs in the tech sector… Funnily, the theme for the talk I was preparing might be becoming more and…

Status update

30 October 2022 | 1 minutes to read

I’ve done a lot of reading but I’ll again be focusing on the talk I plan to prepare. I want to test some of the ideas I have but if I’m…

Status update

23 October 2022 | 1 minutes to read

Gotten some progress on the statistics course and finnally caught with the to-read pile of articles. As I’ve been on vacations I didn’t do…

Status update

16 October 2022 | 1 minutes to read

As I said I’ve slowly been working on the statistics course. It’s great to get to the basics again and I’m greatly enjoying how empowering…

Status update

09 October 2022 | 1 minutes to read

I’ve taken a bit off for the past month. Nonetheless I’ve read the Streaming systems book (great one to complement Designing Data Intensive…

Status update

06 September 2022 | 1 minutes to read

I’ve taken some time to study and add notes to my second brain. It has been an interesting way of studying which has left me with a thought…

Status update

26 August 2022 | 1 minutes to read

Not much has happened this this week, I’ve gotten myself into using docsaurus to publish my second brain. You can change it here. I’ve found…

Uses

17 August 2022 | 1 minutes to read

List of all my technologies, hardware, etc Hardware Laptop (work): Macbook Pro 2021 M1 💻 Personal computer: Graphics: Nvidia GTX 97…

Status update

11 August 2022 | 1 minutes to read

I’ve quickly gotten a prototype to generate my reading updates. The python script (repo) uses the Pocket API (had to move away from…

Reading Update

10 August 2022 | 2 minutes to read

Since March there was a lot happening. I’ve moved from Talkdesk to a startup (Fidel API) to help bootstrap a data team and got to read a lot…

Reading Update

20 March 2022 | 1 minutes to read

I’ve gotten some interesting reading this week and, although I think I read quite a bit, I also think that my to read pile keeps increasing…

Reading Update

13 March 2022 | 1 minutes to read

This has been a productive week, I’ve read a lot of articles and hope to start reading the book Database Internals Data Engineering What’s…

Reading Update

06 March 2022 | 1 minutes to read

Data Engineering An Introduction to Modern Data Lake Storage Layers - A good comparison, using apache spark on how to create and run some…

Reading Update

27 February 2022 | 1 minutes to read

This week I have read much more articles non-related to data engineering as I’ve been too busy to even start reading heavily on apache flink…

Reading Update

20 February 2022 | 1 minutes to read

Data Engineering The new modern data stack Airbyte Airflow DBT - For the development of an ELT pipeline, the addition of airbyte to dbt and…

Reading Update

13 February 2022 | 1 minutes to read

![Photo by Gwen King on Unsplash](./west end.jpeg) Data Engineering What I learned from the open source data stack conference 2021 - A good…

Reading Update

07 February 2022 | 1 minutes to read

Hey! I’ve been a bit out but, nonetheless, I’ve been keeping up with news, while studying a bit on scala. Data Engineering Airflow, Prefect…

Pomodoro

27 January 2022 | 1 minutes to read

I was looking into some old code I had written for Freecodecamp and found this little thing that from time to time I’ve come to use. Thought…

Reading Update | Hopes for 2022

21 December 2021 | 1 minutes to read

2021 was an year where I got to fulfill some of my goals. I got a greater track of writing articles (most of them reading updates) and I got…

Reading Update | dbt speaker!

11 December 2021 | 1 minutes to read

Well, on thursday I went through my first international talk at Coalesce where I’ve talked about dbt in a data mesh world (basically it’s…

Reading Update

01 December 2021 | 1 minutes to read

Organization The Basecamp Guide to Internal Communication - for those working remotely these tips are very good for improving the quality of…

Workflow

28 November 2021 | 1 minutes to read

This is a working-in-progress, with no special order of how I try to organize my work and thoughts to be as productive as possible. Start of…

Reading Update

23 October 2021 | 2 minutes to read

I’ve taken some time from writing my reading updates and I’d say some of the articles have gone into something I like to call knowledge…

Data Lineage with DBT for external tables

26 August 2021 | 2 minutes to read

DBT is a great project but I’ve found myself in a kind of a situation. When we have a project that isn’t entirely in DBT how can we generate…

Reading Update

10 July 2021 | 1 minutes to read

This week I’ve been pushing my scripty guy and tried to automate the lookout for vaccines in my country. Add some partial success and let’s…

Reading Update

25 June 2021 | 1 minutes to read

I’ve gotten to watch a documentary related to night watch which I found quite nice which explain the image above 😅. Related to reading I’ve…

Reading Update

05 June 2021 | 1 minutes to read

I’m going on vacations but before doing so decided to clean my to-read list. Kinda, I’m leaving most of what I found interesting below but…

Dreaming of better data processing

26 May 2021 | 1 minutes to read

I’ve tried to summarize most of the ideas I have on better data processes. Of course many of them are simplified and up to debate but I…

Reading Update

26 May 2021 | 1 minutes to read

I’m actually trying to write a bit more but in the meantime here goes another batch of reading 😅 Data Analysis https://tech.trivago.com…

Reading Update

09 May 2021 | 1 minutes to read

I’ve been trying to read some books so I’ve taken a bit of a break reading article. But in the meantime I’ve found a good share related to…

Reading Update

03 April 2021 | 1 minutes to read

This was a relative calm week. I’ve read a lot and coded less than I wanted. I’ll try to focus more on development and less in reading for…

Reading Update

28 March 2021 | 1 minutes to read

This week I’ve been preping for three things. Read a book on Scala, writing about SQL VS code pipelines, and on how to create a new DBT…

Reading Update

20 March 2021 | 2 minutes to read

Hi! I’ve gotten into reading most articles I had for the past weeks. I’m seeing more and more regarding streaming pipelines although I think…

Reading Update

14 March 2021 | 1 minutes to read

This week I’ve gotten to read a lot on architecture. I’m still trying to reduce the articles on my pile and hopefully start an article of my…

Reading Update

07 March 2021 | 1 minutes to read

Data visualization The creator of D3 writes a good summary of his last 10 years in 10 Years of Open-Source Visualization. Data Warehouse…

Reading Update

18 February 2021 | 1 minutes to read

Hi there! I’ve gathered some articles and in the meanwhile I’ve been reading a bit about scala and also saving some papers for a “ligh read…

Reading Update

23 January 2021 | 1 minutes to read

For the past weeks, I’ve found some interesting stories related to database migrations like Your legacy database is outgrowing itself and An…

Global view in a regional world

18 January 2021 | 2 minutes to read

As a data engineer, my main goal is to create a single and complete source of truth. This has brought me into the cloud and the ELT…

This week interesting links

10 January 2021 | 1 minutes to read

This week I’ve mainly focused on either data quality through examples like Great Expectation or on data modeling with the help of Airflow…

2020 review and beyond

03 January 2021 | 2 minutes to read

Just like most people I know (and at least half the world) the pandemic took hold of a big chunk of my life. Fortunately, I was able to cope…

2019 Review and Beyond

02 January 2020 | 2 minutes to read

2019 was a good and challenging year. Looking back at the article I wrote, it seems I’ve done more than I hoped (I’m a bit pessimistic at…

A perspective on Tech In Porto

20 June 2019 | 5 minutes to read

A conference in Porto is a good conference :-p Intro Hi! I’ve attended Tech In Porto and I thought I’d write a brief summary of my…

Building Quizzer

26 May 2019 | 3 minutes to read

A JSON-based quiz shuffler. Why build this? In a conversation with my brother he told of an event he was organizing which required that they…

Objectives for 2019

07 January 2019 | 1 minutes to read

Hi there! I’m writing this article as a way of putting out there what I pretend to do this year. This is more for me than for whoever is…

Free Code Camp Calculator

18 June 2018 | 1 minutes to read

To improve some of my knowledge of Frontend I took the FreeCodeCamp (FCC) course. It’s a fantastic way to starting making some projects if…

SSIS Naming Conventions

06 June 2018 | 1 minutes to read

Hi folks! In the past month I’ve started doing a project in Business Intelligence. The work of defining the metrics and dimensions had…

Medium Articles

04 June 2018 | 1 minutes to read

Previous posts I’ve written on medium: Organize. A proposal to control our life [A new Developer path] (https://mystudentvoices.com/a-new…

The Origin

04 June 2018 | 1 minutes to read

I’ve tried and tried… I’ve lost count of the times I’ve tried to start my own blog. Last year I wrote some articles on Medium but I’ve…