José Cabeda
My awesome picture in Ireland

I'm José Cabeda, a data engineer focused on improving data systems and teaching people how to use them. I also do a lot of planning and read as much as I can.

Status update

06 September, 2022

I’ve taken some time to study and add notes to my second brain. It has been an interesting way of studying which has left me with a thought…

Status update

26 August, 2022

Not much has happened this week; I’ve gotten into using Docusaurus to publish my second brain. You can change it here. I’ve found…

Uses

17 August, 2022

List of all my technologies, hardware, etc. Hardware Laptop (work): MacBook Pro 2021 M1 💻 Personal computer: Graphics: Nvidia GTX 97…

Status update

11 August, 2022

I’ve quickly gotten a prototype to generate my reading updates. The Python script (repo) uses the Pocket API (had to move away from…

Reading Update

10 August, 2022

A lot has happened since March. I’ve moved from Talkdesk to a startup (Fidel API) to help bootstrap a data team and got to read a lot…

Reading Update

20 March, 2022

I’ve gotten some interesting reading this week and, although I think I read quite a bit, I also think that my to-read pile keeps increasing…

Reading Update

13 March, 2022

This has been a productive week: I’ve read a lot of articles and hope to start reading the book Database Internals. Data Engineering What’s…

Reading Update

06 March, 2022

Data Engineering An Introduction to Modern Data Lake Storage Layers - A good comparison, using Apache Spark, on how to create and run some…

Reading Update

27 February, 2022

This week I have read many more articles unrelated to data engineering, as I’ve been too busy to even start reading heavily on Apache Flink…

Reading Update

20 February, 2022

Data Engineering The new modern data stack Airbyte Airflow DBT - For the development of an ELT pipeline, the addition of Airbyte to dbt and…

Reading Update

13 February, 2022

![Photo by Gwen King on Unsplash](./west end.jpeg) Data Engineering What I learned from the open source data stack conference 2021 - A good…

Reading Update

07 February, 2022

Hey! I’ve been away for a bit but, nonetheless, I’ve been keeping up with the news while studying a bit of Scala. Data Engineering Airflow, Prefect…

Pomodoro

27 January, 2022

I was looking into some old code I had written for FreeCodeCamp and found this little thing that I’ve come to use from time to time. Thought…

Reading Update | Hopes for 2022

21 December, 2021

2021 was a year in which I got to fulfill some of my goals. I got a better track record of writing articles (most of them reading updates) and I got…

Reading Update | dbt speaker!

11 December, 2021

Well, on Thursday I gave my first international talk at Coalesce, where I talked about dbt in a data mesh world (basically it’s…

Reading Update

01 December, 2021

Organization The Basecamp Guide to Internal Communication - for those working remotely these tips are very good for improving the quality of…

Workflow

28 November, 2021

This is a work in progress, in no particular order, on how I try to organize my work and thoughts to be as productive as possible. Start of…

Reading Update

23 October, 2021

I’ve taken some time off from writing my reading updates and I’d say some of the articles have gone into something I like to call knowledge…

Data Lineage with DBT for external tables

26 August, 2021

DBT is a great project, but I’ve found myself in a bit of a situation. When we have a project that isn’t entirely in DBT, how can we generate…

Reading Update

10 July, 2021

This week I’ve been pushing my scripty guy and tried to automate the lookout for vaccines in my country. Add some partial success and let’s…

Reading Update

25 June, 2021

I got to watch a documentary related to night watch, which I found quite nice and which explains the image above 😅. As for reading, I’ve…

Reading Update

05 June, 2021

I’m going on vacation, but before doing so I decided to clean up my to-read list. Kind of: I’m leaving most of what I found interesting below but…

Dreaming of better data processing

26 May, 2021

I’ve tried to summarize most of the ideas I have on better data processes. Of course, many of them are simplified and up for debate but I…

Reading Update

26 May, 2021

I’m actually trying to write a bit more but in the meantime here goes another batch of reading 😅 Data Analysis https://tech.trivago.com…

Reading Update

09 May, 2021

I’ve been trying to read some books, so I’ve taken a bit of a break from reading articles. But in the meantime I’ve found a good share related to…

Reading Update

03 April, 2021

This was a relatively calm week. I’ve read a lot and coded less than I wanted. I’ll try to focus more on development and less on reading for…

Reading Update

28 March, 2021

This week I’ve been prepping for three things: reading a book on Scala, writing about SQL vs. code pipelines, and on how to create a new DBT…

Reading Update

20 March, 2021

Hi! I’ve gotten around to reading most of the articles I had from the past weeks. I’m seeing more and more regarding streaming pipelines although I think…

Reading Update

14 March, 2021

This week I’ve gotten to read a lot on architecture. I’m still trying to reduce the articles on my pile and hopefully start an article of my…

Reading Update

07 March, 2021

Data visualization The creator of D3 writes a good summary of his last 10 years in 10 Years of Open-Source Visualization. Data Warehouse…

Reading Update

18 February, 2021

Hi there! I’ve gathered some articles and in the meantime I’ve been reading a bit about Scala and also saving some papers for a “light read…

Reading Update

23 January, 2021

Over the past few weeks, I’ve found some interesting stories related to database migrations like Your legacy database is outgrowing itself and An…

Global view in a regional world

18 January, 2021

As a data engineer, my main goal is to create a single and complete source of truth. This has brought me into the cloud and the ELT…

This week's interesting links

10 January, 2021

This week I’ve mainly focused either on data quality through examples like Great Expectations or on data modeling with the help of Airflow…

2020 review and beyond

03 January, 2021

Just like most people I know (and at least half the world) the pandemic took hold of a big chunk of my life. Fortunately, I was able to cope…

2019 Review and Beyond

02 January, 2020

2019 was a good and challenging year. Looking back at the article I wrote, it seems I’ve done more than I hoped (I’m a bit pessimistic at…

A perspective on Tech In Porto

20 June, 2019

A conference in Porto is a good conference :-p Intro Hi! I’ve attended Tech In Porto and I thought I’d write a brief summary of my…

Building Quizzer

26 May, 2019

A JSON-based quiz shuffler. Why build this? In a conversation with my brother he told me of an event he was organizing which required that they…

Objectives for 2019

07 January, 2019

Hi there! I’m writing this article as a way of putting out there what I intend to do this year. This is more for me than for whoever is…

Free Code Camp Calculator

18 June, 2018

To improve some of my knowledge of frontend development I took the FreeCodeCamp (FCC) course. It’s a fantastic way to start making some projects if…

SSIS Naming Conventions

06 June, 2018

Hi folks! In the past month I’ve started doing a project in Business Intelligence. The work of defining the metrics and dimensions had…

Medium Articles

04 June, 2018

Previous posts I’ve written on Medium: Organize. A proposal to control our life [A new Developer path] (https://mystudentvoices.com/a-new…

The Origin

04 June, 2018

I’ve tried and tried… I’ve lost count of the times I’ve tried to start my own blog. Last year I wrote some articles on Medium but I’ve…