Logseq, Impostor Syndrome | Readings
05 November 2023
3 minutes to read
I’ve been able to move my personal thoughts and notes to Logseq instead of just using random notes + VS Code. I can easily revert to the old way, as the only thing I had to change for the migration was to rename my files from YYYYMMDD.md to YYYY_MM_DD.md. Simple enough!
As the tool doesn’t have git sync built in for Android, I’ve had a bit more trouble setting up Syncthing. I’ve yet to fully understand how it works, but for now I’m quite satisfied. For backup I still keep git working on my local computer.
Additionally, I’ve been thinking a bit about how to keep working on the code challenges, and I think I need a hybrid approach. I’ll be doing a course this month instead of the code challenge. The code challenge will be left for December, with the Advent of Code.
Finally, I have to prepare the POS systems for the traditional São Magusto festivity where I live.
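For reference, the rename step can be scripted. The following is only a minimal sketch, not the exact procedure I used: the function names `logseq_name` and `rename_journals` are made up for illustration, and it assumes every journal file sits directly in one folder and is named exactly YYYYMMDD.md.

```python
import re
from pathlib import Path
from typing import List, Optional, Tuple

# Assumption: old journal files are named exactly like 20231105.md.
OLD_NAME = re.compile(r"^(\d{4})(\d{2})(\d{2})\.md$")


def logseq_name(filename: str) -> Optional[str]:
    """Return the YYYY_MM_DD.md name for a YYYYMMDD.md file, or None if it does not match."""
    match = OLD_NAME.match(filename)
    if match is None:
        return None
    year, month, day = match.groups()
    return f"{year}_{month}_{day}.md"


def rename_journals(folder: Path, dry_run: bool = True) -> List[Tuple[str, str]]:
    """Rename matching files in `folder`; with dry_run=True only report the planned renames."""
    planned = []
    for path in sorted(folder.iterdir()):
        new_name = logseq_name(path.name) if path.is_file() else None
        if new_name is None:
            continue  # leave anything that is not an old-style journal file untouched
        planned.append((path.name, new_name))
        if not dry_run:
            path.rename(path.with_name(new_name))
    return planned
```

Running `rename_journals(Path("journals"))` first (the folder name is a placeholder) lists the planned renames without touching anything. Passing `dry_run=False` applies them. Reverting to the old scheme would just be the same idea with the underscores removed.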
Another week with ADRs…
So I’ve decided to revamp this section and get a certificate around Flink. Still need to choose one.
I’ve been a bit lazy about picking this up. My goal was to have all development finished by Saturday, but I’ve got a lot more to do. I’ve bootstrapped dbt to power the analytics and fixed a bug. I have the following checklist, divided by the project and the presentation:
- Project
  - How many trips are being canceled?
  - What are the lines with more cancellations?
  - What are the lines with biggest delays?
  - What are the stops with the biggest delays?
  - Create an iceberg table on top of the parquet
  - Preset dashboard with all analysis above
- Presentation
  - Intro to the problem
  - Present the loader to sqlite
  - Present the loader to S3 + parquet
  - How to create the iceberg table
  - Demo of duckdb + dbt
  - Show the preset dashboard with the analysis
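The four analysis questions in the checklist map to fairly small aggregate queries. Below is a rough sketch only: it uses Python's built-in `sqlite3` rather than the duckdb + dbt setup, and the `stop_events` table, its columns, and the demo rows are all made up for illustration, not the real schema or data of the project.

```python
import sqlite3

# Hypothetical schema: one row per trip observed at a stop.
SCHEMA = """
CREATE TABLE stop_events (
    trip_id INTEGER,
    line TEXT,
    stop TEXT,
    canceled INTEGER,      -- 1 if the trip was canceled, else 0
    delay_minutes REAL     -- NULL for canceled trips
)
"""

# Made-up demo rows, only to make the queries runnable.
DEMO_ROWS = [
    (1, "A", "Central", 0, 2.0),
    (2, "A", "Central", 1, None),
    (3, "A", "Harbor", 0, 6.0),
    (4, "B", "Harbor", 1, None),
    (5, "B", "Harbor", 1, None),
    (6, "B", "Central", 0, 1.0),
]

# One query per analysis question in the checklist.
QUERIES = {
    "canceled_trips":
        "SELECT COUNT(*) FROM stop_events WHERE canceled = 1",
    "cancellations_by_line":
        "SELECT line, COUNT(*) AS n FROM stop_events WHERE canceled = 1 "
        "GROUP BY line ORDER BY n DESC, line",
    "delay_by_line":
        "SELECT line, AVG(delay_minutes) AS avg_delay FROM stop_events "
        "WHERE canceled = 0 GROUP BY line ORDER BY avg_delay DESC, line",
    "delay_by_stop":
        "SELECT stop, AVG(delay_minutes) AS avg_delay FROM stop_events "
        "WHERE canceled = 0 GROUP BY stop ORDER BY avg_delay DESC, stop",
}


def build_demo_db() -> sqlite3.Connection:
    """Create an in-memory database holding the demo rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute(SCHEMA)
    conn.executemany("INSERT INTO stop_events VALUES (?, ?, ?, ?, ?)", DEMO_ROWS)
    return conn


def run_checklist(conn: sqlite3.Connection) -> dict:
    """Run every checklist query and return the result rows keyed by question."""
    return {name: conn.execute(sql).fetchall() for name, sql in QUERIES.items()}


if __name__ == "__main__":
    for name, rows in run_checklist(build_demo_db()).items():
        print(name, rows)
```

In the actual project each of these would more likely live as a dbt model on top of the parquet data, with the Preset dashboard reading the results.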
Second week under a storm (now called Ciarán). I’ve gone to 2 soccer games, but I’ve yet to run 10 km for the São Silvestre training. Making a bit of progress nonetheless.
Readings of the week
And this has been the week I trimmed the list to 0. I’ve had to delete a lot of articles which I just wouldn’t have been able to read, but I’m quite happy. My expectation now is to have this list trimmed every month instead of trying to keep it at 0 every day.
- Tracking SQLite Database Changes in Git | Garrit’s Notes
- SQLite 3.44: Interactive release notes by Anton Zhiyanov
- Spark on AWS Lambda: An Apache Spark runtime for AWS Lambda | AWS Big Data Blog
- DuckDB Extensions for AWS Lambda
- How to communicate like a GitHub engineer: our principles, practices, and tools - The GitHub Blog by Ben Balter, Allison Matlack
- Introducing hybrid access mode for AWS Glue Data Catalog to secure access using AWS Lake Formation and IAM and Amazon S3 policies | AWS Big Data Blog
- Inside New Query Engine of MongoDB | Nikita Lapkov
- 8 annoying A/B testing mistakes every engineer should know - PostHog by Lior Neu-ner
- MotherDuck: MotherDuck + dbt: Better Together by MotherDuck
- Building and operating a pretty big storage system called S3 | All Things Distributed by Werner Vogels
  - “I’ve spent a lot of time at Amazon thinking about how important and effective the focus on ownership is to the business, but also about how effective an individual tool it is when I work with engineers and teams.”: This is something I’ve seen recently and that personally pushes me to do better.
- How to Manage Databases with CI/CD | HackerNoon by Tomas Fernandez
- Using Postgres FILTER
- Optimizing Postgres’s Autovacuum for High-Churn Tables | Tembo by Adam Hendel
- Deno 1.38: HTML doc generator and HMR
- The beauty of finished software | Jose M.
- A systematic approach to debugging | nicole@web
- Stacked Diffs (and why you should know about them) by Gergely Orosz
- Apache Iceberg optimization: Solving the small files problem in Amazon EMR | AWS Big Data Blog
- Amazon Kinesis Data Generator
  - “(KDG) tool to generate sample sensor data information using the following template”: Need to take a further look to generate data for test environments.
- DuckDB + Dbt + great expectations = Awesome Data pipelines | Medium by Pranav Kohli
- Streaming SQL in Data Mesh | Netflix TechBlog by Netflix Technology Blog
I'm José Cabeda, a data engineer focused on improving data systems and educating on how to use them. I also do a lot of planning and read as much as I can.