11
submitted 1 week ago by [email protected] to c/[email protected]

Time and again I see the same questions asked: "Why should I use dbt?" or "I don't understand what value dbt offers". So I thought I'd put together an article that touches on some of the benefits, as well as putting together a step through on setting up a new project (using DuckDB as the database), complete with associated GitHub repo for you to take a look at.

Having used dbt since early 2018, and with my partner being a dbt trainer, I hope that this article is useful for some of you. The link is paywall bypassed.

20
submitted 1 month ago by [email protected] to c/[email protected]

If you're a Data Engineer, before long you'll be asked to build a real-time pipeline.

In my latest article, I build a real-time pipeline using Kafka, Polars and Delta tables to demonstrate how these can work together. Everything is available to try yourself in the associated GitHub repo. So if you're curious, take a moment to check out this technical post.

[-] [email protected] 3 points 1 month ago

Great point. We use this for our solution design docs, and to display the final star schema in our dbt models that we then embed within our dbt docs. Given we use dbt for our warehouse, we don’t need to worry about the create table statements.

19
Diagrams as Code (medium.com)
submitted 1 month ago by [email protected] to c/[email protected]

How often do you build and edit Entity Relationship Diagrams? If the answer is ‘more often than I’d like’, and you’re fed up with tweaking your diagrams, take <5 minutes to read my latest article on building your diagrams with code. Track their changes in GitHub, have them build as part of your CI/CD pipeline, and even drop them into your dbt docs if you like.

This is a ‘friends and family’ link, so it’ll bypass the usual Medium paywall.

I’m not affiliated to the tool I’ve chosen in any way. Just like how it works.

Let me know yours thoughts!

2
submitted 3 months ago by [email protected] to c/[email protected]

I’ve written a series of Medium articles on creating a Data Pipeline from scratch, using Polars and DeltaTables. The first (linked) is an overview with link to the GitHub repository and each of the deeper dive articles. I then go into the next level of detail, walking through each component.

The articles are paywalled (it took time to build and document), but the link provided is the ‘family & friends’ link which bypasses the paywall for the Lemmy community.

I hope some of you may find this helpful.

2
submitted 4 months ago by [email protected] to c/[email protected]

A few years ago, if you'd mentioned Infrastructure-as-Code (IaC) to me, I would've given you a puzzled look. However I'm now on the bandwagon. And to help others understand how it can benefit them, I've pulled together a simple GitHub repo that showcases how Terraform can be used with Snowflake to manage users, roles, warehouses and databases.

The readme hopefully gives anyone who wants to give it a go the ability to step through and see results. I'm sharing this in the hopes that it is useful to some of you.

2
submitted 5 months ago by [email protected] to c/[email protected]

Hi all,

For those wanting a quick repo to use as a basis to get started, I’ve created jen-ai.

There are full instructions in the readme. Once running you can talk to it, and it will respond.

It’s basic, but a place to start.

[-] [email protected] 8 points 10 months ago

I think it gives everyone the same list of 29, but it’s the order that’s important. Gentoo came back as my top. I use Void which came back as 4th in my list.

[-] [email protected] 3 points 10 months ago

Well at least at the end of the questions the distro I use (Void) was somewhere near the top of the list (4th).

[-] [email protected] 1 points 11 months ago

I’m not sure where you live, but salaries have dropped significantly here (Australia), although it would be fair to say they were back to pre-covid ranges. Two years ago, big banks were offering $250k base for a Senior Data Engineer. Currently the rate is around $170-180k.

4
Free resource books (books.goalkicker.com)
submitted 1 year ago by [email protected] to c/[email protected]

Thought I’d share this link. I’m not affiliated in any way.

nydas

joined 1 year ago