Data Engineering

246 readers

1 users here now

News and discussion on Data Engineering topics

founded 1 year ago

MODERATORS

[email protected]

One big table vs a dimensional model (lemmy.world)

submitted 1 year ago by [email protected] to c/[email protected]

0 comments fedilink hide all child comments

Hi fellow data engineers,

Currently I’m restructuring a pipeline written with pyspark on Databricks. Since it’s a lot of transformations, results in an extensive DAG, but it’s cool to spend some extra processing resources to make a standard dimensional model (apart from the necessary transformations).

Was wondering what real benefits you have seen a star schema design has from the “one big table” approach, I could preach to my team? (My goal mainly would be to have a resulting smaller PowerBI model.)

And as a side question, what tools do you use to create a dimensional model such a star schema with code?

Thanks a lot!

no comments (yet)

sorted by: hot top controversial new old

there doesn't seem to be anything here