I made a repo, woo!
My first step in building the repo is to understand the engineering. I followed this youtube video to get my first lil set-up so I have a “shell” for a dbt repo in github, i.e. I have my ‘workstation’ set up. The video was relatively easy to follow so can definitely give this one a recommend!
Here’s a link to the repo.
Repo / pipeline requirements
I have set a set of parameters that I think are important for this repo when it comes to the set-up, i.e. repo, its pipeline, & the tooling:
- everything needs to be free
- anyone with basic data analyst- or coding-like knowledge should be able to download the repo
- anyone should be able to download the data I want to make available
- the data should be cleaned enough to make it possible to run analyses without a massive hassle (*issue for later)
I did some reading, some redditing, so much googling, to figure out how to continue from here to meet all these parameters. I think I have figured out a few things that will be helpful going forward, and I’ve made some decisions. Stay tuned for the next entry as I am currently drawing it up into a neat lil mindmap.