dbt core (self hosted)

This page will walkthrough the setup of dbt core (self hosted dbt) in K.

Integration details

Generate the Manifest using the dbt compile command for each project. This will ensure the manifest file contain compiled SQL. The following docs (and filenames are expected)
- https://docs.getdbt.com/reference/artifacts/manifest-json/
- Must contain compiled SQL which is generated following DBT run / compile
- filename: <project_id>_manifest_YYYYMMDDhhmmss.json
(Optional) Generate the Catalog file using the dbt docs generate command
- https://docs.getdbt.com/reference/artifacts/catalog-json
- file name: <project_id>_catalog.json
Run results are generated after each dbt run.
- https://docs.getdbt.com/reference/artifacts/run-results-json
- file name: <project_id>_run_results_YYYYMMDDHHmmss.json
Use an orchestration tool like Airflow to align the filenames and push the docs (manifest, catalog, run_results) to the landing folder created in Step 3

The inclusion of the project_id in the filename is to support multiple dbt projects.

For details about how to push files to landing - see Collectors

Select Platform Settings in the side bar
In the pop-out side panel, under Integrations click on Sources
Locate your new dbt core Source and click on the Schedule Settings (clock) icon to set the schedule

Complete the following steps to load your latest manifest.json file
Loading a full manifest.json for dbt (Cloud & Core)
Push your manifest.json, catalog.json, mapping.json and run_results.json files to the K landing directory..
- For example, you can use Azure Storage Explorer if you want to initially do this manually.

Next to your new Source, click on the Run manual load icon
Confirm how your want the source to be loaded
After the source load is triggered, a pop up bar will appear taking you to the Monitor tab in the Batch Manager page. This is the usual page you visit to view the progress of source loads

A manual source load will also require a manual run of

To load all metrics and indexes with the manually loaded metadata. These can be found in the Batch Manager page

Troubleshooting failed loads

If the job failed at the extraction step
- Check the error. Contact KADA Support if required.
- Rerun the source job
If the job failed at the load step, the landing folder failed directory will contain the file with issues.
- Find the bad record and fix the file
- Rerun the source job