K catalogues assets from across your data ecosystem, classifying them into two primary types: Data and Content.
-
Data assets represent structured data stored in databases and schemas (e.g. tables, columns).
-
Content assets represent analytical and reporting artefacts built on top of data (e.g. reports, dashboards, pipelines, ML models).
Various objects within each integrated source are automatically allocated to an object type. For example:
-
A Power BI workspace is catalogued as a Workspace
-
A Power BI app is catalogued as a Content App
-
A Power BI data flow is catalogued as a Pipeline
-
A Redshift External table is catalogued as a Table (Table type: External)
-
A Snowflake View is catalogued as a Table (Table type: View)
Data Assets
|
Object Type |
Description |
|---|---|
|
Database |
The database used to store and manage structured data assets. |
|
Schema |
A logical grouping within a database that organises related tables, views, and procedures. |
|
Table |
A data object within a schema that stores records in rows and columns. |
|
Column |
A single field or attribute within a table that defines a specific data element. |
|
Code |
A script or program that performs data processing, transformation, or analysis. |
|
Procedure |
A stored routine that encapsulates a series of SQL statements or operations. |
|
Macro |
A parameterised or reusable block of logic that simplifies repetitive operations. |
Analytical & Reporting Content Assets
|
Object Type |
Description |
|---|---|
|
Tool |
The application or platform used to create, process, or manage analytical and reporting content assets within the data ecosystem. |
|
Workspace |
A logical container that groups related assets such as datasets, reports, and pipelines. |
|
Report |
A dashboard, report, or document that presents data, charts, or insights. Typically connected to one or more datasets. |
|
Sheet |
A page within a report. |
|
Content App |
A logical container that groups assets used for data consumption (e.g. reports). |
|
Pipeline |
A data processing flow that moves, transforms, or integrates data between datasets. |
|
ML Model |
A machine learning artefact that uses input data to make predictions or classifications based on historical patterns. |
|
Dataset |
A structured collection of data used for reporting, analysis, or modelling. Typically serves as the foundation for reports or ML models. |
|
Dataset Table |
A table within a dataset that organises data into rows and columns, often representing a specific entity or subject area. |
|
Dataset Field |
An individual column or attribute within a dataset table (e.g. Customer Name, Order Date). |
|
File |
An asset stored in a filesystem or object store (e.g. CSV, JSON, Parquet), used as input or output for analysis and processing. |
Asset and Object Type Hierarchy
|
Asset Type |
Dataset / Pipeline |
Object Type |
Object Children |
|---|---|---|---|
|
Data assets |
Schema |
Table |
Column |
|
Data assets |
Schema |
Macro |
— |
|
Data assets |
Schema |
Procedure |
— |
|
Content assets |
Content |
Content Child |
— |
|
Content assets |
Dataset |
Dataset Table |
Dataset Field |
|
Content assets |
Dataset Pipeline |
— |
— |
|
Content assets |
Content App |
Content |
Content Child |
|
Content assets |
ML Model |
— |
— |
|
Content assets |
Workspace |
Content |
Content Child |
|
Content assets |
Workspace |
Dataset |
Dataset Table / Field |
|
Content assets |
DQ Test |
— |
— |