A collector runs outside the data tools, so it applies several strategies to create data observations.

[Figure: high-level view of those strategies]

Consequently, a collector automatically generates data observations for all applications running in the data tools. Users who develop the data application or operate the data tool no longer have to generate data observations themselves.

The information that collectors rely on is generally produced only after the data tools have fully executed the data applications.
Hence, the data observations generated by collectors are often desynchronized from the data application runtime, and circuit breakers, which must act while the application is still running, are not available.
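
The timing described above can be illustrated with a minimal sketch. It is not the Kensu collector API: the `Observation` class, the `collect` function, and the shape of the `run` metadata are all hypothetical, chosen only to show a collector that reads what a data tool leaves behind once a run has finished.

```python
from __future__ import annotations

from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass
class Observation:
    """Hypothetical data observation derived from a finished run."""
    application: str
    data_source: str
    kind: str                  # "input" or "output"
    row_count: int
    run_finished_at: datetime  # when the data tool finished the run
    observed_at: datetime      # when the collector produced the observation


def collect(run_metadata: dict, now: datetime) -> list[Observation]:
    """Turn the metadata of a finished run into data observations.

    The collector sits outside the data tool, so it only sees runs
    that have already completed.
    """
    if run_metadata["status"] != "finished":
        return []  # nothing to read until the tool has fully executed the run

    finished_at = datetime.fromisoformat(run_metadata["finished_at"])
    observations = []
    for key, kind in (("inputs", "input"), ("outputs", "output")):
        for source in run_metadata.get(key, []):
            observations.append(
                Observation(
                    application=run_metadata["application"],
                    data_source=source["name"],
                    kind=kind,
                    row_count=source["row_count"],
                    run_finished_at=finished_at,
                    observed_at=now,
                )
            )
    return observations


# Hypothetical metadata, as a data tool might expose it after a run.
run = {
    "application": "daily_sales_report",
    "status": "finished",
    "finished_at": "2024-01-01T02:00:00+00:00",
    "inputs": [{"name": "sales.orders", "row_count": 1200}],
    "outputs": [{"name": "reports.daily_sales", "row_count": 31}],
}

observations = collect(run, now=datetime(2024, 1, 1, 2, 5, tzinfo=timezone.utc))
for obs in observations:
    lag = obs.observed_at - obs.run_finished_at
    print(f"{obs.kind} {obs.data_source}: {obs.row_count} rows, observed {lag} after the run")
```

In this sketch every observation carries a timestamp later than the end of the run, so a rule evaluated on it could raise an alert but could not stop the application that produced the data.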

Ref: See also Fundamentals of Data Observability, chapter 6

