Modern Data Stack Mark Rittman Modern Data Stack Mark Rittman

Deduplicating Company Records in a Multi-Source Data Centralization Project using dbt, Google BigQuery or Snowflake

One of the most common tasks in a data centralization project is to create single, deduplicated records for each of the companies, contacts, products and other entities the business interacts with. Doing this allows you to connect sales activity from Salesforce and Hubspot to project delivery and invoicing data from Jira and Xero, for example, and this article shows you how.

Read More