Deduplicating Company Records in a Multi-Source Data Centralization Project using dbt, Google BigQuery or Snowflake
One of the most common tasks in a data centralization project is to create single, deduplicated records for each of the companies, contacts, products and other entities the business interacts with. Doing this allows you to connect sales activity from Salesforce and Hubspot to project delivery and invoicing data from Jira and Xero, for example, and this article shows you how.