Overview
The Fuzzy Merge node is a transformation node in a Weave Workflow that combines and matches two data sets using a fuzzy matching algorithm, joining similar or related records rather than only exact matches to improve data accuracy during integration. It sits in the middle of a Workflow’s processing path between two source nodes (Data Import) and terminal nodes (Result, Output) and emits the merged data downstream.
The node is configured in two places: the node panel sets the Filter Algorithm and a Percentage threshold, and the Configure dialog defines the Matching Criteria and Fuzzy Merge Criteria. The merged output includes the joined columns from both data sets along with per-match score columns produced by the algorithm.
Note: “Workflow” is the in-UI term for what some Weave documentation calls a pipeline. This reference uses “Workflow.”
When to use it
- Matching records across two data sets where keys are similar but not identical (typos, formatting differences, near-duplicates).
- Joining two data sets on a combination of an exact key and one or more fuzzy-matched columns.
- Producing a merged table with per-column match scores for downstream filtering or review.
Configuration
Node panel settings
| Setting | Description |
| Filter Algorithm | The fuzzy matching algorithm applied to the match [Levenshtein distance, Soundex Phonetic, Similarity Score]. |
| Percentage | A threshold value from 1 to 100 for the algorithm. |
| Configure | Opens the Fuzzy Merge dialog where Matching Criteria and Fuzzy Merge Criteria are defined. |
Configure dialog
The Configure dialog defines the criteria as rows. Each row pairs a column from the first data set with a column from the second via a join condition.
| Field | Description |
| Dataset | The source data set for the left and right side of the pairing. |
| Column | The column from each data set used in the match. |
| Join Condition | The condition relating the two columns. |
| Action | Add (+) a criteria row or delete an existing one. |
The dialog has two sections: Matching Criteria and Fuzzy Merge Criteria.
Key behaviors
Fuzzy Merge is a transformation node. Unlike Data Import (source) and Result / Output (terminal), it sits in the middle of the Workflow, receiving two upstream inputs, matching them, and passing the merged result downstream.
Two-stage criteria. Configuration separates Matching Criteria from Fuzzy Merge Criteria.
Threshold-driven matching. The Percentage value tunes how close records must be to count as a match.
Scored output. The merged table carries per-column and aggregate score columns reflecting match strength, supporting downstream review or filtering.