Table.AddFuzzyClusterColumn

Introduces a new column, newColumnName, to a table containing representative values from the columnName column. These representatives are determined through a fuzzy matching process for each row. An ...

Syntax

Table.AddFuzzyClusterColumn(table as table, columnName as text, newColumnName as text, optional options as nullable record) as table

About

Introduces a new column, newColumnName, to a table containing representative values from the columnName column. These representatives are determined through a fuzzy matching process for each row. An optional set of parameters can be specified to define how the key columns are compared. These options include: - Culture: Allows for the grouping of records based on culture-specific rules. It can be designated with a valid culture name, such as "ja-JP" for the Japanese culture. The default setting is "", which groups based on the Invariant English culture. - IgnoreCase: A boolean value (true/false) enabling case-insensitive key grouping. For example, setting this to true would result in "Grapes" being grouped with "grapes". The default value is true. - IgnoreSpace: A boolean value (true/false) facilitating the combination of text portions to identify groups. For instance, when set to true, "Gra pes" would be considered equivalent to "Grapes". The default value is true. - SimilarityColumnName: A designated name

Explanation

Determine the standardized location values for employees using Power Query M. MTable.AddFuzzyClusterColumn( Table.FromRecords( { [EmployeeID = 1, Location = "Seattle"], [EmployeeID = 2, Location = "seattl"], [EmployeeID = 3, Location = "Vancouver"], [EmployeeID = 4, Location = "Seatle"], [EmployeeID = 5, Location = "vancover"], [EmployeeID = 6, Location = "Seattle"], [EmployeeID = 7, Location = "Vancouver"] }, type table [EmployeeID = nullable number, Location = nullable text] ), "Location", "Location_Cleaned", [IgnoreCase = true, IgnoreSpace = true] ) OutputTable.FromRecords( { [EmployeeID = 1, Location = "Seattle", Location_Cleaned = "Seattle"], [EmployeeID = 2, Location = "seattl", Location_Cleaned = "Seattle"], [EmployeeID = 3, Location = "Vancouver", Location_Cleaned = "Vancouver"], [EmployeeID = 4, Location = "Seatle", Location_Cleaned =