Table.AddFuzzyClusterColumn
Introduces a new column, newColumnName, to a table containing representative values from the columnName column. These representatives are determined through a fuzzy matching process for each row. An ...
Table.AddFuzzyClusterColumn(table as table, columnName as text, newColumnName as text, optional options as nullable record) as table
Introduces a new column, newColumnName, to a table containing representative values from the columnName column. These representatives are determined through a fuzzy matching process for each row.
An optional set of parameters can be specified to define how the key columns are compared. These options include:
- Culture: Allows for the grouping of records based on culture-specific rules. It can be designated with a valid culture name, such as "ja-JP" for the Japanese culture. The default setting is "", which groups based on the Invariant English culture.
- IgnoreCase: A boolean value (true/false) enabling case-insensitive key grouping. For example, setting this to true would result in "Grapes" being grouped with "grapes". The default value is true.
- IgnoreSpace: A boolean value (true/false) facilitating the combination of text portions to identify groups. For instance, when set to true, "Gra pes" would be considered equivalent to "Grapes". The default value is true.
- SimilarityColumnName: A designated name
Determine the standardized location values for employees using Power Query M.
MTable.AddFuzzyClusterColumn(
Table.FromRecords(
{
[EmployeeID = 1, Location = "Seattle"],
[EmployeeID = 2, Location = "seattl"],
[EmployeeID = 3, Location = "Vancouver"],
[EmployeeID = 4, Location = "Seatle"],
[EmployeeID = 5, Location = "vancover"],
[EmployeeID = 6, Location = "Seattle"],
[EmployeeID = 7, Location = "Vancouver"]
},
type table [EmployeeID = nullable number, Location = nullable text]
),
"Location",
"Location_Cleaned",
[IgnoreCase = true, IgnoreSpace = true]
)
OutputTable.FromRecords(
{
[EmployeeID = 1, Location = "Seattle", Location_Cleaned = "Seattle"],
[EmployeeID = 2, Location = "seattl", Location_Cleaned = "Seattle"],
[EmployeeID = 3, Location = "Vancouver", Location_Cleaned = "Vancouver"],
[EmployeeID = 4, Location = "Seatle", Location_Cleaned =