PolyFuzz — Advanced Fuzzy Matching Framework

1 min read

Alternative to fuzzy matching techniques for NLP to enhancing performance

Working with natural language processing we might faced a lot of scenario to use various string matching techniques. Mostly we use fuzzy matching techniques to find the closes match of a string from a database or in some other cases we use them to understand the typo errors, mistranslations etc.

Other than using fuzzy libraries for string matching use case we often use edit distance method, levenshtein distance method, TF-IDF character based n-gram method, word embedding method to understand the meaning and to match between words of strings.

Let’s move step by step. For installing polyfuzz you have different methods which you can see below.


How to make it work?

Yeah! So consider you have two sets of strings
[happily, happy, hippy, holi, holiday, holidays, cool, school, fool] and another one [happy, holiday, schools] . Consider if we want to find the similarity based on their edit distance method. So this is how we can do it with polyfuzz.


By doing this you will get a result as such

Image for post
Generated By Author

What others features does this offer you?

It comes up with grouping and clustering of matches. From the previous results you can see there was a chance of grouping some strings together. PolyFuzz gives you the ability to do so.


You can see the results below were input strings are grouped together.

Image for post
Generated By Author

Also there is a chance of putting them together in clusters which you can do using PolyFuzz with very little effort.


You can see the cluster below in which some strings are grouped together.

Image for post
Generated By Author

PolyFuzz also has few Models implemented in it. This includes RapidFuzz, EditDistance, TF-IDF, FastText and GloVe, 🤗 Transformers.

You can use this model based on your requirements for string matching, grouping and clustering.

Check out the below repo link to see how you can use them and make it useful for you.


PolyFuzz performs fuzzy string matching, string grouping, and contains extensive evaluation functions. PolyFuzz is…


Raoof Naushad Artificial Intelligence @ Accubits Proud Engineer. My goal is not-for-profit. I believe in Goodness. Fighting for and alongside my people. Changing the world always overshadows income. Learning. Always,always,always,always,always. sports,athletics,travel

Leave a Reply

Your email address will not be published.