
Tag - research data

Fuzzy String Matching Using FuzzyWuzzy

String comparison is a key step in data pre-processing, but Excel functions such as MATCH and VLOOKUP struggle with fuzzy string matching, where the strings being compared are similar but not identical. In this post, let’s explore how the Python library "FuzzyWuzzy" overcomes these limitations.
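
To give a rough idea of what fuzzy matching means in practice, here is a minimal sketch. It does not use FuzzyWuzzy itself: it relies on `difflib.SequenceMatcher` from the Python standard library as a stand-in for a similarity score, and the helper names `similarity` and `best_match`, as well as the sample list, are made up for illustration.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> int:
    """Return a 0-100 similarity score, ignoring case and surrounding whitespace."""
    a, b = a.strip().lower(), b.strip().lower()
    return round(100 * SequenceMatcher(None, a, b).ratio())


def best_match(query: str, choices: list[str]) -> str:
    """Return the choice with the highest similarity score to the query."""
    return max(choices, key=lambda choice: similarity(query, choice))


# Hypothetical sample data: the query is close to one entry but not identical,
# so an exact lookup would find nothing.
names = [
    "University of Hong Kong",
    "Hong Kong Polytechnic University",
    "City University of Hong Kong",
]
print(best_match("Univ. of Hong Kong", names))
```

An exact-match lookup returns nothing unless the strings are identical, whereas a score-based comparison like this one ranks every candidate by how close it is to the query.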

Dataset Citation – When and How

Research data and datasets are becoming increasingly important in the scientific process. As more researchers make their data open and reusable, proper citation is crucial for giving credit to the authors and acknowledging the origin of the data.