I am aware that Dedupe uses Active learning to remove duplicates and perform Record linkage.
However , I would like to know if we can pass excel sheet with already matched pairs(label data) as the input for active learning?
I am aware that Dedupe uses Active learning to remove duplicates and perform Record linkage.
However , I would like to know if we can pass excel sheet with already matched pairs(label data) as the input for active learning?
Copyright © 2021 Jogjafile Inc.
Not directly.
You'll need to get your data into a format that
markPairs
can consume.Something like:
We do provide a convenience function for getting spreadsheet data into this format
trainingDataDedupe
.(I am an author of dedupe)