520 Forks
3764 Stars
3764 Watchers

dedupe

:id: A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

How to download and setup dedupe

Open terminal and run command
git clone https://github.com/dedupeio/dedupe.git
git clone is used to create a copy or clone of dedupe repositories. You pass git clone a repository URL.
it supports a few different network protocols and corresponding URL formats.

Also you may download zip file with dedupe https://github.com/dedupeio/dedupe/archive/master.zip

Or simply clone dedupe with SSH
[email protected]:dedupeio/dedupe.git

If you have some problems with dedupe

You may open issue on dedupe support forum (system) here: https://github.com/dedupeio/dedupe/issues