dedupe: Python Deduplication Library
dedupe Ranking & Summary
- License: GPL v3
- Price: Free
- Publisher Name: Graham Poulter
- Publisher web site: https://launchpad.net/~graham-poulter
dedupe Description
dedupe is a Python library for finding similar rows in a table of records (e.g. in a database or CSV file) or linking similar rows between two tables. It works in three steps: (1) index the records into blocks, (2) compare all pairs of records in each block with a similarity function, and (3) cluster the comparison pairs into "matches" and "non-matches".
Requirements:
- Python
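The three steps named in the description (blocking, pairwise comparison within each block, and sorting pairs into "matches" and "non-matches") can be illustrated with a minimal sketch that uses only the Python standard library. This is not the dedupe library's API: the function names `block_records`, `similarity`, and `classify_pairs`, the sample records, the choice of `city` as the blocking key, and the 0.8 threshold are all assumptions made for illustration, and a simple threshold stands in for whatever clustering method the library actually uses.

```python
from collections import defaultdict
from difflib import SequenceMatcher
from itertools import combinations


def block_records(records, key_func):
    """Step 1: index records into blocks that share a blocking key."""
    blocks = defaultdict(list)
    for rec_id, rec in records.items():
        blocks[key_func(rec)].append(rec_id)
    return blocks


def similarity(a, b):
    """Step 2: score a pair of records (here: name string similarity in [0, 1])."""
    return SequenceMatcher(None, a["name"].lower(), b["name"].lower()).ratio()


def classify_pairs(records, blocks, threshold=0.8):
    """Step 3 (simplified): split compared pairs into matches and non-matches.

    A fixed threshold is an assumption standing in for a real clustering step.
    """
    matches, non_matches = [], []
    for ids in blocks.values():
        # Only pairs inside the same block are compared.
        for i, j in combinations(sorted(ids), 2):
            score = similarity(records[i], records[j])
            target = matches if score >= threshold else non_matches
            target.append((i, j, score))
    return matches, non_matches


# Hypothetical sample data, invented for this example.
records = {
    1: {"name": "John Smith", "city": "Cape Town"},
    2: {"name": "Jon Smith", "city": "Cape Town"},
    3: {"name": "Jane Smythe", "city": "Cape Town"},
    4: {"name": "Mary Jones", "city": "Durban"},
}

blocks = block_records(records, key_func=lambda rec: rec["city"])
matches, non_matches = classify_pairs(records, blocks)

for i, j, score in matches:
    print(f"match: {records[i]['name']} / {records[j]['name']} ({score:.2f})")
```

Blocking keeps the number of comparisons down: record 4 sits alone in its block, so it is never compared with the other three records.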