dedupe

Python Deduplication Library
Download

dedupe Ranking & Summary

Advertisement

  • Rating:
  • License:
  • GPL v3
  • Price:
  • FREE
  • Publisher Name:
  • Graham Poulter
  • Publisher web site:
  • https://launchpad.net/~graham-poulter

dedupe Tags


dedupe Description

Python Deduplication Library dedupe is a Python library for finding similar rows in a table of records (e.g. in a database or CSV file) or linking similar rows between two tables.(1) index the records into blocks,(2) compare all pairs of records in each block with a similarity function and(3) cluster the comparison pairs into "matches" and "non-matches". Requirements: · Python


dedupe Related Software