More On

Getting Rid of Deja-vu

One day last week, in three meetings with three different clients, I heard the same questions about duplicates in document collections. Ironically, the problem is worse in collections that come from paper, or from a mix of paper and electronic sources.

Purely electronic document collections are, for all their other problems, easily de-duped, and the trend these days is to de-dupe not just within each custodian's documents but, preferably, across the entire database. And the really good news is that full-scale de-duping can get rid of a lot more than you might have guessed.
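
To make the difference between the two scopes concrete, here is a minimal sketch in Python. It rests on an assumption the article does not state: that two electronic documents count as duplicates when their content is byte-for-byte identical, detected by comparing SHA-256 hashes. The `Doc` record, the `dedupe` function, and the sample data are all hypothetical, invented only for illustration; real review platforms may use different matching rules.

```python
import hashlib
from typing import List, NamedTuple


class Doc(NamedTuple):
    """Hypothetical record: one document held by one custodian."""
    doc_id: str
    custodian: str
    content: bytes


def dedupe(docs: List[Doc], across_custodians: bool) -> List[Doc]:
    """Keep the first copy of each document, dropping exact duplicates.

    Assumption: "duplicate" means identical bytes, compared by SHA-256.
    With across_custodians=False, a copy is dropped only if the same
    custodian already holds it. With across_custodians=True, a copy is
    dropped if anyone in the collection already holds it.
    """
    seen = set()
    kept = []
    for doc in docs:
        digest = hashlib.sha256(doc.content).hexdigest()
        # The key decides the scope of the comparison.
        key = digest if across_custodians else (doc.custodian, digest)
        if key not in seen:
            seen.add(key)
            kept.append(doc)
    return kept


# Made-up sample collection: the same memo appears three times.
docs = [
    Doc("A-1", "alice", b"Q3 forecast memo"),
    Doc("A-2", "alice", b"Q3 forecast memo"),
    Doc("B-1", "bob", b"Q3 forecast memo"),
    Doc("B-2", "bob", b"Lease draft"),
]

within_ids = [d.doc_id for d in dedupe(docs, across_custodians=False)]
global_ids = [d.doc_id for d in dedupe(docs, across_custodians=True)]
print(within_ids)
print(global_ids)
```

In this toy collection, de-duping within custodians removes only the second copy held by the same person, while de-duping across the database also removes the copy held by the other custodian. An exact-hash comparison of this kind would not catch scanned paper, where two scans of the same page rarely produce identical bytes, which is consistent with the point that paper-source collections are the harder case.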

Clifford F. Shnier
