record deduplication

Record deduplication for entity distribution modeling in ASR transcripts