Submitted Questions

  • How do you remove duplicates in dataset

    in how many ways we can delete dataset? If a record is duplicated 3 times then how to get middle duplicated record? Is it advisable to use basic Tfr in Parallel jobs?

    srkreddy111

    • Aug 12th, 2011

    First you have to open the data set and click on the partitioning and after click hash partition and next click perform sort after click on unique and after ok and after compile and run the job.open the target output,the duplicate records are removed..