Question: Sequential file with Duplicate Records
Answer: A sequential file has 8 records in a single column. The column values, separated by spaces, are: 1 1 2 2 3 4 5 6
In a parallel job, after reading the sequential file, two more sequential files should be created: one with the duplicate records and the other with the records that have no duplicates.
File 1 records, separated by spaces: 1 1 2 2
File 2 records, separated by spaces: 3 4 5 6
How would you do it?
October 10, 2009 06:16:42
#4
nagoosk
Member Since: November 2007 Total Comments: 14
RE: Sequential file with Duplicate Records
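1) There is a stage called the Remove Duplicates stage, through which we can delete the duplicate records.
2) Use the Aggregator stage and specify, as the grouping key, the particular column on which you want to remove the duplicates.
Note that either stage on its own leaves one row per value (1 2 3 4 5 6). To get the two files asked for, the Aggregator would also need to count the rows per key, so that a later stage can separate the values with a count greater than 1 from those with a count of 1.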
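To make the expected result concrete, here is a minimal Python sketch of the split the question asks for. It is not DataStage code and not the poster's method; it only illustrates the count-per-key idea, and the function name `split_by_duplication` is made up for this example.

```python
from collections import Counter


def split_by_duplication(values):
    """Return (duplicates, uniques), keeping the input order.

    Hypothetical helper for illustration only: it counts rows per key,
    then routes each row by whether its key occurs more than once.
    """
    counts = Counter(values)  # rows per key, like a group-by row count
    duplicates = [v for v in values if counts[v] > 1]
    uniques = [v for v in values if counts[v] == 1]
    return duplicates, uniques


records = "1 1 2 2 3 4 5 6".split()
file1, file2 = split_by_duplication(records)
print(" ".join(file1))  # 1 1 2 2
print(" ".join(file2))  # 3 4 5 6
```

In a parallel job the same two steps would presumably map to a row count per key followed by a condition on that count, with the count attached back to the original rows so that both copies of 1 and 2 reach File 1.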