| |
GeekInterview.com > Interview Questions > Data Warehousing > DataStage
| Print | |
Question: Sequential file with Duplicate Records
Answer: A sequential file has 8 records with one column, below are the values in the column separated by space, 1 1 2 2 3 4 5 6
In a parallel job after reading the sequential file 2 more sequential files should be created, one with duplicate records and the other without duplicates. File 1 records separated by space: 1 1 2 2 File 2 records separated by space: 3 4 5 6 How will you do it |
| August 08, 2009 06:10:30 |
#3 |
| rameshkm |
Member Since: April 2009 Total Comments: 7 |
RE: Sequential file with Duplicate Records |
By Using Transformer the data from source sequential file is segregate in to two links (Link A and Link B) the link A is followed by Aggregator, the Aggregator type is set to be count rows , and count output column name is XXX, then perform left outer join with the Link B and link from aggregator , after that by using transformer we segregate the data as two by using constraints as XXX =1 and XXX >1, so we get out put as 1122 and 34536 |
| |
Back To Question | |