Geeks Talk


How to remove duplicate records from flat files?

#1  ikhan, 10-14-2008

In a parallel job or in a server job?
Can anyone explain clearly?

#2  sundeepksk, 10-22-2008

In a server job, you can remove duplicates with a hashed file: define as the key the column (or columns) on which you want duplicates eliminated. In parallel jobs there is a separate stage called the Remove Duplicates stage. There, too, you select the column on which duplicates should be removed, and you can also choose which instance of each duplicate to retain.
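
To make the key-based idea concrete, here is a minimal sketch in plain Python, not in the ETL tool itself. It assumes a comma-delimited flat file with a header row; the function name `dedupe_by_key`, the `retain` parameter, and the sample data are made up for illustration. A dictionary keyed on the chosen columns plays the role of the keyed lookup, and `retain` mimics choosing which instance to keep.

```python
import csv
import io


def dedupe_by_key(rows, key_columns, retain="first"):
    """Keep one row per distinct key, ordered by first appearance of each key."""
    if retain not in ("first", "last"):
        raise ValueError("retain must be 'first' or 'last'")
    kept = {}
    for row in rows:
        key = tuple(row[col] for col in key_columns)
        # 'last': later rows overwrite earlier ones with the same key.
        # 'first': a row is stored only if its key has not been seen yet.
        if retain == "last" or key not in kept:
            kept[key] = row
    return list(kept.values())


# Hypothetical sample flat file: id 1 appears twice.
SAMPLE = "id,name\n1,ann\n2,bob\n1,anne\n"
rows = list(csv.DictReader(io.StringIO(SAMPLE)))

first = dedupe_by_key(rows, ["id"], retain="first")
last = dedupe_by_key(rows, ["id"], retain="last")
print([r["name"] for r in first])  # ['ann', 'bob']
print([r["name"] for r in last])   # ['anne', 'bob']
```

Note that only the key columns decide what counts as a duplicate here, so two rows with the same id but different names are still treated as duplicates.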

#3  chak_raaj, 11-12-2008

Quote (originally posted by ikhan):
In a parallel job or in a server job?
Can anyone explain clearly?

You can remove the duplicate records by selecting the "Select Distinct" check box in the Source Qualifier (Edit > Properties). That way the duplicates are removed in the Source Qualifier itself, before the source rows are passed on to the target.
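
As a rough illustration of what a distinct option does, here is a small Python sketch, again outside the tool. It treats two records as duplicates only when the whole line is identical, which is the usual meaning of "distinct". The function name `select_distinct` and the sample lines are assumptions for the example; unlike a database DISTINCT, this sketch deliberately preserves the original order.

```python
def select_distinct(lines):
    """Return each distinct record once, in order of first appearance."""
    seen = set()
    unique = []
    for line in lines:
        record = line.rstrip("\r\n")  # ignore line-ending differences
        if record not in seen:
            seen.add(record)
            unique.append(record)
    return unique


# Hypothetical flat-file lines: "a,1" is repeated exactly once.
sample_lines = ["a,1\n", "b,2\n", "a,1\n", "a,2\n"]
distinct_records = select_distinct(sample_lines)
print(distinct_records)  # ['a,1', 'b,2', 'a,2']
```

Here "a,1" and "a,2" both survive because they differ in the second field; to collapse them you would need a key-based approach instead.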

#4  neeraj82, 07-03-2009

Hi,
You can remove duplicates with the Aggregator stage in a server job, and with the Remove Duplicates stage in a parallel job, based on the primary key columns.
Thanks
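
The aggregator approach amounts to grouping rows by the key columns and emitting one row per group. A minimal Python sketch of that idea follows; the function name `aggregate_by_key` and the extra `duplicate_count` field are hypothetical and only there to show that each group collapses to a single output row.

```python
from itertools import groupby


def aggregate_by_key(rows, key_columns):
    """Group rows by key and emit one row per group with a duplicate count."""
    def key_of(row):
        return tuple(row[col] for col in key_columns)

    result = []
    # groupby only merges adjacent rows, so sort by key first.
    # sorted() is stable, so rows within a group keep their input order.
    for _, group in groupby(sorted(rows, key=key_of), key=key_of):
        members = list(group)
        out = dict(members[0])  # keep the first row of each group
        out["duplicate_count"] = len(members)
        result.append(out)
    return result


# Hypothetical input rows: id "1" appears twice.
sample_rows = [
    {"id": "2", "name": "bob"},
    {"id": "1", "name": "ann"},
    {"id": "1", "name": "anne"},
]
aggregated = aggregate_by_key(sample_rows, ["id"])
print(aggregated)
```

Unlike a dictionary-based pass, this version returns the rows sorted by key, which is a side effect of grouping on sorted input.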