Lookup Stage Partitioning

With a Lookup stage the input stage has 1million records and reference link has 1million records. What is the best Partition method and why?

Questions by rajivkumar23us   answers by rajivkumar23us

Showing Answers 1 - 12 of 12 Answers

anujk_arora

  • Mar 19th, 2009
 

I believe it depends upon the system configuration as the data in the reference link is loaded into the main Memory while processing. If the total consumption of the memory passing through the reference link is above the total capacity of the machine, in that case we should look for alternatives. It could be hash partitioning on reference link along with the main link. Having said this it all depends upon the logic. So, i believe, nothing is ideal for the lookup stage paritioning although entire is a default technique but should be changed if the situation demands.

  Was this answer useful?  Yes

gopalanrr

  • Mar 26th, 2009
 

When the reference link is too high in terms of records fetched, you should be performing a join instead of performing a lookup

  Was this answer useful?  Yes

Better to use JOIN or MERGE stage in place of Lookup. If input data is low in compare to reference data then Lookup is recommendable. Assumption is 1:100 or more.

  Was this answer useful?  Yes

Give your answer:

If you think the above answer is not correct, Please select a reason and add your answer below.

 

Related Answered Questions

 

Related Open Questions