GeekInterview.com

DataStage | Question 243 of 377
1. What is the difference between a hash file and a sequential file? What is modulus?
2. What are the Iconv and Oconv functions?
3. How can we join an Oracle source and a sequential file?
4. How can we implement Slowly Changing Dimensions in DataStage?
5. How can we implement a lookup in DataStage Server jobs?
6. What third-party tools are used with DataStage?
7. What is the difference between a routine, a transform, and a function?
8. What are job parameters?
9. What is a plug-in?
10. How can we improve the performance of DataStage jobs?
11. How can we create containers?
12. What are system variables?
13. What is the use of Usage Analysis?
14. What are the different ways a project can be moved to production? Examples: export-import and version control.
15. What database does the DataStage repository use? Answer: UniVerse database.
16. How is scheduling done in the project?
17. Which version of DataStage is used in the project?
18. What performance tuning is required when dealing with large data volumes?
19. What does the reject option in the Transformer do?
20. What is the architecture of DataStage?
21. How do you define and use job parameters?
22. What are stage variables, system variables, and environment variables?
23. How do you use routines in DataStage?
24. What is the difference between a shared container and a local container?
25. How do you connect to Oracle?
26. Please explain any ETL process that you have developed.
27. What is a hash file? What are the types of hash files?
28. If you change a shared container, will the change be reflected in all the jobs that use it?
29. Have you written any custom routines in your project? If so, explain.
30. How do you get log information into a file?
31. What is a before-job subroutine / after-job subroutine? When do you use them?
32. How do you back up and restore a project?
33. What is Clear Status File, and when do you use it?
34. What is Cleanup Resources, and when do you use it?
35. Can I join a flat file and an Oracle table and load the result into Oracle? Is this possible?
36. If the load into the target stops suddenly because of a problem, how can you restart loading from the records that were left?
37. What general problems do you face in DataStage?
38. What reports can be generated using DataStage?
39. How do you remove blank spaces from data?
40. What are active and passive stages?
41. Which stages have you used in your project?
42. Can DataStage generate test cases?
43. What is the difference between a hash file and a sequential file?
44. What is the difference between a transform and a routine?
45. What is a sequencer?
46. How do you take a backup of a project?
These are some DataStage PX job questions that may be asked in interviews.
1) What are the types of parallel processing?
2) What are SMP (symmetric multiprocessing) and MPP (massively parallel processing)?
3) What are CPU-limited, memory-limited, and disk I/O-limited jobs?
4) Can pipeline and partition parallelism be combined?
5) What are the advantages of PX over server jobs?
6) Is it possible to create a user-defined stage in PX?
7) Can I use a hash file in PX?
8) What is the Surrogate Key stage?
1) What is the use of APT_DUMP_SCORE? Ans: To write messages to the log such as the number of processes and the number of nodes used.
2) What are the four types of join possible in the Join stage? Ans: Inner, left outer, right outer, full outer.
3) What are the components of APT_CONFIG_FILE? Ans: Nodes, fastname, pools, resource.
4) What points need to be considered when creating the config file?
Ans: Available nodes, CPU time, available memory, what other processes will run on the same nodes, and whether there are any configuration restrictions (e.g. the DB only runs on certain nodes and ETL cannot run on them). Get a breakdown of the resource usage. Is the hardware configuration SMP, cluster, or MPP?
5) When are wrappers created? Ans: Only for executable commands (UNIX, DOS).
6) When are buildops created? Ans: When more functionality or complex logic is needed.
7) When are custom stages created? Ans: When new operators are needed that are not in EE.
8) What are the different job sequencer stages?
9) What are the Iconv and Oconv functions?
10) Can we implement Slowly Changing Dimensions in DataStage? What are job parameters?
20) How is parallelism executed?
21) What is RCP?
22) What is Orchestrate?
23) What is the difference between the Join, Merge, and Lookup stages?
24) What is a data set?
25) What is the difference between a data set, a file set, and a lookup file set?


Questions on Data Warehousing concepts

1. What is a data warehouse?
2. What is the difference between a data warehouse and a data mart?
3. What is a star schema?
4. What is a snowflake schema?
5. What are facts and dimensions?
6. What is a surrogate key?
7. What is normalisation? Explain third normal form.
8. What is the difference between OLTP and OLAP?
9. Are you involved in data modeling? If yes, which tool or technique are you using?
10. Which schema modeling techniques have you used?
11. What do you mean by a summary table?
12. What are degenerate dimensions?
13. What is a factless fact table?


Oracle questions based on data warehousing

1. What is parallel execution?
2. What are bitmap and B-tree indexes? Explain local vs. global variables.
3. What is a materialised view?
4. What is page size / array size in Oracle?
5. What are integrity constraints?
6. How can one tune SQL statements in Oracle?

  
Total Answers and Comments: 1 Last Update: February 13, 2007     Asked by: Zias 
  
February 12, 2007 20:36:33   #1  
Vinod        

RE: 1 1. Difference between Hashfile and Sequential Fi...
The records in a sequential file are organized serially, one after another; they may be ordered or unordered. The hashed file access method instead scatters records throughout the RMS data file. When a hashed RMS file is created, the maximum number of records it will contain must be declared. When a record is added, its primary key value is transformed into a number between one and that maximum, and RMS attempts to place the record at that location. If a record already exists there, a collision has occurred and the record must be placed elsewhere.
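
The placement described above can be sketched in a few lines of Python. This is only an illustration, not the RMS algorithm: the class name `HashedFile`, the byte-sum hash, and the "try the next slot" collision rule are all assumptions chosen for brevity. The modulus operation is what maps a key onto a slot number between 1 and the declared maximum.

```python
class HashedFile:
    """Toy fixed-size hashed file; slot numbers run from 1 to max_records."""

    def __init__(self, max_records):
        # The maximum record count is declared when the file is created.
        self.max_records = max_records
        self.slots = [None] * max_records

    def slot_for(self, key):
        # Transform the key into a number between 1 and max_records.
        # Summing the key's bytes is an illustrative hash, not the RMS one.
        return sum(str(key).encode("utf-8")) % self.max_records + 1

    def add(self, key, record):
        start = self.slot_for(key)
        for offset in range(self.max_records):
            # Assumed collision rule: step to the next slot, wrapping around.
            slot = (start - 1 + offset) % self.max_records + 1
            if self.slots[slot - 1] is None:
                self.slots[slot - 1] = (key, record)
                return slot
        raise RuntimeError("hashed file is full")

    def get(self, key):
        start = self.slot_for(key)
        for offset in range(self.max_records):
            slot = (start - 1 + offset) % self.max_records + 1
            entry = self.slots[slot - 1]
            if entry is None:
                return None  # an empty slot ends the search
            if entry[0] == key:
                return entry[1]
        return None
```

With `HashedFile(5)`, the keys "ab" and "ba" have the same byte sum, so the second one added collides with the first and lands in the following slot. A lookup goes straight to the computed slot, whereas a sequential file would have to be read record by record.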
 

 Related Questions

Data modelling is broadly classified into 2 types: a) E-R diagrams (entity-relationships), b) dimensional modelling.
Latest Answer: Dimensional modelling means defining a data warehouse architecture using any one of the available schemas. Dimensional modelling has 3 types: 1. conceptual model 2. logical model 3. physical model. Conceptual model: gathering of all the requirements. Logical ...

a) Star Schema - Simple and much faster. Denormalized form. b) Snowflake Schema - Complex, with more granularity. More normalized form.
Latest Answer: Dimensional modelling consists of 3 phases: a. Conceptual Modelling b. Logical Modelling c. Physical Modelling ...

Surrogate Key is a primary key for a dimension table. Its main advantage is that it is independent of the underlying database, i.e. a surrogate key is not affected by changes going on in the database.
Latest Answer: A surrogate key is simply a primary key that gives a serial number to every record. It plays a major role in Type 2 scenarios. If you have any updates to be applied on a database, they will be directly reflected in the data warehouse part (using surrogate ...
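
As a rough sketch of the idea, the following Python hands out serial surrogate keys that do not depend on the source system's own keys, and issues a fresh key each time a new Type 2 version of a row is recorded. The class and method names (`SurrogateKeyGenerator`, `new_version`, `lookup`) and the sample natural keys are hypothetical, chosen only for illustration.

```python
import itertools


class SurrogateKeyGenerator:
    """Assigns serial surrogate keys, independent of the source keys."""

    def __init__(self, start=1):
        self._counter = itertools.count(start)
        self.current = {}  # natural key -> surrogate key of the current row

    def new_version(self, natural_key):
        # Every new dimension row, including a new Type 2 version of an
        # existing natural key, receives the next serial number.
        key = next(self._counter)
        self.current[natural_key] = key
        return key

    def lookup(self, natural_key):
        # Surrogate key of the current version, or None if never loaded.
        return self.current.get(natural_key)


keys = SurrogateKeyGenerator()
keys.new_version("CUST-001")   # first customer row
keys.new_version("CUST-002")   # second customer row
keys.new_version("CUST-001")   # changed customer: new Type 2 version
```

Fact rows loaded after the change would join to the key returned by `lookup("CUST-001")`, while older fact rows keep pointing at the earlier version.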

Data in a database is a) detailed or transactional, b) both readable and writable, c) current.
Latest Answer: Database: 1. It contains current data. DWH: 1. It contains current and historical data. ...

Fact table - Table with a collection of foreign keys corresponding to the primary keys in the dimension tables. Consists of fields with numeric values. Dimension table - Table with a unique primary key. Load

Orchestrate itself is an ETL tool with extensive parallel processing capabilities, running on the UNIX platform. DataStage used Orchestrate with DataStage XE (beta version of 6.0) to incorporate the parallel

Primary Key is a combination of unique and not null. It can be a collection of key values, called a composite primary key. Partition Key is just a part of the Primary Key. There are several methods of partition
Latest Answer: Hi: The primary key is the key we define on a table column or set of columns (composite PK) to make sure all the rows in a table are unique. The partition key is the key we use when partitioning the table (in the database) or processing the source records in ETL (in the ...
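
To make the role of a partition key concrete, here is a small Python sketch of hash partitioning, one commonly used method (the truncated text above does not say which methods it had in mind, so this choice is an assumption). The function names and the use of CRC32 as the hash are illustrative only.

```python
import zlib


def partition_for(key, num_partitions):
    # CRC32 gives the same result on every run, unlike Python's built-in
    # hash() for strings; the modulus maps it onto a partition number.
    return zlib.crc32(str(key).encode("utf-8")) % num_partitions


def partition_rows(rows, key_field, num_partitions):
    """Distribute rows so that equal partition-key values stay together."""
    partitions = [[] for _ in range(num_partitions)]
    for row in rows:
        index = partition_for(row[key_field], num_partitions)
        partitions[index].append(row)
    return partitions


rows = [
    {"order_id": 1, "customer": "alice"},
    {"order_id": 2, "customer": "bob"},
    {"order_id": 3, "customer": "alice"},
]
parts = partition_rows(rows, "customer", 4)
```

Here `order_id` plays the part of the primary key (unique per row), while `customer` is the partition key: both of alice's orders are sent to the same partition, so a join or aggregation on customer can be done within each partition independently.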

Use the "dsjob" command as follows: dsjob -run -jobstatus projectname jobname
Latest Answer: You can use the dsjob executable from UNIX or the command line. The previous post was correct. Hanu. ...
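
If the job has to be started from a script rather than typed by hand, the same command line can be wrapped in Python. This is a minimal sketch: the helper names `build_dsjob_command` and `run_job` are hypothetical, and it assumes the `dsjob` executable is on the PATH of the machine running the script. It passes only the options quoted above.

```python
import subprocess


def build_dsjob_command(project, job, dsjob_path="dsjob"):
    # Mirrors: dsjob -run -jobstatus projectname jobname
    return [dsjob_path, "-run", "-jobstatus", project, job]


def run_job(project, job, dsjob_path="dsjob"):
    """Run the job and return (exit code, stdout, stderr) for the caller."""
    result = subprocess.run(
        build_dsjob_command(project, job, dsjob_path),
        capture_output=True,
        text=True,
    )
    return result.returncode, result.stdout, result.stderr
```

Passing the arguments as a list avoids shell quoting problems with project or job names. How the exit code should be interpreted is left to the caller, since that depends on the dsjob documentation for the installed version.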

Stage Variable - An intermediate processing variable that retains its value during read and doesn't pass the value into a target column. Derivation - Expression that specifies the value to be passed on to the target
Latest Answer: All are correct ...

Default cache size is 256 MB. We can increase it by going into DataStage Administrator, selecting the Tunables tab, and specifying the cache size there.
Latest Answer: The default cache size is 128 MB. This is primarily used for the hash file data cache on the server. This setting can only be changed in Administrator, not at job level. Job-level tuning is available only for buffer size. ...


Copyright © 2005 - 2008 GeekInterview.com. All Rights Reserved