-
Flat File Header and Footer
Does flatfile contain header & footer?How to remove Header & Footer from a flatfile before extracting the data?What information header & footer contains?
-
Flatfiles Validations
When we are extracting the flatfiles, What are the basic required validations?
-
Transformer Stage Functions
If you have Numerical+Characters data in the source, how will you load only Character data to the target? Which functions will you use in Transformer stage?
-
Nodes Types
What are the types of nodes in datastage?
-
Datastage Parallel Jobs Performance
How will the performance affect if we use more number of Transformer stages in Datastage parallel jobs?
-
Datastage Lookup
By using lookup we join the data from more than one table, we can implement the same using join stage, then what is the need of Lookup stage especially in Datastage?
-
DataStage Job Scheduling
How to schedule a datastage job using cron tab utility?
-
Data Granularity
What is data granularity? Explain.
-
TOAD
What does Toad mean? How it's work?
-
Job Sequences with Restart Ability
How to do job sequences with restart ability?
-
NOCOPY mode
What is the purpose of NOCOPY mode in procedures?
-
Avoid Data Duplication
If in a table we dont use primary key or any other unique key then how to avoid duplication of data?
-
Datastage Parallel Process
How do you make a simple job which reads data from Oracle database and write into a sequential file run in parallel?Since reading of data from database is sequential and sequential file also works sequentially.
-
Link sort/Stage sort
What is the use of sort stage when you have explicit sort stage?
-
Configuration File
What is configuration file? What does it contain? What is the use of it in Datastage?
-
Null Value
How to handle Null value in WHERE Condition?
-
Datastage ETL
What is Datastage ETL?
-
Server Connection
How to connect to the server in datastage?
-
Retrieve Hash File
How to retrieve hash file data from administrator command?
-
Datastage Server Outer Join Facility
How to get outer join facility from Datastage Server?
-
Pass input parameters using UNIX Shell script
How to pass input parameters using UNIX Shell script to DataStage Director?
-
Datastage Parallel Extender
How to extract data from source to staging area using datastage parallel extender?
-
Clear logs
How to clear logs after 5 days, by using a routine?
-
Use of Terminator Activity
What is the Use of Terminator Activity in Datastage server Jobs?
-
Orchestrate Schema
What is Orchestrate Schema? Distinguish internal data type (Orchestrate schema) vs external data type
-
DSParams file
Is it like that if we define the values of variables in DSParams file then there is no need to give the values at job level ar Project level ?& how to configure this file at job level ?so that we need not hardcode the values.....
-
shell scripts in sequencers
how you will call shell scripts in sequencers in datastage
-
Fileset
What for exactly fileset is used?
-
-
BASIC Transformer and NORMAL Transformer
What is the Exact difference between BASIC Transformer and NORMAL Transformer?When we will go for BASIC Or NORMAL Transformer?
-
-
write the shell script using Unix to run the 2 jobs?
write the shell script using Unix to run the 2 jobs?How to schedule the 2 jobs using Unix crontab to run for particular time?
-
How to assign the one path of file to PROJDEF using the job parameters?
At the time to incremental loading I need to take the reusable files' path in this situation How to assign the one path of file to PROJDEF using the job parameters?
-
What is the use of clear status file
What is the use of clear status file in DS?How we will use it?explain with examples?
-
How can we load the flat file
How can i load a flat file into target as fast as i can?Assuming that the source bottleneck is not there,that is there is no performance issues in the source side.
-
Datastage Nodes and CPUs
Please Give Me some small Notes on Nodes and CPUs. I have one question when i am using 4 nodes and 5 CPUS how many Processors will be going to execute. Please help me in this...
-
-
-
Where does director create its log files?
We are spacing a problem of C: getting filled up because of a creation of some file. I think it is because of the director log files.
-
-
-
host name specified is not valid, or the host is not responding
i downloaded data stage at my home pc. when iam trying to open data stage administrator datastage designer or datastage director it showing me an error like "host name specified is not valid, or the host is not responding(81011). could any body can help me out how can i get host name, user name and password, because i didn't mention any host name, username or password while installing the software.
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
What is the Batch Program and how can generate ?
Batch programe is the programe it's generate run time to maintain by the datastage it self but u can easy to change own the basis of your requirement (Extraction, Transformation,Loading) .Batch programe are generate depands your job nature either simple job or sequencer job,You can see this programe on job controll option.
-
How many places u can call Routines?
Four Places u can call (i) Transform of routine (A) Date Transformation (B) Upstring Transformation (ii) Transform of the Before & After Subroutines(iii) XML transformation(iv)Web base trannsformation
-
-
-
When should we use ODS?
DWH's are typically read only, batch updated on a scheduleODS's are maintained in more real time, trickle fed constantly
-
Tell me one situation from your last project, where you had faced problem and How did u solve it?
A. The jobs in which data is read directly from OCI stages are running extremely slow. I had to stage the data before sending to the transformer to make the jobs run faster.B. The job aborts in the middle of loading some 500,000 rows. Have an option either cleaning/deleting the loaded data and then run the fixed job or run the job again from the row the job has aborted. To make sure the load...
-
The above might rise another question: Why do we have to load the dimensional tables first, then fact tables:
As we load the dimensional tables the keys (primary) are generated and these keys (primary) are Foreign keys in Fact tables.
-
Read the String functions in DS
Functions like [] -> sub-string function and ':' -> concatenation operatorSyntax: string [ [ start, ] length ]string [ delimiter, instance, repeats ]
-
How did u connect with DB2 in your last project?
Most of the times the data was sent to us in the form of flat files. The data is dumped and sent to us. In some cases were we need to connect to DB2 for look-ups as an instance then we used ODBC drivers to connect to DB2 (or) DB2-UDB depending the situation and availability. Certainly DB2-UDB is better in terms of performance as you know the native drivers are always better than ODBC drivers. 'iSeries...
-
Where do you use Link-Partitioner and Link-Collector ?
Link Partitioner - Used for partitioning the data.Link Collector - Used for collecting the partitioned data.
-
What versions of DS you worked with?
DS 7.0.2/6.0/5.2
-
What are the often used Stages or stages you worked with in your last project?">
What are the often used Stages or stages you worked with in your last project?
A) Transformer, ORAOCI8/9, ODBC, Link-Partitioner, Link-Collector, Hash, ODBC, Aggregator, Sort.
-
-
Compare and Contrast ODBC and Plug-In stages?
ODBC : a) Poor Performance. b) Can be used for Variety of Databases. c) Can handle Stored Procedures. Plug-In: a) Good Performance. b) Database specific.(Only one database) c) Cannot handle Stored Procedures.
-
Orchestrate Vs Datastage Parallel Extender?
Orchestrate itself is an ETL tool with extensive parallel processing capabilities and running on UNIX platform. Datastage used Orchestrate with Datastage XE (Beta version of 6.0) to incorporate the parallel processing capabilities. Now Datastage has purchased Orchestrate and integrated it with Datastage XE and released a new version Datastage 6.0 i.e Parallel Extender.