The ETL Help Guide: Frequently Asked Questions

Questions on Configuration File and Partitioning

1) What is Configuration file, its contents, usage, and node pools?

2) What is difference between resource desk and resource scratch desk space in configuration file.

3) DataStage Collector methods, which collector method to use if records should be collected in a sequential way by 3 target files?

4) What all stages require Hash partitioning and why ?

5) What is the difference between Hash and Modulus partitioning ?

6) What will happen if the data is not sorted on keys before join stage ?

7) In a parallel job with 4 nodes configuration file, containing 1 sequential file stage, 2 transformers and 1 odbc stage. how many processes will be generated ?

Questions on Datastage Components

1) What is the difference between look up, join and merge ?

2) Dataset Descriptor file and Data file, How to check the location of data file. How to check no. of records in dataset from unix command line?

3) What will happen if a dataset which is created by 4 nodes config file, is read by job of 3 nodes config file ?

4) What is Copy stage force option ?

5) How to do null handling in Modify stage ?

6) What is the difference between key change and cluster key change in sort stage ?

7) What is the order of execution in transformer stage ?

8) How will you implement aggregation logic without aggregator stage in Datastage ?

9) How to drop copy records from Change Capture stage ?

10) What is pivot index ?

11) How to pass output of unix command to a datastage job as a parameter ?

12) How to run a job A if another job B fails ?

13) How to schedule jobs using Director ?

14) How to unlock a job ?

15) How to create a user defined variable on project level ?

Questions on Debugging and Performance Tuning

There is a sequence which is running for 5 minutes every day but one day it will take more than 5 hours how you will debug in support environment?

How to trace-out the issues?

What kind of the job failures you will get in the project and how do you fix it?

Various Environment Variables defined at System level.

How to kill the job without using datastage director.

File to DB, If DB Connection aborts – Next steps, restart mechanism?

Miscellaneous Questions

How to create a connection in db2 database?
UNIX command to run DS job?
Explain SCD2?
Any Complex Project and job?
What kind of errors faced in your real time projects?
By which schema you used in your projects, briefly explain?
What kind of data issues you faced in the project with example?
What is Message Queue?
What are types of XML stage and how to parse data , What is XML stage?
How to Read ASCII and BINARY files?
Change Capture Stage functionality?
Lookup Stage properties, different options - drop, reject, continue operations based on lookup outcome on particular record?
Difference between Local and Shared containers?
Sequence Jobs - Terminator activity stage property - Send STOP/Abort signal if any jobs in flow fail.
Sequencers and different Stages available?
Hands on experiences on Routines

The ETL Help Guide

Saturday, 6 January 2018

Frequently Asked Questions - Datastage

About Me

Total Pageviews

Saturday, 6 January 2018

Frequently Asked Questions - Datastage

About Me

Total Pageviews

Subscribe To