Does sdlc changes when you use teradata instead of Oracle?
What is active data warehousing?
Active Data Warehouse is the DWH that enables the decision makers of the organisation to understand the trends and patterns of the market (by analysing the captured transactional data in the DWH) and thereby helping them to arrive at an INFORMED future decision(s).
Thanks.
Alpesh
Active Data Warehousing (ADW)The technical ability to capture transactions when they change, and integrate them in to the warehouse - along with maintaining batch or scheduled cycle refreshes.
Can we apply rank and sort at a time on a single report
ranking has priority than sorting .we apply ranking on dimension based on the measure like if you want to display top 100 sales revenue generated by a product or bottom 100 sales generated by a product.sorting is not based on measure or anything it will sort in
ascending ,descending or custom order
If we think logically what is the purpose of sorting, if its been ranked already, Ranking will sort the rows from higher to lowest order(Descending order). Does it make sense :)
Thanks
Rex
How will you know the version of bo using designer?
Go to help and click About Designer in help, you will find it.
Cheers
Sithu
What are variable ports and list two situations when they can be used?
Variable Ports usually carry intermediate data (values) and can be used in Expression transformation.
variable port is used to break the complex expression into simpler
and also it is used to store intermediate values
In Star schema, the data in the dimensional table is denormalized where as data in the Snow flake schema, it is normalized
In any schema fact table is in normalized form . it has reference keys to reference dimensional tables. Hence it is normalized
What is data warehosuing hierarchy?
hierarchy is an ordered series of related dimension objects grouped together to perform the multidimensional analysis.
Multidimensional analysis is a technique to modify the data,so that the data can be viewed from different perspectives and at different levels
A related group of dimentions grouped together to perform multi dimentional analysis is known as Hierarchy.
Why not just use an inline select statement in your sql qualifier to eliminat the overhead of a lookup transformation all together?
Connected and Unconnected lookups have their own pros and cons.Unconnected lookup is designed for a specific purpose.The task that an unconnected lookup does can be done by a connected lookup also. Th...
Which objects are required by the debugger to create a valid debug session?
Informatica server must run,,
use following
expression in update strategy transformation.
iif ( condition 1 , DD_INSERT,
condition 2 , DD_UPDATE,
condition 3 , DD_DELETE,
DD_REJECT -- for default)
and set session property as data driven, You can cross check these conditions values during session debug runs.
In general DWH scenario, we create TABLE_REJECT or FILE_REJECT to keep a check of the rejected records. For all the updates happening we keep timestamp or some other flag in the database.Before t...
How to join two tables without using the joiner transformation.
if both the tables are relational we can join both the tables using sql override but if one table is relational and another table is a flat then we've to use joiner transformation.
if both the tables are available in same database by using source qualifier we can join i.e by using customized source qualifier
if the tables are available in different database by using lookup we can join i.e one is source table and other is lookup table
Briefly explian the versioning concept in power center 7.1.
When you create a version of a folder referenced by shortcuts, all shortcuts continue to reference their original object in the original version. They do not automatically update to the current folder...
Hi manoj,
I appreciate ur response ,But can u be a bit clear
thanks
sri
What is the metadata extension?
Informatica Metadata Exchange (MX) provides a set of relational views that allow easy SQL access to the Informatica metadata repository. The Repository Manager generates these views when you create or...
Hi Ravi ,
If possible please elaborate on Informatica SuperGlue.
Regards
Punu
What is data validation strategies for data mart validation after loading process
Validation Strategies Data validation strategies are often heavily influenced by the architecture for the application. If the application is already in production it will be significantly harder to bu...
Two universes can be link Dynamically by using "Link Parameter".
In join is a relational operator, It can connect between two column. In universal join to merge two universe between same column
Pls tell me in which situation context and alias r going to use?
Context is basically used when we will have conformed dimension in the loop........
context is nothing but specifying the name of the path
the place where u cannot resolve the loops with allias there comes concept of context
alias is nothing but the duplicate name of some other table
Designing a normalized schema is known as data modeling
Designing a de-normalized schema is known as dimension modeling
Data Modelling is designing of the data content and structure of the database. The data model documents the structure of and interrelationships between the data - it is presented as a combinatio...
What is the difference between etl tool and olap tools
ETL TOOL (eg:Informatica, Data stage, Abinitio..)Â is for EXTRACTING data from a source systems(a system where client's transactional business information resides) and performing business logic(applyi...
ETL tool is used for Extration ,Transformation(filtering) and loading the data into data warehouse .
OLAP tools are used to analitical purpose basically for report generation i.e to retrieve the data from dataware,OLAP is multidimensional model.
How do you connect the facts and dimensions in the tables
Surroget key.
What is ods (operation data source)
ODS is nothing but operational data source. It contains most recent data .it contains around 30-60 days data It is placed in between source system and staging area.. reports can also be taken in ODS .
Operational Data Store (ODS): An ODS is an integrated database of operational data. Its sources include legacy systems and it contains current or near term data. An ODS may contain 30 to 60 days of information, while a data warehouse typically contains years of data.
it is before staging
What are the diffrences between business objects 5.1 & 6.5 ?
Any body please help me with bo's diffrences
Main Diff Bet 5.1 & 6.5v - Cluster Concept:In BO 6.5v architecture ClusterManager & Cluster Nodes are there. "Cluster" means group of Cluster Nodes."Cluster Manager" means group of C...
Hi Sithu,I am looking at the PDF document "businessobjects_xi_whats_new.pdf" (goto Page 10) downloaded from the BusinessObjects site. A sample info found on this pdf:----------------------------------...
Answered by: shivanand
Member Since Nov-2005 | Answered On : Nov 12th, 2005
Repository means set of database tables, Business object store security information e.g user, group, access permission, user type etc. , universe information e.g. objects, classes, table name, column name, relation ship etc.and document information.
BO repository is a set of relational data structures stored on a database. It is the heart of business objects. I t can also be called central management server.repository contains reports of BO.
There is no concept or Domains from XI R2. Everything will be viewed as metadata though all these information is stored in the database tables.
Domain concept ends with the 6x only.
Thanks.
What are the types of dimension tables
There are following types of dimensional tables: 1.Slowly changind dimensions(SCD) a)SCD-1:stores only current values b)SCD-2:stores both previous and current values c)SCD-3:stores cu...
The Dimensions are Confirmed Dimension, Junk/Dirty Dimension, Degenerated Dimension and Slowly Changing Dimensions(SCD)-(Under slowly changing dimensions we have 6 types, they are Type1, Type2, Type3, Type4, Type5 and Type6).
What does the expression n filter transformations do in Informatica slowly growing target wizard?
You can use the Expression transformation to calculate values in a single row before you write to the target. For example, you might need to adjust employee salaries, concatenate first and last names, or convert strings to numbers.
Expression finds the Primary key is or not, and calculates new flag
Based on that New Flag, filter transformation filters the Data
Cheers,
Sithu
What is surrogate key ? Where we use it explain with examples?
Answered by: saravanan
Answered On : Dec 2nd, 2005Surrogate key is a unique identification key, it is like an artificial or alternative key to production key, bz the production key may be alphanumeric or composite key but the surrogate key is always single numeric key. Assume the production key is an alphanumeric field if you create an index for this fields it will occupy more space, so it is not advisable to join/index, bz generally all the datawarehousing fact table are having historical data. These factable are linked with so many dimension table. if it's a numerical fields the performance is high
Surrogate key is a numeric or integer value in a table to identify each row uniquely. Primary key and surrogate key are same but surrogate key is a system generated value has a define incremental valu...
Surrogate key and primary key both are same, but point of application is different. Primary key is part of transactional system but surrogate key is part of analytical system. Surrogate key is used...
A cube can be stored on a single analysis server and then defined as a linked cube on other analysis servers. End users connected to any of these analysis servers can then access the cube. This arrangement avoids the more costly alternative of storing and maintaining copies of a cube on multiple analysis...
Hi All,
Could you please let me know what is Replicate Cube & Transparent Cube?
Thanks & regards,
Amit Sagpariya
Hi All,
Could you please let me know what is Replicate cube &
Lets you set the user identification and properties for the administrator of the Broadcast Agent. Broadcast Agent is a BusinessObjects product that handles the scheduled or batch processing of documen...
broadcast agent is used for scheduling or bursting the reports.thru this,you can schedule your reports so that end-users can access reports thru internet.you have broadcast administration console here...
What are non-additive facts in detail?
Non-additive facts are facts that cannot be summed up for any of the dimensions present in the fact table
Non-Additive: Non-additive facts are facts that cannot be summed up for any of the dimensions present in the fact table.
What is the difference between connected and unconnected stored procedures.
Run a stored procedure before or after your session.Unconnected Run a stored procedure once during your mapping, such as pre- or post-session.Unconnected Run a stored procedure every time a ...
Unconnected:The unconnected Stored Procedure transformation is not connected directly to the flow of the mapping. It either runs before or after the session, or is called by an expression in another t...
Compare data warehousing top-down approach with bottom-up approach
Top-Down approach means first create the datawarehouse for entire organization then divide it into for each specific
subject are called datamarts
Bottom-Up approach means first create the data marts then by combining all data marts create the datawarehouse
Top Down-------------To build an mammoth enterprise data warehouse iteratively considering all enterprise needs and then feed individual departmental data marts.Bottom Up------------To build ...
Can we modify the data in flat file?
The update transformation/strategy uses the concept of keys, in order to update the target table. As flat file do not have any Keys defined on it, it is not possible to update the target,. which is a flat file.
You can modify the flat file using shell scripting in unix ( awk, grep, sed ).
Hope this helps.
to copy a table along with records
create table
select *from
if we want to copy only structure
create table < new tablename>
select *from
Yes , we can copy the table structure. create table
What is the drill up , drill down, drill by , drill trough ?
Drill up : one level up in the hirearchydrill down: one level down in the hirearchydrill by : direct selection of level in the hirearchydrill trough : to drill data from one hirearchy to another hirearchy
Drill up means up in the hierarchy.
Month:-Quarter:-Year
Drill down means down in the hierarchy
Year:-Quarter:-Month
Drill through means you can drill through to the database directly from drill mode and get the required data
Drill by means you can select one hierarchy to another hierarchy
Drill up : UP-one level
Drill down: DOWN-one level
Drill by : selection of level Hierarchy
Drill trough : Hirearchy to another Hirearchy
Cheers,
Sithu
What is the main differnce between schema in RDBMS and schemas in datawarehouse....?
User It is just an account name for the user Owns a schema Have the same name Create user command, create the user User can be considered schema User can created without schema It is the account for t...
Diff b.w OLTP and OLAP :------------------------OLTP Schema :* Normalized * More no.of trans* Less time for queries execution* More no.of users* Have Insert,delete and update trans. OLAP (D...
Compare data warehouse database and oltp database?
The data warehouse and the oltp data base are both relational databases. However, the objectives of both these databases are different. The oltp database records transactions in real time and aims to automate clerical data entry processes of a business entity. Addition, modification and deletion of data...
In simple words,
An OLTP system is used to get data into the system.
A datawarehouse system is used to get data out of the system.
-Ritu
data ware house is summarized,subject oriented,non volatile,historic, time invarient (only insertion's no updations) , denormalized. one, which support OLAP process.where as database support's detail...
What is the difference between data warehouse and online analytical processing?
Ralph kimball the co-founder of the data warehousing concept has defined the data warehouse as a “"a copy of transaction data specifically structured for query and analysis”. Both definitions highlight specific features of the data warehouse. The former definition focuses on the structure and organization...
OLAP--- Using the online entered data is used for analytical processing.
Data warehouse is the place where the data will pulled and kept in one place which will not change frequently and is also used for analytical processsing.
A data warehouse is a database geared towards the business intelligence requirements of an organisation. The data warehouse integrates data from the various operational systems and is typically loaded...
Online analytical processing a tool to evaluate and analyze the data in the data warehouse using analytical queries. A tool which helps organize data in the data warehouse using multidimensional models of data aggregation and summarization. Supports the data analyst in real time and enables online analysis...
OLAP is a concept of analyzing the data present in datawarehouse. i.e. the data should be in denormalized form,integrated,non-volatile,time varient. Then we can make use OLAP better.
for better decision making process and to increase the profitability of business.
A tool to evaluate and analyze the data in the data warehouse using analytical queries. A tool which helps organize data in the data warehouse using multidimensional models of data aggregation and sum...
Data warehouse data from different data sources is stored in a relational database for end use analysisdata from different data sources is stored in a relational database for end use analysis data is organized in summarized, aggregated, subject oriented, non volatile patterns. Data is a data warehouse...
DWh is (6 important points about DWH) :
1. Dedicated Data Base
2. Integrated Data
3. Subject Oriented
4. Time Variant
5. Nonvolatile
6. Decision Support
Idan Halfon
Okay there are two camps on this Bill Inmon which is Date and Coddbased database design that has grown into the ODS reporting DSS database and of course Ralph Kimball whose mythology comprises of a st...
A program running in the database that can take complex actions based on the inputs you send it. Using a stored procedure is faster than doing the same work on a client, because the program runs right inside the database server. Stored procedures are nomally written in PL/SQL or Java.
Stored Procedure is a re-usable program which is created once and reused n times .
It stores the procedure at database level
It provides security, better performance.
Stored Procedure is an already written sql statements that is saved in database.
The design method consists of two major phases. During the first phase, you create the underlying database structure of your universe. This structure includes the tables and columns of a database and the joins by which they are linked. You may need to resolve lOOPS which occur in the joins using aliases...
Universe Design as follows:
Define Parameters -> Insert Database Schema (tables, columns..etc)->Make the Joins->Resolve the loops->create classes and Objects->Hierarchies.
Then test the universe for integraty.
Cheers,
Ravi Kumar Garre
ravi_kumar_garre@yahoo.co.in
The design method consists of two major phases.During the first phase, you create the underlying database structure of your universe. This structure includes the tables and columns of a database and t...
Designer is a businessobjects is module used by universe designers to create and maintain universes. Universes are the semantic layer that isolates end users from the technical issues of the database structure. Universe designers can distribute universes to end users by moving them as files through the...
Designer
Sithu
Its a mapping of business requirements with the database structures in the database.
A work mode whereby a designer creates universes in an environment without a repository. The mode in which a universe is saved determines whether other designers are able to access it. By default, a universe is saved in the mode in which the designer is already working.
Designer lets you save universes in either enterprise or workgroup mode.Enterprise mode means that you are working in an environment with a repository. Workgroup mode means that you are working withou...
An object created by the end user from the businessobjects user module. A user object can be inserted into a universe by the designer.
Insert a user object
1 Select the User Objects command from the Insert menu.
The Insert User Objects dialog box is displayed.
2 Click a user object file.
A user object file has a .udo extension
3 Click the Open button.
Cheers,
Sithu
The area of the repository that holds exported universes. The universe domain makes it possible to store, distribute, and administrate universes. There may be multiple universe domains in a repository.
this holds the logical arrangement of facts/diemensions and mesures
Cheers,
Sithu
A mapping of the data structure found in databases: tables, columns, joins, etc. A universe, which is made up of classes, objects, and conditions, can represent any specific application, system, or group of users.
An universe is an symanteic layer between the user and the Designer which contains connection of a database
Universe is a Semantice layer that maps the data structure found in databases (tables, columns, views..etc) to Business Terms.
Cheers,
Ravi Kumar Garre
ravi_kumar_garre@yahoo.co.in
A component within a class that groups objects. A subclass can itself contain other subclasses or objects.
A component with in a class that groups the object
Cheers,
Sithu
The graphical component within the main designer window that shows the schema of the universe. It reflects the underlying database structure of the universe.
Insert tables in the Structure pane1 Select the Tables command from the Insert menu.2 In the Table Browser, click the tables you want to add to the universe structure.To select several conti...
Scripts that automatically extract structural information about tables, columns, joins, or cardinalities from a database. Designer provides default strategies but a designer can also create strategies. These are referred to as external strategies.
A strategy is a script that automatically extracts structural information from a database. Designer provides a number of default strategies you can use. These are strategies for extracting joins, dete...
external
A shared connection is used to access common resources such as universes or documents. It can be used by several users. Information about a shared connection is stored in a sdac.Lsi or sdac.Ssi file; its definition is updated dynamically.
The secured connections are used to access universe or document. It can be used by several users.
Cheers,
Sithu
A secured connection is used to access universes or documents that may be restricted or confidential. It can be shared by several authorized users. Stored in the repository, the definition of a secured connection is updated dynamically.
The secured connections are used to access universe or document that may be restricted or confidential
Cheers,
Sithu
A wizard in the designer module that provides guided instructions for creating a basic universe. It lets a designer name a universe, set up a connection to a database, select strategies, create classes and objects, as well as generate joins with cardinalities.
If you click the File/New starts Quick Design wizard check box in the General tab (Tools menu, Options command) the New command automatically launches the Quick Design wizard.
Cheers,
Sithu
In designer a query is a technique for creating or modifying a list of values associated with an object. From the query panel, a designer builds a query from the classes, objects, and conditions of a universe. In the businessobjects user module, a query is a type of data provider. An end user builds...
the values what exactly the user need is query
Cheers,
Sithu
A property of an object that determines how it can be used in multidimensional analysis. An object can be qualified as one of three types: a dimension, detail or measure.
The qualification of an object reveals how it can be used in multidimensional analysis. An object can be qualified as a dimension, a detail, or a measure. In the Universe window, the symbol besid...
A personal connection is used to access resources such as universes or documents. It can be used only by the user who created it. Information about a personal connection is stored in both the pdac.Lsi and pdac.Ssi files; its definition is static and cannot be modified.
Personal connections are used to access resources such as universe or document
Cheers,
Sithu
The work mode in which the designer works with universes stored locally.
The designer can design the universe in offline mode for that you need to login at least once
Cheers,
Sithu
An object that is derived from an aggregate function. It conveys numeric information by which a dimension object can be measured.
An object derived from aggregate function, that is know as mesure object
Cheers,
Sithu
A situation that occurs when more than one path exists from one table to another in the universe.
A loop is a situation that occurs when more than one path exists from one table to another. Loops result in ambiguity in the design of a universe.
Designer enables you to identify loops in one of two ways:
alies
Contexts
Cheers,
Sithu
Loop is a logical and pictorial representation of tables joined in such a way that the join's path begin and end with the same table.
Usually such a table would return less number of rows and it is usually the intersection condition values.
Loops are resolved by Alias Table.
A list of values contains the data values associated with an object. These data values can originate from a corporate database, or a flat file such as a text file or excel file. In designer you create a list of values by running a query from the query panel. You can then view, edit, purge, refresh and...
A list of values contains the data values associated with an object. These data values can originate from a corporate database, or a flat file such as a text file or Excel file. In Designer you create...
A relational operation that causes two tables with a common column to be combined into a single table. Designer supports equi-joins, theta joins, outer joins, and shortcut joins.
An ordered series of related dimensions used for multidimensional analysis.
Hierarchy is an arrangement of objects for report to report interactions [RRI]. It is used for drill-down and drill-up operations in universe design.
a series of dimensions used for analysis based on user requierments
Cheers,
Sithu
A work mode whereby a designer creates universes in an environment with a repository. The mode in which a universe is saved determines whether other designers are able to access it. By default, a universe is saved in the mode in which the designer is already working.
Within BusinessObjects enterprise, user names can be authenticated against one of more security databases. The default authentication source is Enterprise. However, if your company uses Windows NT, for example then NT user name and password can be used to log into Designer.
Cheers,
Ravi Kumar Garre
Enterprise mode means that you are working in an environment with a repository. Workgroup mode means that you are working without a repository. The mode in which you save your universe determines whether other designers are able to access them.
Cheers,
Sithu
The area of the repository that stores documents, templates, scripts, and lists of values.
The area of the repository that stores documents, templates, scripts, and lists of values.
all the document informations are stored here,
Cheers,
Sithu
An object being tracked in multidimensional analysis; the subject of the analysis. Dimensions are organized into hierarchies.
A dimension is an object that is a focus of analysis in a query. A dimension maps to one or more columns or functions in the database that are key to a query. For example Country, Sales Person, Products, or Sales Line.
A Dimension object is typically textual information by which users analyze the numeric measures, such as product, region, or month. A dimension object often comes from the lookup or reference tables w...
A method by which designer can decide which path to choose when more than one path is possible from one table to another in the universe.
Context : defined as a group of joins shares common business purpose ...
Context: Intelligence to generate SQL.
Contexts are a collection of joins which provide a valid query path for Web
Regards,
Siva
Set of parameters that provides access to an RDBMS. These parameters include system information such as the data account, user identification, and path to the database. Designer provides three types of connections: secured, shared, and personal.
BO Designer supports three different types of connections, each denoted with a unique symbol in the list of connections:1. Personnal connections that users may create for freehand SQL or that universe...
A connection is a set of parameters that provides access to an RDBMS. These parameters include system information such as the data account, user identification, and the path to the database.
Cheers,
Sithu
A component that controls the type and the amount of data returned by a specific object in a query. A condition created in the designer module is referred to as a predefined condition.
Lets you create a condition. The conditions that you define appear within the Universe window. From the Query Panel, a user can drag the conditions from the Classes and Objects box to the Conditions box.
Cheers
Sithu
A logical grouping of objects and conditions within a universe. In general, the name of a class reflects a business concept that conveys the category or type of objects.
A Category is grouping of certain entities [reporting objects]. A class is a category of reporting objects. A object is an instance of class or a reporting field [entity]. A class can be further divid...
Classes are folders that serve to organize data objects according to common business need. A class can contain any type of object, including another class, (i.e. subclass). The number of classes and s...
A situation in which a query includes two or more tables that are not linked by a join. If executed, this type of query retrieves all possible combinations between each table and may lead to inaccurate results.
Table A having m records and Table B having n records then the cartesian prodcut is mXn.
Regards,
Ravi Kumar Garre
ravi_kumar_garre@yahoo.co.in
Cartesan product is the result of a query when the two tables are not joined by any type of join. All the possible combinations of one table are displayed with the other table just like a cartesan product in algebra
Expresses the minimum and the maximum number of instances of an entity b that can be associated with an instance of an entity a. The minimum and the maximum number of instances can be equal to 0,1, or n.
Cardinality describes a join between 2 tables by stating how many rows of one table will match with rows of another table. It is very important to detect join problems and to create the contexts to co...
Cardinality means "How many instances of an Entity (on one side of the join) are related to the how many instance of another entity (on the other side of the join)"
A logical pointer to an alternate table name. The purpose of an alias is to resolve lOOPS in the paths of joins.
An alias is identical to base table with different name. The data in alias is exactly same as original table/but the different name tricks SQL into accepting that we are using 2 different tables. ...
Actually, What happens in the back end (i.e) in database end, while alias is performed, whether it points to one table or too many table. How the performance will be in universe level.
Please help me by explaining with this concept correctly, because i faced such question by interviewer.
Answered by: Praveen
Answered On : Aug 15th, 2005# Additive: Additive facts are facts that can be summed up through all of the dimensions in the fact table.
# Semi-Additive: Semi-additive facts are facts that can be summed up for some of the dimensions in the fact table, but not the others.
# Non-Additive: Non-additive facts are facts that cannot be summed up for any of the dimensions present in the fact table.
Non additive facts are facts that cannot be summed up for any of the dimensions present in the fact table.
A fact that cannot be logically added between records, such as average, and that must be combined in a computation with other facts before being added across records.
Ans A) is correct
Each Foreign Key must exist as a Primary Key.
Foreign Keys can change values over time.
Cheers.
Sithu
Which two statements are true about a foreign key?
Each foreign key must exist as a primary key.Foreign keys can change values over time.
The utilities are in teradata as follows
Bteq for loadind/Unloading the data
Fload for load the data only one table at a time it must be empty
mload for load the data which is empty or populated table up to 5 tables at a time
fexport to export the data
tpump to load the data upto 64 tables
Both statements are TRUE
What is a common data source for the central enterprise data warehouse?
Operational data stores
ODS (Operational Data sources) or Datamarts
What is an alias and how does it differ from a synonym?
An alias is an alternative to a synonym, designed for a distributed environment to avoid having to use the location qualifier of a table or view. The alias is not dropped when the table is dropped.
Sysnonym can be used for the user who created it. But alias can be used for any users. Synonym is dropped when base table got dropped but alias will not get dropped. Synonym is recorded in the sys.synonym table and alias is recorded in sys.tables.
An alias is a logical pointer to an alternate table name. The purpose of an alias is to resolve loops in the paths of joins. In some cases, more than one alias may be necessary for a given table.
Cheers,
Sithu
What do the initials DDL and dml stand for and what is their meaning?
DDL is data definition language and dml is data manipulation language. DDL statements are create, alter, truncate. Dml statements are select, insert, delete and update.
DDL - Data Definition Language - Used to create/alter/delete Database objects
DML- Data Manipulation Language - Used to insert/delete/update/select the data in the database obects(table/views...)
DDL(data definition language)DDL Statement are · Create· Alter and· &...
How can u work with remote database in Informatica?Did u work directly by using remote connections?
To work with remote datasource u need to connect it with remote connections.But it is not preferable to work with that remote source directly by using remote connections .Instead u bring that source into u r local machine where Informatica server resides.If u work directly with remote source the session...
You can work with remote,
But you have to
Configure FTP
Connection details
IP address
User authentication
Cheers
Sithu
What is power center repository?
The powercenter repository allows you to share metadata across repositories to create a data mart domain. In a data mart domain, you can create a single GLobal repository to store metadata used across an enterprise, and a number of local repositories to share the GLobal metadata as needed.
Standalone repository. A repository that functions individually, unrelated and unconnected to other repositories. Global repository. (PowerCenter only.) The centralized repository in a domain, a group...
What r the types of metadata that stores in repository?
Following r the types of metadata that stores in the repository database connections GLobal objects mappings mapplets multidimensional metadata reusable transformations sessions and batches short cuts source definitions target defintions transformations
Source definitions. Definitions of database objects (tables, views, synonyms) or files that provide source data. Target definitions. Definitions of database objects or files that contain the target da...
Define Informatica repository?
The Informatica repository is a relational database that stores information, or metadata, used by the Informatica server and client tools. Metadata can include information such as mappings describing how to transform source data, sessions indicating when you want the Informatica server to perform the...
It is an RDBMS.
It keeps metadata. (information about tha data)
It keeps only logical informatoin. No physical data inside the repository.
Infromatica Repository:The informatica repository is at the center of the informatica suite. You create a set of metadata tables within the repository database that the informatica application and too...
What is difference between maplet and reusable transformation?
Maplet consists of set of transformations that is reusable.A reusable transformation is a single transformation that can be reusable. If u create a variables or parameters in maplet that can not be used in another maping or maplet.Unlike the variables that r created in a reusable transformation can be...
Mapplet is a group of reusable transformation.The main purpose of using Mapplet is to hide the logic from end user point of view...It works like a function in C language.We can use it N number of times.Its a reusable object.
Reusable transformation is a single transformation.
Maplet: one or more transformations
Reusable transformation: only one transformation
Cheers,
Sithu
Parameter file is to define the values for parameters and variables used in a session.A parameter file is a file created by text editor such as word pad or notepad. U can define the following values in parameter file maping parameters maping variables session parameters
When you start a workflow, you can optionally enter the directory and name of a parameter file. The Informatica Server runs the workflow using the parameters in the file you specify. For UNIX shell us...
Can u generate reports in informatcia?
Yes. By using metadata reporter we can generate reports in Informatica.
Yes.
Yes. It is Possible through Metadata Reporter
What r the different types of type2 dimension maping?
Type2 dimension/version data maping: in this maping the updated dimension in the source will gets inserted in target along with a new version number.And newly added dimension in source will inserted into target with a primary key. Type2 dimension/flag current maping: this maping is also used for slowly...
What r the types of maping in getting started wizard?
Simple pass through maping : loads a static fact or dimension table by inserting all rows. Use this mapping when you want to drop all existing data from your table before loading new data. Slowly growing target : loads a slowly growing fact or dimension table by inserting new rows. Use this mapping to...
1. Simple Pass through2. Slowly Growing TargetCheers,Sithu
What r the types of maping wizards that r to be provided in Informatica?
The designer provides two mapping wizards to help you create mappings quickly and easily. Both wizards are designed to create mappings for loading and maintaining star schemas, a series of dimensions related to a central fact table. Getting started wizard. Creates mappings to load static fact and dimension...
Inf designer :Mapping -> wizards --> 1) Getting started -->Simple pass through mapping &nb...
Slowly Growing Target
Slowly Changing the Dimension
Type1
Most recent values
Type2
Full History
Version
Flag
Date
Current and one previous
The Informatica server follows instructions coded into update strategy transformations with in the session maping determine how to flag records for insert, update, delete or reject. If u do not choose data driven option setting,the Informatica server ignores all update strategy transformations in the...
When Data driven option is selected in session properties it the code will consider the update strategy (DD_UPDATE,DD_INSERT,DD_DELETE,DD_REJECT) used in the mapping and not the options selected in the session properties.
The Informatica Server follows instructions coded into Update Strategy transformations within the session mapping to determine how to flag rows for insert, delete, update, or reject. If the mapping fo...
What is update strategy transformation ?
This transformation is used to maintain the history data or just most recent changes in to target table.
Update strategy transformation is used for flagging the records for insert, update, delete and reject In Informatica power center u can develop update strategy at two levels use update strategy T/...
The model you choose constitutes your update strategy, how to handle changes to existing rows. In PowerCenter and PowerMart, you set your update strategy at two different levels: Within a session. Whe...
What is the default join that source qualifier provides?
Inner equi join.
When you join related tables in one Source Qualifier transformation, the Integration Service joins the tables based on the related keys in each table. This default join is an inner equijoin, using the...
Default join is Normal Join/Inner Join/Equi-Join
What r the tasks that source qualifier performs?
Join data originating from same source data base. Filter records when the Informatica server reads source data. Specify an outer join rather than the default inner join specify sorted records. Select only distinct values from the source. Creating custom query to issue a special select statement for the...
Join data originating from the same source database. You can join two or more tables with primary-foreign key relationships by linking the sources to one Source Qualifier. Filter records when the Info...
What is source qualifier transformation?
When u add a relational or a flat file source definition to a maping,u need to connect it to a source qualifer transformation.The source qualifier transformation represnets the records that the Informatica server reads when it runs a session.
It is a transformation which is automatically generated to read the data from source tables into informatica designer.
Source Qualifier is a transformation with every source definiton if the source is Relational Database.Source Qualifier fires a Select statement on the source db.With every Source Definition...
Why we use stored procedure transformation?
For populating and maintaining data bases.
can you give me a real time scenario please?
we use a stored procedure transformation to execute a stored procedure which in turn might do the above things in a database and more.
What r the types of groups in router transformation?
Input group output group the designer copies property information from the input ports of the input group to create a set of output ports for each output group. Two types of output groups user defined groups default group u can not modify or delete default groups.
A Router transformation has the following types of groups: Input Output Input Group The Designer copies property information from the input ports of the input group to create a set of output ports for...
Input group contains the data which is coming from the source.We can create as many user-defined groups as required for each condition we want to specify.Default group contains all the rows of data that doesn't satisfy the condition of any group.
What is the router transformation?
A router transformation is similar to a filter transformation because both transformations allow you to use a condition to test data. However, a filter transformation tests data for one condition and drops the rows of data that do not meet the condition. A router transformation tests data for one or...
Note:- i think the definition and purpose of Router transformation define by sithusithu, sithu is not clear and not fully correct as they of have mentioned <A Router transformation tests data for o...
A Router transformation is similar to a Filter transformation because both transformations allow you to use a condition to test data. A Filter transformation tests data for one condition and drops the...
What is the rankindex in ranktransformation?
The designer automatically creates a rankindex port for each rank transformation. The Informatica server uses the rank index port to store the ranking position for each record in a group. For example, if you create a rank transformation that ranks the top 5 salespersons for each quarter, the rank index...
Based on which port you want generate Rank is known as rank port, the generated values are known as rank index.
Cheers,
Sithu
Which transformation should we use to normalize the cobol and relational sources?
Normalizer transformation. When u drag the cobol source in to the mapping designer workspace,the normalizer transformation automatically appears,creating input and output ports for every column in the source.
The Normalizer transformation normalizes records from COBOL and relational sources, allowing you to organize the data according to your own needs. A Normalizer transformation can appear anywhere in a ...
What r the types of lookup caches?
Persistent cache: u can save the lookup cache files and reuse them the next time the Informatica server processes a lookup transformation configured to use the cache. Recache from database: if the persistent cache is not synchronized with he lookup table,u can configure the lookup transformation to rebuild...
There are 2 important caches. 1. Static : By default integration service chooses statyic cache.It is read only cache . when you first query the lookup table, it caches the table. whenever a row com...
Caches are of following types:
1. Static Cache
2. Dynamic Cache
3. Persistent Cache
4. Non-persistent Cache
5. Shared Cache
6. Un-shared Cache
More types to follow...:-)
Connected and unconnected
hey, what zaki told was correct.
There are only two type: connected and unconnected.
Mostly we use cache concept for connected, also only connected lkp will support SOME cache options.
hello boss/madam
only two types of lookup are there they are:
1) Connected lookup
2) Unconnected lookup.
I don't understand why people are specifying the cache types, I want to know that now a days caches are also taken into this category of lookup.
If yes do specify on the answer list
thankyou
What is the look up transformation?
Use lookup transformation in u’r mapping to lookup data in a relational table,view,synonym. Informatica server queries the look up table based on the lookup ports in the transformation.It compares the lookup transformation port values to lookup table column values based on the look up condition.
Lookup transformation in a mapping is used to look up data in a flat file or a relational table, view, or synonym. You can import a lookup definition from any flat file or relational database to which...
Use a Lookup transformation in your mapping to look up data in a relational table, view, or synonym. Import a lookup definition from any relational database to which both the Informatica Client and Server can connect. You can use multiple Lookup transformations in a mapping.
Cheers,
Sithu
When a joiner transformation occurs in a session, the Informatica server reads all the records from the master source and builds index and data caches based on the master rows. After building the caches, the joiner transformation reads records from the detail source and perform joins.
For version 7.x and above :When the PowerCenter Server processes a Joiner transformation, it reads rows from both sources concurrently and builds the index and data cache based on the master rows. The...
Specifies the directory used to cache master records and the index to these records. By default, the cached files are created in a directory specified by the server variable $PMCacheDir. If you overri...
What r the join types in joiner transformation?
Normal (default) master outer detail outer full outer
There are 4 Types of Joiner Trasnformations: 1) Normal 2) Master Outer 3) Detail Outer 4) Full Outer Note: A normal or master outer join performs faster than a full outer or detail o...
follw thisIn the Mapping Designer, choose Transformation-Create. Select the Joiner transformation. Enter a name, click OK. The naming convention for Joiner transformations is JNR_TransformationName. E...
What r the settiings that u use to cofigure the joiner transformation?
Master and detail source type of join condition of the join
There are number of properties that you use to configure a joiner transformation are: 1) CASE SENSITIVE STRING COMPARISON: To join the string based on the case sensitive basis. 2) WORKING DIRECT...
the Joiner transformation supports the following join types, which you set in the Properties tab:
Cheers,
Sithu
What r the diffrence between joiner transformation and source qualifier transformation?
U can join hetrogenious data sources in joiner transformation which we can not achieve in source qualifier transformation. U need matching keys to join two relational sources in source qualifier transformation.Where as u doesn’t need matching keys to join two sources. Two relational sources should come...
source qualifier is used for reading the data from the database where as joiner transformation is used for joining two data tables.source qualifier can also be used to join two tables but the conditio...
hi, as per my knowledge, you need matching keys to join two relational sources both in Source qualifier as well as in Joiner transformation. But the difference is that in Source qualifier, both the keys must have primary key - foreign key relation, Whereas in Joiner transformation its not needed.
What is aggregate cache in aggregator transforamtion?
The aggregator stores data in the aggregate cache until it completes aggregate calculations.When u run a session that uses an aggregator transformation,the Informatica server creates index and data caches in memory to process the transformation.If the Informatica server requires more space,it stores...
when server runs the session with aggregate transformation ,it stores data in memory until it completes the aggregation when u partition a source ,the server creates one memory cache and one disk ca...
Aggregate cache contains data values while aggregate calculations are being performed. Aggregate cache is made up of index cache and data cache. Index cache contains group values and data cache consists of row values.
What r the mapping paramaters and maping variables?
Maping parameter represents a constant value that u can define before running a session.A mapping parameter retains the same value throughout the entire session. When u use the maping parameter ,u declare and use the parameter in a maping or maplet.Then define the value of parameter in a parameter file...
Mapping parameter represents a constant value defined before mapping run.
Mapping reusability can be achieved by using mapping parameters.
Mapping variable represents a value that can be changed during the mapping run.
Mapping variable can be used in incremental loading process.
You can use mapping parameters and variables in the SQL query, user-defined join, and source filter of a Source Qualifier transformation. You can also use the system variable $$$SessStartTime. The Inf...
What r the unsupported repository objects for a mapplet?
Cobol source definition joiner transformations normalizer transformations non reusable sequence generator transformations. Pre or post session stored procedures target defintions power mart 3.5 style look up functions XML source definitions ibm mq source defintions
1)Stored procedure preload or postload is not possible.
2)Dont allow the not reusable sequence generator.
3)xml source qualifier
4)Normalizer
5)Nested mapplets
−Normalizer transformations
−Cobol sources
−XML Source Qualifier transformations
−XML sources
−Target definitions
−Pre- and post- session stored procedures
−Other mapplets
-PowerMart 3.5-style LOOKUP functions
-non reusable sequence generator
What r the methods for creating reusable transforamtions?
Two methods 1.Design it in the transformation developer. 2.Promote a standard transformation from the mapping designer.After u add a transformation to the mapping , u can promote it to the status of reusable transformation. Once u promote a standard transformation to reusable status,u can demote it to...
You can design using 2 methods
Cheers
Sithu
PLEASE THINK TWICE BEFORE YOU POST AN ANSWER.Answer: Two methods 1.Design it in the transformation developer. by default its a reusable transform.2.Promote a standard transformation from the mapping d...
What r the reusable transforamtions?
Reusable transformations can be used in multiple mappings.When u need to incorporate this transformation into maping,u add an instance of it to maping.Later if u change the definition of the transformation ,all instances of it inherit the changes.Since the instance of reusable transforamation is a pointer...
1. A reusable transformation can be used in multiple transformations2.The designer stores each reusable transformation as metada separate from any mappings that use the transformatio...
Hai to all friends out therethe transformation that can be reused is called a reusable transformation.as the property suggests it has to be reused:so for reusing we can do it in two different ways 1) ...
What r the active and passive transforamtions?
An active transforamtion can change the number of rows that pass through it.A passive transformation does not change the number of rows that pass through it.
Thats not correct. By this definition Update Strategy should be a passive transformation but it is not.
So correct answer is that any transformation which can change 1. Number of rows 2. Row type i.e. DD_INSERT, DD_DELETE... 3. Transaction control... All these are active ones.
Passive Transformation: When the number of input records to a transformation is equal to the number of output records from the transformation then we call that transformation as Passive Transformation...
It is a repostitory object that generates,modifies or passes data.
A TransFormation Is a Repository Object.
That Generates,Modifies or Passes Data.
The Designer Provides a Set of Transformations That perform Specific Functions.
For Example An AGGREGATOR Transformation Performs Calculations On Groups Of Data.
Transformation is a repository object of converting a given input to desired output.It can generates,modifies and passes the data.
Maplet is a set of transformations that you build in the maplet designer and u can use in multiple mapings.
Basically mapplet is a subset of the mapping in which we can have the information of the each dimension keys by keeping the different mappings created individually. If we want a series of dimension keys in the final fact table we will use mapping designer.
A mapplet is a reusable object that represents a set of transformations. Mapplet can be designed using mapping designer in informatica power center
Which transformation you need while using the cobol sources as source definitions?
Answered by: sithusithu
View all answers by sithusithu
Member Since Dec-2005 | Answered On : Jan 20th, 2006
Normalizer transformation.
Normalizer transformation which is used to normalize the data.Since COBOL sources are often consists of De-normalized data.
Normalizer transformaiton which is used to normalize the data
sirajdeen
Normalizer transformation.
Normalizer transformation which is used to normalize the data.Since COBOL sources are often consists of De-normalized data.
Why we use lookup transformations?
Lookup transformations can access data from relational tables that are not sources in mapping. With lookup transformation, we can accomplish the following tasks:get a related value-get the employee name from employee table based on the employee idperform calculation.Update slowly changing dimension...
Answered by: prodyot Sarkar
Answered On : Jul 31st, 2007The following reasons for using lookups.....
1)We use Lookup transformations that query the largest amounts of data to improve overall performance. By doing that we can reduce the number of lookups on the same table.
2)If a mapping contains Lookup transformations, we will enable lookup caching
if this option is not enabled .
We will use a persistent cache to improve performance of the lookup whenever
possible.
We will explore the possibility of using concurrent caches to improve session
performance.
We will use the Lookup SQL Override option to add a WHERE clause to the default
SQL statement if it is not defined
We will add ORDER BY clause in lookup SQL statement if there is no order by
defined.
We will use SQL override to suppress the default ORDER BY statement and enter an
override ORDER BY with fewer columns. Indexing the Lookup Table
We can improve performance for the following types of lookups:
For cached lookups, we will index the lookup table using the columns in the
lookup ORDER BY statement.
For Un-cached lookups, we will Index the lookup table using the columns in the
lookup where condition.
3)In some cases we use lookup instead of Joiner as lookup is faster than joiner in some cases when lookup contains the master data only.
4)This lookup helps in terms of performance tuning of the mappings also.
Thats a good question, suppose say you have 40 transformations in your mapping which invole complex agg, functions, so in middle of the mapping there is a requirement to get the data from some x t...
Look up tranformations are used to
Get a related value
Updating slowly changing dimension
Caluculating expressions
It represents all data queried from the source.
Answered by: sprajarajan
View all questions by sprajarajan View all answers by sprajarajan
Member Since Mar-2008 | Answered On : Aug 8th, 2008
Source Qualifier Is the default Transformation.
Through The source Qualifier Transformation Informatica Reads The Data.
We can Filter The Data.
We can sort the Data.
Its also Used to Join Homogeneous Source systems.
We can Join Any number of Sources in Singlae Source Qualifier.
We Can't Join the Flatfiles In sourcequalifier Because Flatfiles Are Heterogeneous When we open the Flatfiles At sourcequalifier At the time All The options are Disabled.
1. Source Qualifier is the most important transformation which convert the source data type in to compatible NATIVE datatype of a mapping.2. Without a SQ a mapping can not be created, after extractio...
Source Qualifier is one of the transformations, which converts any type of data to the relational format. So that, can easily be used for any of the operations on the data.
What is router transformation?
Router transformation allows you to use a condition to test data. It is similar to filter transformation. It allows the testing to be done on one or more conditions.
Router transformation can be said as an extension of filter transformation. Filter allows you to check for a single condition and rejects rows which does not meet the condition.But Router transformati...
Router transformation allows you to use a condition to test data. It is similar to filter transformation. It allows the testing to be done on one or more conditions.
What are various types of aggregation?
Various types of aggregation are sum, avg, count, max, min, first, last, median, percentile, stddev, and variance.
Answered by: sithusithu
View all answers by sithusithu
Member Since Dec-2005 | Answered On : Jan 19th, 2006
Cheers,
Sithu
Sum, Count, Average, Median
What are aggregate transformation?
Aggregator transformation allows you to perform aggregate calculations, such as averages and sums.
Answered by: Praveen vasudev
Answered On : Sep 12th, 2005Aggregator transform is m uch like the Group by clause in traditional SQL.
this particular transform is a connected/active transform which can take the incoming data form the mapping pipeline and group them based on the group by ports specified and can caculated aggregate funtions like ( avg, sum, count, stddev....e.tc) for each of those groups.
From a performanace perspective if your mapping has an AGGREGATOR transform use filters and sorters very early in the pipeline if there is any need for them.
veepee
1. Aggrigator transformation allows to perform aggrigate calculation, such as SUM, MAX, MIN, FIRST, LAST
2. Aggrigator transformation allows to perform aggrigate calculation of group.
To perform Group by calculations we use Aggregator Transformation.It perform calculations similar to Expression Transformation.But difference between both is that Aggregator Transform...
How do you reference a parameter?
In PL/SQL, you can reference and set the values of form parameters using bind variables syntax. Ex. Parameter name = '' or :block.Item = parameter parameter name
When you start a workflow, you can optionally enter the directory and name of a parameter file. The Informatica Server runs the workflow using the parameters in the file you specify. For UNIX shell us...
the set of objects owned by user account is called the schema.
Schema is a set of objects own by a Particular user..
Schema is collection of database object.
A stored procedure ia s named collection of SQL statements and procedural logic that is compiled, verified and stored in a server database. It is typically treated like any other database object. Stored procedures accept input parameters so that a single procedure can be used over the network by multiple...
A stored procedure is a precompiled collection of Transact-SQL statements and optional flow control statements, similar to an executable script. Stored procedures are stored and run within the databas...
Differentiate between truncate and delete
Truncate deletes much faster than deletetruncatedeleteit is a DDL statementit is a dml statementit is a one way trip,cannot rollbackone can rollbackdoesn't have selective features (where clause)hasdoesn't fire database triggersdoesit requires disabling of referential constraints.Does not require
DELETE is a DML command so it works on row level, where TRUNCATE is a DDL command, so it works on object level, TRUNCATE actually deletes the table and recreates it, so whatever people say that it works faster or frees occupied space or high water mark or oil mark.The actual internal work was that.
One main freature is when we have used auto increment clause the would return to next value after delete statement. e.g: if id =1000
then after delete statement when we insert any record the value will be 1001 but if we use truncate statement then auto increment value will come to 0 positon
Outer join--its a join condition used where you can query all the rows of one of the tables in the join condition even though they don’t satisfy the join condition.
OUTER JOIN is the way to collect records in case of lack of true join condition.
outer join is that type of jon which retrives the matach & alson unmatached recores also from the tables
When there are duplicate keys in either or all of the joined datasets the result is a cartesion product. This is termed as Cartesian Join in Abinitio.Eg:If there are two input datasets for a join func...
join two tables without join key , that is crossproduct.
What is bas? What is the function ?
The Business Application Support (BAS) functional area at SLAC provides administrative computing services to the Business Services Division and Human Resources Department. We are responsible for softw...
BroadCast Agent Service is used for scheduling & Monitoring the Documents.
Middle layer of the data base table to Reporting tool
Universe is a Semantic Model or you can Layer which helps to connect users to different databases to design module for business requirement. Universe is a methodology to modify the structure of data...
What are the various modules in business objects product suite?
2 Types
Web Based: CMC,Infoview,Polestar,CCM
Window Based:UNIV,Webrich,BVM,Zcelsius
The various modules in BO are .,
Desktop Intelligence, Designer module, Import wizard which are in client side.
Web Intelligence(Infoview), Central Management Console (CMC), Central Management Server (CMS), Perfomance Management (PM) are in server side.
Name some of the standard business intelligence tools in the market?
BO, Crystalreports, Xcelsius
Micro Startegy
Microsoft OLAP
Qlik view
Informatica
BO
SAS
Crystal Report
Xcelsius
What is olAP, molAP, rolAP, dolAP, holap? Examples?
Answered by: John
Answered On : Mar 27th, 2005Cubes in a data warehouse are stored in three different modes. A relational storage model is called Relational Online Analytical Processing mode or ROLAP, while a Multidimensional Online Analytical processing mode is called MOLAP. When dimensions are stored in a combination of the two modes then it is known as Hybrid Online Analytical Processing mode or HOLAP.
MOLAP
This is the traditional mode in OLAP analysis. In MOLAP data is stored in form of multidimensional cubes and not in relational databases. The advantages of this mode is that it provides excellent query performance and the cubes are built for fast data retrieval. All calculations are pre-generated when the cube is created and can be easily applied while querying data. The disadvantages of this model are that it can handle only a limited amount of data. Since all calculations have been pre-built when the cube was created, the cube cannot be derived from a large volume of data. This deficiency can be bypassed by including only summary level calculations while constructing the cube. This model also requires huge additional investment as cube technology is proprietary and the knowledge base may not exist in the organization.
ROLAP
The underlying data in this model is stored in relational databases. Since the data is stored in relational databases this model gives the appearance of traditional OLAP’s slicing and dicing functionality. The advantages of this model is it can handle a large amount of data and can leverage all the functionalities of the relational database. The disadvantages are that the performance is slow and each ROLAP report is an SQL query with all the limitations of the genre. It is also limited by SQL functionalities. ROLAP vendors have tried to mitigate this problem by building into the tool out-of-the-box complex functions as well as providing the users with an ability to define their own functions.
HOLAP
HOLAP technology tries to combine the strengths of the above two models. For summary type information HOLAP leverages cube technology and for drilling down into details it uses the ROLAP model.
Comparing the use of MOLAP, HOLAP and ROLAP
The type of storage medium impacts on cube processing time, cube storage and cube browsing speed. Some of the factors that affect MOLAP storage are:
Cube browsing is the fastest when using MOLAP. This is so even in cases where no aggregations have been done. The data is stored in a compressed multidimensional format and can be accessed quickly than in the relational database. Browsing is very slow in ROLAP about the same in HOLAP. Processing time is slower in ROLAP, especially at higher levels of aggregation.
MOLAP storage takes up more space than HOLAP as data is copied and at very low levels of aggregation it takes up more room than ROLAP. ROLAP takes almost no storage space as data is not duplicated. However ROALP aggregations take up more space than MOLAP or HOLAP aggregations.
All data is stored in the cube in MOLAP and data can be viewed even when the original data source is not available. In ROLAP data cannot be viewed unless connected to the data source.
MOLAP can handle very limited data only as all data is stored in the cube.
OLAP(ONLINE ANALYTICAL PROCESSING):It is a set of specifications which allows the client applications in retriving the data from the DataWareHouse for analytical process.There r 4 types of OLAPS...
OLAP – Online analytical Processes MOLAP – Multidimensional Online analytical Processes (Cubes) ROLAP – Relational Online analytical Processes (RDBMS) HOLAP – Combination of MOLAP...
Answer posted by vikas r on 2005-05-22 02:57:09: olap stands for online analytical processing. It is used for anaytical reporting
olap stands for Online Analytical Processing.
Every day we load OLTP to OLAP for Data analysis.
Then generate reports base this reports end-user take decisions.
OLAP is used to extract data from multiple sources. All extraction is done online by making use of ETL
What is business intelligence?
Decision making data
BI is refers to computer-based techniques used in identifying extracting and analyzing business data ,such as sales revenue by products and/or departments, or by associated costs and income and also...
Do we need an etl tool? When do we go for the tools in the market?
Cheers,
Sithu
ETL Tools are meant to extract, transform and load the data into Data Warehouse for decision making. Before the evolution of ETL Tools, the above mentioned ETL process was done manually by using SQL c...
What are parameter files ? Where do we use them?
Parameter files are used for static variable exectuion of a task. this file can b modifies/updated for later change in the parameter. say for ex, xyz="RAJAT" is defined in the parameter file...
You can pass a parameter value to a task using a file, Mostly it will be used in dynamic value assignment
Sithu
What is a mapping, session, worklet, workflow, mapplet?
SIMPLY to say::::::::::::::::::
worklet : set of sessions
mapplet : set of transformations that can be called within a mapping
mapping :-set of transformations.And moving data from source to target along with transformation s
session :-set of instructions source addresses and target address u write it
worklet:- set of sessions
How do we call shell scripts from Informatica?
Answer posted by staline on 2005-05-27 00:42:44: you can use a command task to call the shell scripts, in the following ways: 1. Standalone command task. You can use a command task anywhere in the workflow or worklet to run shell commands. 2. Pre- and post-session shell command. You can call...
You can use a Command task to call the shell scripts, in the following ways: 1. Standalone Command task. You can use a Command task anywhere in the workflow or worklet to run shell commands. 2. Pr...
Hi,There are two ways to do this,they are as follows:1)we can use a command task anywhere in the workflow or worklet to run the shell commands.2)In the session task,we can call reusable command ...
What are active transformation / passive transformations?
Answered by kiran on 2005-05-08 09:32:14: an active transformation can change the number of rows as output after a transformation, while a passive transformation does not change the number of rows and passes through the same number of rows that was given to it as input.
Conditional statements are also given in the Expression transformation then why it is passive transformation....
Active transformation is nothing but it will change the no of rows passing through it (example) Filter transformation .In it when we have 1000 rows as input after filter transformation according...
What is the difference between power center & power mart?
Answered by kiran on 2005-05-08 09:27:46: power mart is designed for: low range of warehouses only for local repositories mainly desktop environment. power mart is designed for: high-end warehouses global as well as local repositories erp support.
Informatica PowerCenter - Has all options, including distributed metadata, ability to organize repositories into a data mart domain and share metadata accross repositories, Partioning is available. In...
Power Center is designed for:
High-end Warehouses
Global as well as local repositories
Power Mart is designed for:
Low-end Warehouses
Only for local repositories
What is latest version of power center / power mart?
9.5
9.1.0 is the latest version of PowerCenter.
Differences between star and snowflake schemas
Answered by sudhakar on 2005-05-09 18:32:18: star schema uses denormalized dimension tables,but in case of snowflake schema it uses normalized dimensions to avoid redundancy...
star schema uses denormalized dimension tables,but in case of snowflake schema it uses normalized dimensions to avoid redundancy...
Star Schema &n...
Answered by kiran on 2005-05-06 20:12:19: the perception of what constitutes a vldb continues to grow. A one terabyte database would normally be considered to be a vldb.
Very Large Database (VLDB)
it is sometimes used to describe databases occupying magnetic storage in the terabyte range and containing billions of table rows. Typically, these are decision support systems or transaction processing applications serving large numbers of users.
Cheers,
Sithu
VLDB stands for Very Large Data Base, any database too large (normally more than 1TB) cosidered as VLDB.
Non Additive facts cannot be aggregrated along with dimenstion in the fact table. For eg. Age, weather
Non additive facts are the facts that do not participate in arithmetic caliculations. for example in stock fact table there will be opening and closing balances along with qty sold and amt etc. but opening and closing balances were never used in arithmetic calculations.
What type of indexing mechanism do we need to use for a typical datawarehouse
Answered by on 2005-03-23 01:45:54: bitmap index
Hash Index - If we know that the hashing will result in uniform distribution and doesn't create hotspots (spikes-chaining).Bitmap Index - If the degree of cardinality is high for the attribute, me...
Space requirements for indexes in a warehouse are often significantly larger than the space needed to store the data, especially for the fact table and particularly if the indexes are B*trees.Hence, y...
What is real time data-warehousing
The Real time data warehouse is really a combination of real time (Current) and historical data warehouse.In real time world the historical systems are less valuables because of issues facing managers...
Real time Data warehousing means combination of hetrogenious databases and query and analysis purpose and Decisionmaking and reporting purpose.
What are modeling tools available in the market
There is one more data modelling tool available in the market and that is "KALIDO".
This is end to end data warehousing tool. Its a unique and user friendly tool.
Rational Rose
What are the various reporting tools in the market
Answered by hemakumar on 2005-04-12 05:40:50: cognos businessobjects microstrategies actuate
One tool is also missing in the list. i.e. Dundas Chart for .NET OLAP Services
IntelliView
INEA
MS-Excel
Business Objects (Crystal Reports)
Cognos (Impromptu, Power Play)
Microstrategy
MS reporting services
Informatica Power Analyzer
Hyperion (BRIO)
Oracle Express OLAP
Proclarity
SAS
Qlikview
Actuate
jasper reports
What are the vaious etl tools in the market
1. Oracle Warehouse Builder (OWB) 11gR1 Oracle 2. Data Services XI 3.2 SAP Business Objects 3. IBM Information Server (Datastage) 9.1 IBM 4. SAS Data Integration Studio 4.21 SAS Institute 5. Pow...
ODI is the future of all ETL tools..because its E-LT not ETL.
Etl is extraction,transformation and loading,etl technology is used for extraction the information from the source database and loading it to the target database with the necessary transformations done in between.
ETL is a process that extracts data from one database, transforms it as per the destination database and loads it into another database. In the transformation process, data is actually standardized to...
Data Acquisition technique is now called ETL(Extraction, Transformation and Loading)Extraction-The process of extracting the data from various sources. Sources can be file system, database, XML file, ...
Answered by girinath.S.V.S on 2005-03-17 06:40:48: snowflake schemas normalize dimensions to eliminate redundancy. That is, the dimension data has been grouped into multiple tables instead of one large table. For example, a product dimension table in a star schema might be normalized into a products...
Snow-flake schema is a logical arrangement of tables in dimensional modeling where a centralized fact table references number of other dimension tables; however, those dimension tables are further nor...
Closely related to the star schema, the snowflake schema is represented by centralized fact tables which are connected to multiple dimensionsIn the snowflake schema, however, dimensions are normalized...
Dimensional Modelling is a design concept used by many data warehouse desginers to build thier datawarehouse. In this design model all the data is stored in two types of tables - Facts table and Dimen...
Dimension modelling is a logical design technique that seeks to present the data in a standard framework that allows users for high performance accesss.
Answer posted by chintan on 2005-05-22 18:34:55: a relational database schema organized around a central table (fact table) joined to a few smaller tables (dimension tables) using foreign key references. The fact table contains raw numeric items that represent relevant business facts (price, discount...
star schema defined as,A fact table is surrounded my many multiple dimension tables.
(it means all dimension tables directly maintains relationship with the fact tables in this schema)
star schema (also called star-join schema, data cube, or multi-dimensional schema) is the simplest style of data warehouse schema. The star schema consists of one or more fact tables referencing any n...
Answered by puneet on 2005-05-07 04:21:07: er - stands for entitity relationship diagrams. It is the first step in the design of data model which will later lead to a physical database design of possible a oltp or olap database
The ER Diagrams is the initial process to analaysis a database application, and understands the problems related or causes in future with the choosed database application, in this satge different comp...
Entity Relationship Diagrams are a major data modelling tool and will help organize the data in your project into entities and define the relationships There are three basic elements in ER models: Ent...
Data mart is a segment of a data warehouse that can provide data for reporting and analysis on a section, unit, department or operation in the company, e.G. Sales, payroll, production. Data marts are sometimes complete individual data warehouses which are usually smaller than the corporate data warehouse....
Data Marts: Data Mart is a subset of Data Warehousing. Depending upon the management decision the warehouse devides into smaller part according to their respective subject. eg sales is a data mart of xyz warehouse
Datamart is a subset of datwarehouse. Data mart deals with single line of business like Sales, Purchase etc. Size of data will be less when compared to Datawarehouse.
Some of the different types of datamart are Depndant datamart , Independant datamart, Hybrid datamart
Data warehouse is a repository of integrated information, available for queries and analysis. Data and information are extracted from heterogeneous sources as they are generated....This makes it much easier and more efficient to run queries over data that originally came from different sources. Typical...
Hi..!! As explained following above different people have different definitions for a data warehouse. The most popular definition came from Bill Inmon, who provided following: A data warehouse is a su...
Data warehouse is Subject oriented,Integrated Non volatile, Time variant collection of data. unlike transactional systems. Its a Denormalized huge database.Used for Querying and Reporting Analytics.
Hi there
Teradata utilities are :
1)Mloads
2)Fast Export
3)BTEQ
4)TPump
5).fast load
Hi there
Teradata utilities are :
1)Mloads
2)Fast Export
3)BTEQ
4)TPump