Give examples of degenerated dimensions
A source table has a single column, we not transfer this column to dimension table, directly to connected to the fact table ,this fact table called the de-generated dimensional table
Best example is "Invoice Number","bill number","PO Number", these all are degenerated dimensions in the Transaction Tables, they can be maintained in the fact table itself instead of crating seperate dimensions for this.
Do u need seperate space for datawarehouse & data mart
I think, we don't need separate space for Data Warehouse and Data Mart. Data Warehouse is a collection of Data Marts. Data Mart is a particular line of business..
Why so.....Is there any advantage creating the DWH after creating the Datamarts...?
what is the use of Vice versa...?
What is rapidly changing dimension?
A dimension is considered to be a rapidly changing dimension if one or more of its attributes changes frequently in many rows. For a rapidly changing dimension, the dimension table can grow very large from the application of numerous Type 2 changes.
A rapidly changing dimension is a result of poor decisions during the requirements analysis and data modeling stages of the Data Warehousing project. If the data in the dimension table is changing a lot, it is a hint that the design should be revisited.
What are the vaious etl tools in the market
1. Oracle Warehouse Builder (OWB) 11gR1 Oracle 2. Data Services XI 3.2 SAP Business Objects 3. IBM Information Server (Datastage) 9.1 IBM 4. SAS Data Integration Studio 4.21 SAS Institute 5. Pow...
ODI is the future of all ETL tools..because its E-LT not ETL.
What is surrogate key ? Where we use it explain with examples?
Answered by: saravanan
Answered On : Dec 2nd, 2005Surrogate key is a unique identification key, it is like an artificial or alternative key to production key, bz the production key may be alphanumeric or composite key but the surrogate key is always single numeric key. Assume the production key is an alphanumeric field if you create an index for this fields it will occupy more space, so it is not advisable to join/index, bz generally all the datawarehousing fact table are having historical data. These factable are linked with so many dimension table. if it's a numerical fields the performance is high
Surrogate key is a numeric or integer value in a table to identify each row uniquely. Primary key and surrogate key are same but surrogate key is a system generated value has a define incremental valu...
Surrogate key and primary key both are same, but point of application is different. Primary key is part of transactional system but surrogate key is part of analytical system. Surrogate key is used...
cross reference table used to describe many-2-many relation between tables. aggregate fact table - when granularity of fact data not required fact can be grouped over dimensions, measures will be af...
Incremental Loading:Its also called retrospective data load.In this data being loaded on a regular inerval,and only new data is moved.eg in ur data warehouse ur loading "Transactional Data" ...
Explain degenerated dimension in detail.
If a table contains the values, which are neither dimension nor measures is called degenerate dimensions.Ex : invoice id, empno
Degenerated Dimension is achieved through a gradual modeling approach following Dimensional Modeling standards. Let's take example of a Star Schema representing Sales Invoices. The FACT would have...
What is the difference between oltp and olap
Answered by swetha on 2005-03-30 12:00:33: oltp current data short database transactions online update/insert/delete normalization is promoted high volume transactions transaction recovery is necessary olap current and historical data long...
OLTP: 1.insert/update 2.thousands of users 3.size 1gb to 4gb 4.day to day operations 5.transaction of data,detailed data 6.application oriented OLAP: 1.read-query 2.hundreads of people(only d organiz...
OLTP 1) Original source of data2) Normalized 3) Quries are simple ...
What is data cleaning? How is it done?
Could you please give me one real time example How we do data cleansing.....
It is nothing but purifying data , the extract process always exposes the data qulaity issues that have been bured within the operational source system. Since the data qulaity significantly impacts da...
Summarize the differene between oltp,ods and data warehouse ?
OLTP: It stores current data.
ODS: It stores recent data
OLTP: It is real time transaction. so all transaction comes in oltp, which are being perform at current time. ex- if we go to any showroom and buy anything then showroom owner creates our database and...
Describe how you would establish the operational (service) requirements for the data warehouse?
Hello guys....Please help... i have an interview with nhs for a data ware house role and for which they have asked me to prepare a presentation on a topic "describe how you would establish the operational (service) requirements for the data warehouse? I have 10 minutes for the presentation" but I am...
service requirement means establishing the connection between client and server...right? well if it is then i can guide you that how to connect a client machine to server via repository. once the ser...
How many workflows will use to load history data and daily data?
Could you please detail the question? Loading history is nothing to do with number of workflows. Might be you got confused between initial load and incremental load. A single workflow can be used for...
one work flow.....
What is difference between e-r modeling and dimentional modeling.
I think the answer the best suits this question is as below: Entity - Relationship Modelling:- Removes data redundancy. Ensures data consistency. Expresses relationship between the entities....
ER ModelProcess : NormalizationJoin : More ( n-1)Detail DataSize: MB to GBData : Current Datauser's : More than 1000Data: Volatile.Dimensional ModelingProcess: DenormalizationJoin: LessData: Summarize...
What is push down optimization ? What is its use and importance in data warehousing ?
Pushdown Optimization is a new feature introduced from Informatica 8. To increase session performance, the integration service pushes transformation logic to the source or target database. Based on t...
Pushdown Optimization will push the source query with transformation logic
What is the difference between business key and surrogate key ?
Business key is a primary key in the operational system. Surrogate key is a system generated primary key. In Dimensions and Facts we use Surrogate keys as Primary keys. Please refer my answer for the...
Business Key is nothing but a natural key.Surrogate key is a system generator just like sequence in oracle.Basically we come across this in slowly changing dimensions type-II category.
What types of datasources are in Informatica
What are homegeneous sorces and heterogeneous data sources ? Is flat file different from those two
The dictionary meaning of Homogeneous is Uniform and Heterogeneous is Mixed. For example if a mapping is using only Oracle sources or Flat files or DB2 or XML or any other then they are called Homogen...
What is the difference between datawarehouse and datawarehousing
Hey Opbang Ur statement "Datawarehousing is not just having a single data warehouse", with datawarehousing it is understood that business decision can be taken and lots of information and analysis rep...
dataware house is a container to store the historical data
where as dataware hosuning is a process or technique to analyze tha data in the ware house
Which automation tool is used in data warehouse testing?
We have tried QTP from HP and RFT from IBM. Both can launch SQL and you can script around doing the result compares but it is not easy. we now use QuerySurge.
No Tool testing in done in DWH, only manual testing is done.
Answer posted by chintan on 2005-05-22 18:34:55: a relational database schema organized around a central table (fact table) joined to a few smaller tables (dimension tables) using foreign key references. The fact table contains raw numeric items that represent relevant business facts (price, discount...
star schema defined as,A fact table is surrounded my many multiple dimension tables.
(it means all dimension tables directly maintains relationship with the fact tables in this schema)
star schema (also called star-join schema, data cube, or multi-dimensional schema) is the simplest style of data warehouse schema. The star schema consists of one or more fact tables referencing any n...
Foreign keys of facts tables are primary keys of Dimension tables. It is clear that fact table contains columns which are primary key to other table that itself make normal form table.
The Fact table is central table in Star schema, Fact table is kept Normalized because its very bigger and so we should avoid redundant data in it. Thats why we make different dimensions there by makin...