Quantcast
Viewing all articles
Browse latest Browse all 24688

LookUp ETL Design

Hi, 

I have a sql query which contains sourceid, consumerfirstname, consumerlastname, emailaddress, addressline1, addressline2, city, state, zipcode 

I need to populate AddressInstance table upon looking up key values in two different tables ( consumer and address ). The addressinstance table does contain foriegnkeys referencing the consumer and address tables. 

I have already populated the consumer table and the address table . 

I would want to know how can I better design my package to populate the addressinstance table .

I tried with the following approach but it is NOT efficient at all. Takes very long time:

1. I have my source query as oledb source in the data flow task . The source query contains ( sourceid, consumerfirstname,  consumerlastname,emailaddress, addressline1, addressline2, city, state, zipcode )

2. I used the Multicast transformation to use 2 look ups on the same query: The first lookup to get the consumerkey from the consumer table based on the consumerfirstname, consumerlastname, sourceid, emailaddress. The second lookup to get the address key from the address table based on the addressline1, addressline2, city, state, zipcode match. 

3. In this way the 2 lookups work in parallel. I am not sure if this is the right approach as it took a massive amount of time for 130000 records. Please suggest a  better approach to design the package so that I can populate the addressintance table ( consumerkey and addresskey - looking up key values in consumer and address table.

Thanks 


EVA05


Viewing all articles
Browse latest Browse all 24688

Trending Articles



<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>