Hello all,

I am working on a project where I analyze street addresses to confirm their validity. I am checking the city, zip code, county name and FIPS code for spelling, and also confirming that all of the fields are in sync with each other. For example, does the city match the zip code, and does the zip code match the county name and the corresponding FIPS code?

The first step is a Fuzzy Lookup that compares the data against a zip code file containing the correct information, which serves as the reference. The next step is a Conditional Split that outputs perfect matches and mismatches (including misspellings) separately. I would like to add a second Conditional Split that uses the “DIFFERENCE” function to further inspect the misspelled data and see whether it can still be processed. My intent is to compare City from the input table with RefCity from the Fuzzy Lookup reference table.

The problem arises when I try to find the DIFFERENCE string function in the Conditional Split Transformation Editor. Unfortunately, this function does not exist there in Visual Studio 2008. DIFFERENCE works better than SOUNDEX for what I am trying to accomplish.

Has anyone tried something similar, or does anyone have an idea how to work around this? The Conditional Split seems to be the best route for the sake of efficiency.

Thanks in advance; any suggestions or help would be appreciated.
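
To make it clearer what kind of comparison I am after, here is a rough Python sketch of the idea. It is only an illustration under my own assumptions: `soundex` follows the standard American Soundex rules, and `difference` simply counts the positions (0 to 4) where the two codes agree. SQL Server's actual DIFFERENCE may score some pairs differently, and the function names here are mine, not anything available in SSIS.

```python
# Standard American Soundex letter groups (digit 1 to 6).
_CODES = {}
for _digit, _letters in enumerate(("BFPV", "CGJKQSXZ", "DT", "L", "MN", "R"), start=1):
    for _ch in _letters:
        _CODES[_ch] = str(_digit)


def soundex(word):
    """Return a 4-character Soundex code (illustrative, not SQL Server's exact rules)."""
    letters = [c for c in word.upper() if "A" <= c <= "Z"]
    if not letters:
        return "0000"
    code = letters[0]
    prev = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        digit = _CODES.get(ch, "")
        # Emit a digit only when it differs from the previous letter's digit.
        if digit and digit != prev:
            code += digit
            if len(code) == 4:
                break
        # H and W do not break a run of equal digits; vowels do.
        if ch not in "HW":
            prev = digit
    return code.ljust(4, "0")


def difference(a, b):
    """Assumed approximation of DIFFERENCE: count matching code positions (0 to 4)."""
    return sum(1 for x, y in zip(soundex(a), soundex(b)) if x == y)
```

With this sketch, a misspelled city such as "Springfeild" compared against a RefCity of "Springfield" scores 4, so a row like that could be routed to a "still usable" output, while low scores would go to the reject output.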