DQ Global

Data Quality Toolkit
  • What does SureScore do?

    It is the third component of the DQ Toolkit. It provides a flexible matching engine for the comparison of two inputs, deriving a percentage accuracy score of the match.
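
    To make the idea of a percentage match score concrete, here is a minimal Python sketch. It is not the SureScore algorithm, which this page does not describe; it uses the standard library's difflib.SequenceMatcher as a stand-in, and the function name match_percentage is hypothetical.

```python
from difflib import SequenceMatcher


def match_percentage(left: str, right: str) -> float:
    """Return a 0-100 similarity score for two inputs (illustrative stand-in only)."""
    # Compare case-insensitively and ignore surrounding whitespace.
    a = left.strip().lower()
    b = right.strip().lower()
    # SequenceMatcher.ratio() is in the range 0.0-1.0; scale it to a percentage.
    return round(SequenceMatcher(None, a, b).ratio() * 100, 1)


if __name__ == "__main__":
    for pair in [("Smith", "smith "), ("Jon Smith", "John Smith"), ("abc", "xyz")]:
        print(pair, match_percentage(*pair))
```

    A score of 100 means the two inputs are identical after trimming and case folding, and a score of 0 means they share no characters in sequence.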

  • What is a Phonetic Match Key?

    It is the output key created when a data input is phonetically transformed using language-dependent phonetic rules. The key can then be used to match like-sounding data inputs.
  • What does the Fonetix™ function do?

    It phonetically transforms any data input based upon language-dependent phonetic rules to create an output match key (called a Phonetic Match Key). This can then be used to match like-sounding data inputs.
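
    The Fonetix rules themselves are not published on this page. As an illustration of what a phonetic key is, the sketch below implements classic English Soundex, a much simpler and unrelated public algorithm. The function name soundex is chosen for this example only.

```python
# Classic Soundex letter groups. This is NOT the Fonetix rule set.
_GROUPS = {"1": "BFPV", "2": "CGJKQSXZ", "3": "DT", "4": "L", "5": "MN", "6": "R"}
_CODES = {letter: digit for digit, letters in _GROUPS.items() for letter in letters}


def soundex(word: str) -> str:
    """Return the four-character Soundex key for a single word."""
    letters = [c for c in word.upper() if c.isalpha()]
    if not letters:
        return ""
    key = letters[0]
    previous = _CODES.get(letters[0], "")
    for c in letters[1:]:
        code = _CODES.get(c, "")
        # Adjacent letters with the same code are recorded only once.
        if code and code != previous:
            key += code
        # H and W do not separate letters with the same code; vowels do.
        if c not in "HW":
            previous = code
    # Pad with zeros and truncate to a letter plus three digits.
    return (key + "000")[:4]


if __name__ == "__main__":
    for name in ["Smith", "Smyth", "Robert", "Rupert"]:
        print(name, soundex(name))
```

    "Smith" and "Smyth" share a key here, as do "Robert" and "Rupert". Classic Soundex always keeps the first letter, so it would not pair "Xerox" with "Zerocks"; language-dependent rules are one way to handle cases like that.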
  • What does the DQTransformation function do?

    It standardises data to a consistent notation. Data inputs by type (business, name or address) are transformed using a comprehensive set of over 10,000 commonly used abbreviation, elaboration and exclusion rules. You may also modify the case, determine gender, or identify the fields that contain business names, forenames, email addresses and web sites by identifying pattern matches within your data.

    Examples are illustrated below:

    Item                  Abbreviate                                     Elaborate
    Address related       Road to Rd, Avenue to Ave                      Rd to Road
    Business related      Limited to Ltd, Company to Co                  Ltd. to Limited
    Country information   United Kingdom to UK, New Zealand to NZ        UK to United Kingdom
    Date related          January to Jan, Monday to Mon                  Mon to Monday, Jan to January
    Job titles            Manager to Mgr, Colonel to Col                 Mgr to Manager
    Number related        Twenty to 20 and Nine to 9                     121 to One Hundred and Twenty One
    Qualifications        Bachelor of Science to BSc., Esquire to Esq.   BSc. to Bachelor of Science
    Salutations           Doctor to Dr., Mister to Mr                    Dr to Doctor
    Geographic names      Michigan to MI, Hampshire to Hants             Hants to Hampshire, MI to Michigan
    Weights               Ounces to Oz                                   Oz to Ounces
    Custom data           Object to Obj                                  Obj to Object
    Forenames             Robert, Bobby, Bob to Rob                      Bill, Billy, Will, Willy to William
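
    Abbreviation and elaboration rules of the kind listed above can be pictured as simple word lookups. The following Python sketch is an assumption about how such rules might be applied, not the DQTransformation implementation; the dictionaries hold only a few pairs taken from the examples, and apply_rules is a hypothetical helper.

```python
import re

# A few rule pairs taken from the examples; the real rule set is far larger.
ABBREVIATE = {"road": "Rd", "avenue": "Ave", "limited": "Ltd", "manager": "Mgr", "doctor": "Dr."}
ELABORATE = {"rd": "Road", "ltd": "Limited", "mgr": "Manager", "dr": "Doctor", "oz": "Ounces"}


def apply_rules(text, rules):
    """Replace whole words found in `rules`, ignoring case and a trailing full stop."""
    def swap(match):
        word = match.group(1).lower()
        # Words without a rule are returned unchanged, including any full stop.
        return rules.get(word, match.group(0))

    # Match a run of letters, optionally followed by a full stop ("Ltd.").
    return re.sub(r"\b([A-Za-z]+)\b\.?", swap, text)


if __name__ == "__main__":
    print(apply_rules("12 Leigh Road", ABBREVIATE))
    print(apply_rules("Acme Ltd.", ELABORATE))
```

    Keeping abbreviation and elaboration in separate tables lets the caller choose the direction of the transformation for each field.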

  • What is the process for preventing duplicate entry?

    The process is carried out in three stages:

    Step 1 - Using intelligent data transformations, data is normalised and cleansed. For example:

    "Rob", "Bob", "Bobby" and "Robert" might transform to "Bob"

    "Rd" & "Road" might be excluded, and

    "Ltd" might be elaborated to become "Limited". 

    Step 2 - Using phonetic algorithms, the transformed data from step 1 is phonetically processed so that words that sound alike are matched. For example, "Leigh Rd" and "Lee Road", or "Xerox" and "Zerocks", are phonetically alike, so they could be candidates for deduplication.

    Step 3 - Using the SureScore matching engine, a match percentage score is derived, indicating the likelihood of a suspected match.
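
    The three steps can be sketched end to end in Python. Everything in this example is a toy assumption: the rule tables contain only the examples quoted in the steps, crude_key is a hand-written English-only stand-in for a real phonetic algorithm, and the score comes from difflib rather than from the DQ Toolkit.

```python
import re
from difflib import SequenceMatcher

# Step 1: toy normalisation rules (hypothetical, based on the examples in the text).
FORENAMES = {"rob": "bob", "bobby": "bob", "robert": "bob"}
EXCLUDE = {"rd", "road"}
ELABORATE = {"ltd": "limited"}


def normalise(text):
    """Lower-case, split into words, drop excluded words, apply name and elaboration rules."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    words = [w for w in words if w not in EXCLUDE]
    return [ELABORATE.get(w, FORENAMES.get(w, w)) for w in words]


def crude_key(word):
    """Step 2: toy sound rules, not a real phonetic algorithm."""
    w = re.sub(r"^x", "z", word)          # a leading "x" sounds like "z"
    w = w.replace("x", "ks").replace("ck", "k").replace("gh", "")
    # Keep the first character, drop later vowels, collapse repeated characters.
    head, tail = w[:1], re.sub(r"[aeiouy]", "", w[1:])
    return re.sub(r"(.)\1+", r"\1", head + tail)


def match_score(left, right):
    """Step 3: percentage similarity of the two phonetic keys."""
    a = " ".join(crude_key(w) for w in normalise(left))
    b = " ".join(crude_key(w) for w in normalise(right))
    return round(SequenceMatcher(None, a, b).ratio() * 100, 1)


if __name__ == "__main__":
    for pair in [("Leigh Rd", "Lee Road"), ("Xerox", "Zerocks"), ("Leigh Rd", "Xerox")]:
        print(pair, match_score(*pair))
```

    With these toy rules, the pairs from the text score 100, while unrelated inputs score much lower. An application would typically flag a pair as a suspected duplicate only above a threshold it chooses.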

  • Can I integrate with Postcode Address File (PAF) products?

    PAF is quite separate from the DQ Toolkit: PAF products process and structure addresses against a pre-defined, fixed reference, the PAF file supplied by the Royal Mail. The DQ Toolkit is, however, a perfect companion for any PAF product, as it enhances the value of PAF by managing names and addresses and offering up possible matches where operator input error may have occurred.

    Please discuss with DQ Global your specific address correction and verification needs as we provide solutions for business and consumer data in over 230 countries.

  • Can I use the DQ Toolkit for “fuzzy” searching?

    Yes, you can return matching records for retrieval and viewing just as easily as you can for de-duplication. Essentially, the DQ Toolkit allows you to match things that sound the same by normalising the data, phonetically processing it and then matching it.
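
    One way to picture fuzzy retrieval is an index keyed on a match key, so that a query finds every record sharing its key. The Python sketch below is an assumption about how an application might do this; build_index, fuzzy_lookup and toy_key are hypothetical names, and toy_key is only a placeholder for real normalisation and phonetic processing.

```python
from collections import defaultdict


def build_index(records, key):
    """Group records under the match key produced by `key`."""
    index = defaultdict(list)
    for record in records:
        index[key(record)].append(record)
    return dict(index)


def fuzzy_lookup(index, query, key):
    """Return every stored record whose match key equals the query's key."""
    return index.get(key(query), [])


def toy_key(text):
    # Placeholder key: keep the first letter, then drop vowels, "y" and "h".
    letters = [c for c in text.lower() if c.isalpha()]
    return "".join(c for i, c in enumerate(letters) if i == 0 or c not in "aeiouyh")


if __name__ == "__main__":
    index = build_index(["Smith", "Smyth", "Jones"], toy_key)
    print(fuzzy_lookup(index, "Smythe", toy_key))
```

    Because the key is computed once per record when the index is built, each lookup is a single dictionary access rather than a comparison against every record.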

  • How is the technology licensed?

      The DQ Toolkit is licensed in two parts:

      Developer Licence(s)

      A developer licence locked to the developer's PC allows the construction of an application with the DQ Toolkit embedded for deployment.

      Client Licence(s)

      These are application specific, and are charged based upon the number of users.

      • How easy is the DQ Toolkit to integrate into an application?

        We have made every effort to make integration as simple as possible and have developed our component library with this in mind. We provide a comprehensive context-sensitive help file, which includes extensive code examples for the C++, Visual Basic and Delphi development environments.

      • How many country-related data sets are supported?

        We currently support English-language data sets for the UK, USA and Australia. Please contact us for the latest country-related data sets, as we are adding them constantly.

      • How many languages are supported phonetically?

        We currently support English, French, German, Italian and Spanish phonetic rules.

      • Where should I use the DQ Toolkit?

        The DQ Toolkit should be used within any application that captures or contains names and addresses. These include:

        • Sales & marketing database applications
        • Call/contact centre applications
        • Sales Force Automation (SFA) applications
        • Customer Relationship Management (CRM) applications
        • Enterprise Relationship Management (ERM) applications
        • Support & helpdesk applications
        • Data warehouses & data marts
        • Web-based name & address or lead capture applications
      • What are the benefits of using the DQ Toolkit?

        You take a prevention-rather-than-cure approach, ensuring that the quality and structure of your data are correct at source and thus avoiding costly bureau processing.

        Benefits include:  

        • Duplicate entry checking
        • Non-exact (fuzzy) record searching and retrieval  
        • Consistent data quality and format

      • Why should I use the DQ Toolkit?

        To attract, win and retain more business, you must demonstrate you care about your clients. And fundamental to this relationship is accurate, correctly-formatted and duplicate-free data.

      • What is the DQ Toolkit?

        It’s a business software component that ensures the quality and structure of your data are maintained and that duplicate entry is a thing of the past. The DQ Toolkit allows name and address-related data to be standardised and phonetically processed, so that records may be matched where traditional computer matching techniques would fail.