What is Match?
Match is a powerful software utility which interrogates database tables in search of duplicate records. The search criteria are defined by the user, from a comprehensive range of advanced match operators which include phonetic, direct word match, telephone/fax number and name initials.
How easy is Match to use?
Match is a Microsoft Windows application, with an intuitive point-and-click interface backed up by a context-sensitive online help facility and comprehensive user documentation.
What are the benefits of using Match?
Match will increase your overall profitability and efficiency by identifying and allowing removal of replicated data from your database. Savings come from reduced postage, stationery and reduced database administration, while at the same time enhancing your company image.
Will Match work with the database I already have?
Match is an ODBC-compliant application which connects to all popular database formats.
Can Match process one database against another?
Match can search a single database table for duplicates or search between two tables regardless of their format or database type. For example it can search an MS Access table against a MS SQL Server table.
Can I run Match without importing data from my database?
There is no import, formatting or preparation of data required if your database type is supported.
Can I use Match in a Client Server environment?
Match has been designed to work with databases irrespective of their environment.
Can Match be set to run outside working hours?
Match has a built-in scheduling facility which allows users to set a predefined time for the de-duplication process to run.
How long does it take to perform a de-duplication session?
Match performance times vary according to the size of database(s) used, as well as the amount and type of search criteria selected. Network speed, processor speed, memory available will also affect your performance, typically between 1 to 1.5 million records an hour.
How many data fields can be used in Match search criteria?
There is no limit to the number of fields which may be selected during the search process.
Does Match provide an export facility?
Yes – it is possible to export selected data fields and/or a list reflecting the Master/Duplicate relationships to a CSV format file.
Can I control the deletion of duplicate records?
Match allows you to review and confirm the deletion of any records identified as duplicates. Data can also be edited from the duplicate record to enhance the master record by cutting and pasting the required information.
What happens to data linked to duplicated records that I have deleted?
Match has a facility for re-assigning ‘orphan’ data to the master when records are deleted: this will work for one-to-many related tables only.
What kind of machine do I need to run Match?
To install and run Match you will need Intel or AMD CPU.
Microsoft Windows 2000 or XP minimum 512mb.
Vista and Windows 7 32bit minimum 1Gb.
Vista and Windows 7 64bit minimum 2Gb.
Can I Standardize and format my data?
Yes, Match has an inbuilt data processor which allows you to carry out numerous data processing functions with case.
Can I use Match for single customer view (SCV) projects?
Yes, Match has been used by many clients to help build a single customer view (SCV).
Can I use Match for Master Data Management (MDM) projects?
Yes, Match has been used to assist with identifying the associations required to deliver Master Data Management (MDM) projects.
How many spoken languages does Match support?
Currently Match supports: English, French, German, Italian and Spanish.
Does Match use phonetic match key generation techniques?
Yes, it generates and uses phonetic match keys in conjunction with other techniques for data normalization.
Does Match use deterministic or probabilistic logic for matching?
It uses both for what we believe delivers realistic matching.
Can you apply weighting to the Match candidate scores?
Yes, Match allows you to apply weightings to every field used in the matching process to modify the overall match score.
Can I write my own match criteria?
Yes, it allows users to write their own match types using VB Script.
Can I integrate Match with other applications?
Match ships with a powerful VB Scripting language, which gives user access to exposed internal functions and the ability to integrate with third-party applications, for example:
- CRM systems for data synchronisation and re-assignment.
- ETL tools for data migration.
- Data manipulation through VB Script exposed.
What does Match do other than matching and de-duplication?
Match has a comprehensive range of data-enhancing capabilities in five spoken languages:
- Case management: upper, lower and correct
- Gender determination
- Data standardisation (or transformation)
- Flagging
- Grouping
- Re-assignment of orphaned data
- Merging of data
Can match work with misaligned data?
Yes, Match has a capability called intra-match which is designed to cater for misaligned data in records.
How many disparate data sources can match work with?
Match currently works with a maximum of two databases or data sources. They can be of different structures and formats however. For matching multiple data sources to a master, we recommend creating and de-duplicating a master database and cross comparing all other databases to the master. This process creates a single record view and will assist in master data management projects.
Do I need to import and export my data?
There is no need for any error prone import and export routines. Match connects to your chosen database and can either (1) Flag duplicate records (2) Group duplicated records or (3) Link by Unique Record ID any matching records (4) Delete matching records.
How can I merge information from an identified duplicate record into a main record?
In order to ensure you create the perfected record Match has a sophisticated data processor which allows you to: Link, Group, Flag, Standardize, Merge and Format your data so you move data from any duplicate to its corresponding Master record.
Are salacious words detected by Match?
Match can be configured to recognize salacious words and either remove them or flag records where they are detected.
Can I create my own data transformations data for lookup and correction?
Yes, Match allows you to create your own libraries of data standardizations, data conversions and data exclusions. This capability extends the capabilities outside of just name and address matching to parts, products or any domain attribute you can recognize.
Can I Suppress records?
Yes, Match may be used for suppression, simply cross match your database to the suppression file of your choice.
Can B2B or B2C data be managed?
Yes, Match is data agnostic, you control the way the matching sessions are defined based upon the type of data. Every column (field) containing data can have different rules defined. As such any data structure can be catered for, regardless of Business Name, Person Name, Company or Personal Address, Telephone, eMail etc.
Why use matching software instead of a bureau service?
We offer a Bureau Service, however, it does not treat the root cause of bad data once as opposed to perpetually treating the effects by sending it to a bureau.
How do I view/manage a sub set of my database?
Match and DedupeExpress can connect directly to database views, access queries, or you can apply SQL filters right inside Match to work with a sub-set of your whole database.
How do I modify my inconsistent data so it becomes consistent?
Match has a data processor which will transform (standardize) data formats so (U.S., United States, America) can be abbreviated, elaborated or excluded as appropriate with ease.
What rules do I make/select to look for duplicates?
our Match Software allows users to define rules at field (column) level. These can be very simple through to complex with the ability to define field level weightings so that matching is tuned for the purpose intended. i.e. Company level, premise level or individual level matching will need different rules and weightings, which we manage this with ease.
How do I modify the rule(s) for exceptions?
Through an intuitive interface within Match. This interface allows for:
- Data standardizations (transformations)
- Data substrings (parts of columns may be used)
- Parsing of columns where multiple words may need breaking up
- Phonetic rules for similar sounding words (Zerox and Xerocks etc.)
- Custom rules if required may be executed in Visual Basic script for highly customized rules
Will it catch and categorize the non identical duplicates (Positive Match, Ambiguous Match and No Match)?
Yes, Match Software allows you to define the matching rules as well as % similarity thresholds ranges:
- Positive – Above the upper % threshold are considered duplicates
- Ambiguous – Between the upper and lower % threshold are considered ambiguous
- No Match – Below the lower % threshold are considered non duplicates
How do I merge duplicates and how do I choose what data to merge?
Whilst we have a solution for this problem every customer has different needs. As such we have created an add on for Match called the One:One data manager. We are extending this to have a rules editor to define which of the colliding pieces of 1:1 data should survive.
How do I merge child (foreign key) relationships?
We have a re-assign orphan’s wizard in Match. Using this on your database will correctly re-assign all foreign key child table data. However, if referential integrity is maintained by the application, as is often the case with CRM, all changes should be posted through the relevant application API’s or Web services.
Can I identify relationships rather than duplicates?
Can it detect and link a subsidiary to a parent? I believe so if the names are similar of the relationships have been defined in the application
Can I merge data to a parent? Yes.
Can I use a third party source, such as Brooks & Dunn or Hoovers, to help me? Yes.
License Error in Match
Please try the below steps to try to overcome the License Error you have received:
- Please try to ‘Run as Administrator’ and then apply the license.
- Please make sure you have full read/write access to the folders and sub folders of ‘C:DQGlobal’ and ‘C:Program FilesDQ GlobalMatch’ (Windows XP) or ‘C:Program Files (x86)DQ GlobalMatch’ (Above Windows XP)
If none of the above solves your issue, then please contact support@dqglobal.com as your license may have expired.
Which databases does DedupeExpress connect to?
DedupeExpress connects to all leading databases.
What is DedupeExpress?
DedupeExpress is a powerful software utility which interrogates database tables in search of duplicate records. The search criteria are defined by the user, from comprehensive range of advanced match operators which include phonetic, direct word match, telephone/fax number and name initials.
Is DedupeExpress easy to use?
Yes – DedupeExpress is a Microsoft Windows application, with an intuitive point-and-click interface backed up by a context-sensitive online help facility and comprehensive user documentation.
What are the benefits of using DedupeExpress?
DedupeExpress will increase your overall profitability and efficiency by identifying and allowing removal of replicated data from your database. Savings come from reduced postage, stationery and reduced database administration, while at the same time enhancing your company image.
Will DedupeExpress work with the database already have?
DedupeExpress is an ODBC-compliant application which connects to all popular database formats as laid out in the enclosed feature list.
What does DedupeExpress do other than matching and de-duplication?
DedupeExpress can also flag records.
Does DedupeExpress use deterministic or probabilistic logic for matching?
It uses both for what we believe delivers realistic matching.
Does DedupeExpress use phonetic match key generation techniques?
Yes, it generates and uses phonetic match keys in conjunction with other techniques for data normalization.
How many spoken languages does DedupeExpress support?
Currently DedupeExpress supports: English, French, German, Italian and Spanish.
Can I use DedupeExpress for Master Data Management (MDM) projects?
Yes, DedupeExpress has been used to assist with identifying the associations required to deliver Master Data Management (MDM) projects.
Can I use DedupeExpress for single customer view (SCV) projects?
Yes, DedupeExpress has been used by many clients to help build a single customer view (SCV).
What kind of machine do I need to run DedupeExpress?
To install and run DedupeExpress you will need Intel or AMD CPU. Microsoft Windows 2000 or XP minimum 512mb. Vista and Windows 7 32bit minimum 1Gb. Vista and Windows 7 64bit minimum 2Gb.
Can DedupeExpress process one database against another?
DedupeExpress can search a single database table for duplicates or search between two tables regardless of their format or database type. For example it can search an MS Access table against a MS SQL Server table.
Can I run DedupeExpress without importing data from my database?
There is no import, formatting or preparation of data required if your database type is supported.
Can I use DedupeExpress in a Client Server environment?
DedupeExpress has been designed to work with databases irrespective of their environment.
Can DedupeExpress be set to run outside working hours?
No unfortunately this capability is not available in DedupeExpress but is available in our Match product.
How long does it take to perform a de-duplication session?
DedupeExpress performance times vary according to the size of database(s) used, as well as the amount and type of search criteria selected. Network speed, processor speed, memory available will also affect your performance, typically between 1 to 1.5 million records an hour.
How many data fields can be used in DedupeExpress search criteria?
There is no limit to the number of fields which may be selected during the search process.
Does DedupeExpress provide an export facility?
Yes – it is possible to export selected data fields and/or a list reflecting the Master/Duplicate relationships to a CSV format file.
Can I control the deletion of duplicate records?
DedupeExpress allows you to review and confirm the deletion of any records identified as duplicates. Data can also be edited from the duplicate record to enhance the master record by cutting and pasting the required information.
What % of eMails don’t actually deliver?
This depends on a number of factors:
- Soft Bounces – out of office replies for example
- Hard bounces don’t deliver because the eMail is incorrectly typed or the target eMail account is no longer active
- The sender can be on a blacklist (spam sender list), greylist,
- Have a low sender score, hence poor sender reputation which will cause some mail servers to reject or quarantine your message
- Finally, if there is not a valid recipient on the target eMail server, catch all accounts are set up to swallow spam and send them into a black hole. When this happens no bounce report will be received and your sent eMail will never be read. Our tests indicate this is true of between 30 and 40% of all lead generation eMail traffic which distorts the sent eMail to received, opened, read and click through reports
What about opt-in?
We do not opt-in any eMails as we are not the sender. However, current law in most countries including the UK is that B2B eMails do not require to be opted in providing you have:
- A legitimate offer which is relevant to the recipient companies line of business
- An opt out provision in the eMail
- Sender contact details
What is eThenticate™?
eThenticate™ is a capability designed for validating email addresses. It silently communicates with eMail servers to check for eMail address acceptance but never physically sends intrusive emails.
What is eThenticate™ used for?
It is used to discover if the eMails you have stored in your databases or applications are receivable, may be receivable or will bounce.
Can it be used in batch mode?
Yes it will process millions of eMails per hour to identify their receivability.
Can it be used in real-time for internal systems or websites?
Yes our customers incorporate eThenticate™ into their web sites, internal applications, and commercial software products through web services.
Does it check for syntax errors?
Yes the first of multiple checks is for incorrectly formed or missing parts of the eMail.
What is eThenticate doing to validate an eMail?
It is firstly checking the eMail is formed correctly (syntax), then checking the Domain is valid (DNS), then the Domain has valid Mail Servers (MX) and finally communicating with the mail server(s) in a special way to confirm or deny receivability.
How does eThenticate™deal with “Catch All” accounts?
We are aware that many mail servers will return a success code no matter what recipient address is specified? Because of this we are able to categorize eMails as: 1 – Yes will be received, 2 – May be received and 3 – Will not be received.
Aren’t you acting like a spammer?
No. We are acting responsibly to validate that eMails can or will be received to avoid sending Bulk eMails to dead, invalid or catch all accounts in order to reduce back scatter and undeliverable internet traffic.
Can you generate or append eMails?
Yes, we are able to generate missing eMails and then test them using eThenticate™ and using our vast experience in matching, we can match your business names and addresses to our extensive BizBase™ to append an eMail where we have one already.
What is Dedupe4Excel?
Dedupe4Excel is an add-in for MS excel™, which identifies and manages duplicate rows and searches across grouped fields as specified by the user to find and remove duplicate data from MS Excel™.
Is Dedupe4Excel easy to use?
Yes, Dedupe4Excel is fast, simple to use and efficient utilising a powerful intra-match engine.
What are the benefits of using Dedupe4Excel?
You can over come the data misalignment by searching grouped fields as specified by the user. Dedupe4Excel uses sophisticated fuzzy and phonetic matching processes identify probable duplicate records.
How many spoken languages does Dedupe4Excel support?
Currently Dedupe4Excel supports: English, French, German, Italian and Spanish.
Does Dedupe4Excel create new worksheet with Duplicates?
Yes, Dedupe4Excel creates 3 new worksheets, Matching records comparisons, duplicates and clean data.
Do you have a trial version of Dedupe4Excel?
Yes, you can download a trial version of Dedupe4Excel from our download page. The trial version is limited to 250 records and expires after 7 days.
What operating systems does Dedupe4Excel support?
This product is no longer supported.
What databases does Dedupe4Excel support?
This product is no longer supported.
What specifications of hardware to you require to run Dedupe4Excel?
The recommended hardware for Dedupe4Excel is CPU 1GHz, RAM 512Mb and Diskspace 1Mb.
What is the DQ Toolkit?
It’s a business software component that ensures the quality and structure of your data is maintained and duplicate entry is a thing of the past. DQ Toolkit allows name and address-related data to be standardised and phonetically processed, so that records may be matched where traditional computer matching techniques would fail.
Why should I use the DQ Toolkit?
To attract, win and retain more business, you must demonstrate you care about your clients. And fundamental to this relationship is accurate, correctly-formatted and duplicate-free data.
What are the benefits of using the DQ Toolkit?
You undertake a prevention rather than cure approach, ensuring that the data quality and structure of your data is correct at source, thus avoiding costly bureau processing.
Benefits include:
- Duplicate entry checking
- Non-exact (fuzzy) record searching and retrieval
- Consistent data quality and format
Where should I use the DQ Toolkit?
The DQ Toolkit should be used within any application that captures or contains names and addresses. These include:
- Sales & marketing database applications
- Call/contact centre applications
- Sales Force Automation (SFA) applications
- Customer Relationship Management (CRM) applications
- Enterprise Relationship Management (ERM) applications
- Support & helpdesk applications
- Data warehouses & data marts
- Web-based name & address or lead capture applications
How many languages are supported phonetically?
We currently support English, French, German, Italian and Spanish phonetic rules.
How many country-related data sets are supported?
We currently support English-speaking data sets for the UK, USA and Australia. Please contact us for latest country-related data sets as we are adding them constantly.
How easy is the DQ Toolkit to integrate into an application?
We have made every effort to make integration as simple as possible, developing our component library with this in mind. We provide a comprehensive context-sensitive help file, which includes extensive code examples in C++, Visual Basic and Delphi development environments.
How is the technology licensed?
The DQ Toolkit is licensed in two parts:
Developer Licence(s)
A developer licence locked to the developer’s PC allows the construction of an application with the DQ Toolkit embedded for deployment.
Client Licence(s)
These are application specific, and are charged based upon the number of users.
Can I use the DQ Toolkit for “fuzzy” searching?
Yes, you can return matching records for retrieval and viewing just as easily as you can for de-duplication. Essentially the DQ Toolkit allows you to match things that sound the same by normalising the data, phonetically processing it and then matching it.
Can I integrate with Postcode Address File (PAF) products?
PAF is quite separate from the DQ Toolkit as it processes and structures addresses from a pre-defined and fixed structure: the PAF file supplied by the Royal Mail. The DQ Toolkit is, however, a perfect companion for any PAF product as we enhance the value of PAF by managing names and addresses, offering up possible matches where operator input error may have occurred.
Please discuss with DQ Global your specific address correction and verification needs as we provide solutions for business and consumer data in over 230 countries.
What is the process for preventing duplicate entry?
The process is carried out in three stages:
Step 1- Using intelligent data transformations, data is normalised and cleansed. For example:
“Rob”, “Bob”, “Bobby” and “Robert” might transform to “Bob”
“Rd” & “Road” might be excluded, and
“Ltd” might be elaborated to become “Limited”.
Step 2 – Using phonetic algorithms, the transformed data from step 1 is phonetically processed so words that sound alike are matched. For example, “Leigh Rd” and “Lee Road” or Xerox and Zerocks are phonetically alike so they could be candidates for deduplication.
Step 3 – Using a match percentage score is derived indicating the likelihood of a suspected match.
What does the DQTransformation function do?
It standardises data to a consistent notation. Data inputs by type (business, name or address) are transformed using a comprehensive set of over 10,000 commonly used abbreviation, elaboration and exclusion rules. You may also modify the case, determine gender, or identify the fields that contain business names, forenames, email addresses and web sites by identifying pattern matches within your data.
Examples are illustrated below:
|
Item |
Abbreviate |
Elaborate |
|
Address related |
Road to Rd |
Avenue to Ave Rd to Road |
|
Business related |
Limited to Ltd |
Company to Co Ltd. to Limited |
|
Country information |
United Kingdom to UK |
NewZealand to NZ UK to United Kingdom |
|
Date related |
January to Jan |
Monday to Mon Mon to Monday, Jan to January |
|
Job titles |
Manager to Mgr |
Colonel to Col Mgr to Manager |
|
Number related |
Twenty to 20 and Nine to 9 |
121 to One Hundred and Twenty One |
|
Qualifications |
Bachelor of Science to BSc. |
Esquire to Esq. BSc. to Bachelor of Science |
|
Salutations |
Doctor to Dr. |
Mister to Mr Dr to Doctor |
|
Geographic names |
Michigan to MI, Hampshire to Hants |
Hants to Hampshire, MI to Michigan |
|
Weights |
Ounces to Oz |
Oz to Ounces |
|
Custom data |
Object to Obj |
Obj to Object |
|
Forenames |
Robert, Bobby, Bob to Rob, |
Bill, Billy, Will, Willy to William |
What does the Fonetix™ function do?
It phonetically transforms any data input based upon language-dependent phonetic rules to create an output match key (called a Phonetic Match Key). This can then be used to match like-sounding data inputs.
What is a Phonetic Match Key?
It phonetically transforms any data input based upon language-dependent phonetic rules to create an output match key (called a Phonetic Match Key). This can then be used to match like-sounding data inputs.
What does SureScore do?
It is the third component of the DQ Toolkit. It provides a flexible matching engine for the comparison of two inputs, deriving a percentage accuracy score of the match.
What is Authentic8?
Authentic8 is an easy to use international address correction, UK suppression and UK data enhancement application which increases the accuracy of name and address data so your direct marketing activities are:
• More effective
• Better targeted
• Avoid mailing waste
• Avoid distress to the bereaved
• Comply with legislation (MPS, TPS, FPS, CTPS)
How many datasets can Authentic8 use?
Authentic8 can use a range of datasets – address correction in 230 countries and extensive UK suppression and enhancement.
Does Authentic8 connect to my database?
Connects to virtually any database format and structure to overcome the challenge of data import and export.
Does Authentic8 have any other built in capabilities?
Optionally, VB scripting extends the functionality of the product and enables complex pre and post processing and re-try logic to be implemented. It provides access to all the capabilities of the DQ Toolkit also for data formatting and manipulation.
What type of databases can Authentic8 work with?
It can work with all the leading database vendors and flat file formats. It attaches to databases using: Native Drivers, ODBC or UDL connections so that data can be compared to the reference address data sets for address correction and address validation.
Is Authentic8 available as an API?
Not quite, Authentic8 is a pre-built application which uses our Address Toolkit API – using our Address Toolkit you can immediately integrate into websites and business applications requiring address capture, address standardisation and address verification.
What is Authentic8?
Authentic8 is an easy to use business application for the correction and validation of international addresses for up to 230 countries, which increases the accuracy of name and address data so your direct marketing is: delivered, more effective, better targeted, less wasteful, and where used with suppression data, avoids distressing the bereaved and complies with legislation MPS, TPS, FPS, CTP.
Can I use Authentic8 with my CRM system or other business applications?
Yes, Authentic8 connects directly to your database to retrieve records and verify them without the need for error prone export or re-import. You can write the data directly back to the database, to new fields in the database or to a new database altogether.
How fast will it process my address data?
This is very dependent on the machine you run Authentic8 on, the quality of the data and several other factors. On average though it will process between 400,000 and 600,000 records per hour.
Is there a limit to the number of records I can process?
No there is no limit to the number of rows which can be processed by Authentic8.
Can I interactively review ambiguous address records?
Yes, where more than one record may be a candidate, Authentic8 allows you to review and correct them in an intuitive review screen.
Do I get result codes reported from the address processing?
Yes, every record looked at has a result code recorded against it so you may analyse the results and be certain of the accuracy of your data.
Is there any re-try logic?
Yes, you may re-try and take the benefit of the inbuilt VB script to recover address records which otherwise might fail first time round.
What are the benefits of pre or post-processing?
You can parse, standardize and format your data as a pre-process, before being sent to the addressing engine to ensure you obtain the best results. Likewise for post processing you can update your corrected data to a chosen target or propagate to multiple systems if running a Master Data Management (MDM) or Single Customer View (SCV) project.
Why do I need to bother correcting my addresses
Because having the wrong address means your message will probably not get delivered, you offend the recipient and you waste money unnecessarily on postage and collateral.
How can I deal with Vanity Addresses?
This is always a difficult area, however, using the VB script or by writing out the corrected data to new fields it is possible to detect vanity data and preserve the personalisation people demand.
Can I standardize addresses without verifying them as postally deliverable?
Not currently in Authentic8, this is a feature we are adding to our DQ 360 product.
Does Authentic8 import and export address records to verify?
There is no need for any error prone import and export routines. Authentic8 connects to your chosen database and can either
(1) output to a new database table, (2) update new columns in the existing table, or (3) overwrite the input data where determined correct.
How much interactive reviewing do you generally need to do with Authentic8?
That depends on the quality of the input data, there are four states, (a) Correct, (b) Corrected, (c) Ambiguous, i.e. there are multiple possibilities of a match for review. Or, (d) Incorrect, the address is not able to be matched nor corrected.
Why use addressing software instead of a bureau service?
We offer a Bureau Service, however, it does not treat the root cause of bad data once as opposed to perpetually treating the effects by sending it to a bureau.
What is the Address Toolkit API?
It is a component which will standardize, format and validate your local or international address data. It will work at point of address capture or for batch address processing to deliver accurate and trusted data.
Where can I use the Address Toolkit?
You can use it in any application which captures postal addresses, including the web, business applications, desktop applications, call centre applications.
Can I get GeoCodes form the Address Toolkit?
Yes, we have extended data sets which allow you to obtain GeoCodes for special analysis. These can be returned as Lat, Long or Easting and Northing as required.
What other data sets are available other than PAF?
There are a variety of data sets available for extending the basic PAF lookup including:
- Grid reference data (100m detail only)
- NHS Health Authority codes
- Local Authority Ward information
- Business data including: SIC, Turnover, Number of Employees, financial status and risk
- Consumer Data including: Lifestyle, GeoDemographic, SocioEconomic, Financial status and risk
- Suppression data sets including: Bereaved and Gone Away
- NCOA
As this data is constantly changing and evolving please contact one of our sales consultants for up to date information.
What is the Postcode Address File (PAF) file?
The Postcode Address File (PAF®) is the most up-to-date and complete address database in the UK, containing over 28 million addresses. PAF® is an invaluable tool for creating and maintaining mailing lists and databases, as well as reducing the number of returned or undelivered items.
Every house and business in the United Kingdom has been given a postal address by Royal Mail. This address is used as a routing instruction by Royal Mail staff to sort and deliver mail quickly and accurately.
The PAF file is the basis of reference for address checking through the Address Toolkit.
What’s the benefit of using the Address Toolkit API?
- Assist during address data capture to reduce the errors during address data capture
- Reduce address input time for your call centre staff
- Eliminate database spelling mistakes and formatting errors
- Improve or remove poor quality address data and validate your customers’ identity
- Create new customer mailing lists
- Allow people to look-up addresses online
- Save time and re-posting costs by correctly addressing mailings
- Capture “verified” customer address details
- Quicken your web checkout process with “address auto-fill” and help avoid abandonment’s
- Use postcode data for customer profiling
- Promote a professional image by getting it right first time
What reference data does it use?
It references trusted postal authority data sources to ensure local or International addresses are captured correctly.
This ensures your data is right first time, hence avoids data scrap and re-work. Means your deliveries or letters, parcels, statements, invoices etc. get delivered which improves customer satisfaction, improves cash flow and make you more profitable.
How many records can I process?
Unless you are on a pay as you click plan, there are few restrictions on the number of records you can process, however, as things change over time please ask one of our sales consultants for up to date information.
Does it standardize addresses?
Yes, as part of the address lookup and retrieval process, addresses are standardized according to the local postal requirements. This includes dealing with diacritics, localized formatting, abbreviations and correcting common spelling mistakes.
How many International countries are supported?
Up to 230 countries are supported to varying degrees of completeness. As this list and completeness varies from time to time, please ask one of our sales consultants.
Can I use the Address Toolkit as a web service?
Yes, it may be called via SOAP, XML over HTML or AJAX.
Can I use multi-occupancy PAF data?
Yes, this available as an extended data set, please ask one of our consultants.
Can I use newly built or not yet built PAF data?
Yes, this available as an extended data set, please ask one of our sales consultants.
What benefits does the international addressing product provide?
As well as validating overseas addresses, The Address Tookit API International allows you to choose the language and format for the country in question, e.g. Cyrillic script. This means that any mailings sent to foreign addresses will be in the appropriate format and language, reducing the likelihood of misspellings or causing unnecessary offence.
There are over 120 different address formats globally, and 30 different language scripts, so keeping global address data correct is no small task. The Address Tookit API International also removes the need to spell all those tricky foreign words!
Can I use the API as pay per click service?
Yes, we can charge for the service on a pay-per-click basis. The advantage to this is that you can access addresses anywhere in the world using one credit pack, costing from just £50 per annum.
Can I combine UK PAF® look-up with The Address Tookit API International?
Yes. We have the ability use the international addressing service for non UK data and then switch over to Royal Mail’s Postcode Address File (PAF®) data whenever a UK postcode is searched.
What technologies scan use to integrate the Address Toolkit API?
The API can be accessed using most development environments and from the web as: SOAP, XML over HTML or AJAX.
My address data is badly formatted, will the Address Toolkit API help me?
Absolutely. The Address Toolkit takes the input you provide and, after making matches, returns the results according to the Royal Mail Postcode Address File (PAF®) or International Postal Authorities reference data. This means you will always get consistently formatted results, even if there wasn’t at point of data input.
How long will my address cleansing take?
The time taken to run any address data cleanse obviously depends on the number of records in the file being cleansed, but it also depends on the quality of the data inputted. You can, however, expect your file to be cleansed at a rate of up to 100 cleanses per second, or 350,000 records an hour.
Do you overwrite my existing addresses with the results of a batch cleanse?
The contact addresses used for batch cleansing will be under your control, most clients simply append the matching addresses returned into columns to the right of the addresses you currently hold. You can then simply choose which parts of your addresses you want to replace with those returned by our Address Toolkit API.
What do the rankings returned by the DQ Toolkit API?
Each address processed has a return code to indicate the degree of match success, this ranges from 100% accurate, corrected with minor mistakes, ambiguous results (multiple matches possible with interactive review) or no match found.
How often is the UK address data updated?
The Postcode Address File (PAF)®/ data is owned and maintained by Royal Mail. Daily updates (around 5000 each day) are made to ensure the information you receive is as accurate as possible.
Why would I want to use this service?
Using the Address Toolkit to validate and capture address details provides you with three main advantages. Firstly, it speeds up the process of entering addresses significantly for your operatives or customers because it cuts keystrokes by around 80%. Secondly, it makes sure that addresses in your database are in a standardised, consistent format, reducing the likelihood of duplicates. Thirdly, your database is populated with accurate addresses, so your deliveries and invoices will arrive at the correct letterbox.
Can I find a postcode from an address, rather than the other way around?
Yes, you may capture full UK addresses from a postcode, or to search for a postcode/full address using part of the address. The service can be used in a website or integrated into any desktop application and is simple to set up and use.
How do I pay for the service?
Licensing for the Royal Mail’s PAF® file depends on whether you’re using it within a website or an internal application. Information to help you find the licence to suit your requirements. Please discuss with a sales consultant so you get the solution you need.
Can I have street level looks only?
Yes, we offer both premise and street-level address look-ups. Premise-level address look-ups will return everything including the house name or number. The “street level” option will not return the house name or number.
How do I standardize and format my addresses and parse to street, city, state, etc?
Using our Address Verification Tools address data may be corrected in up to 230 countries
