Article:

Datum et veritas - Having Faith In Your Data

by DataStar, Inc.
 

Simply put, bad information begets bad decisions. Using inaccurate data as an information source puts you at risk for making the wrong decision.

The importance of data verification is fundamental to all business and scientific practices, and has been the subject of many conferences and discussion groups. Yes Virginia, paper surveys are still alive and verification practices are an integral part of assuring high quality data capture.

 

Let's start by defining some terms:

  • To "Verify" - verb: to ascertain the truth or correctness of, as by examination, research, or comparison (Dictionary.com)
  • "Data verification" is a process wherein the data is checked for accuracy and inconsistencies after data migration is done. (Wikipedia)

When potential clients inquire about DataStar's data entry services, we often find ourselves saying "we are all about the quality of the data." What exactly does that mean? How do we get behind the quality of the data that we are contracted to enter?

Clients will often want some reassurances about data quality. Not too long ago we received this email: "I wanted to talk with you about the auditing process for data entry. How do you go back to check that data is entered correctly, etc.?" Our answer: for manual entry of paper surveys, we use a process known as "100% keyverification." Simply translated, this is the act of re-keying or entering a data set a second time, completely, as it was done the first time, but with a tool that "verifies" if the second pass of entry matches the first.

Like many data entry providers, we use professional data entry applications which include verification steps. These tools allow one operator to enter a batch of documents following a screen map previously programmed according to a desired record layout.

When the batch has been completely entered, a second operator (the "verifier") pulls up the first operator's file, which is "masked," or hidden from view. The verifier keys the batch again following the screen map. Every time he/she hits a key that does not match what the first operator entered, the program alerts the operator and stops. The verifier must then decide which response is correct by examining the source document. The change is then made and the verifier continues keying the rest of the document.

Upon completion, a series of statistics can be run detailing the number of characters keyed per hour, the number of corrections made in the verifying step and more. This information becomes part of ongoing quality control efforts.

It is important to point out that 100% key-verified does NOT mean 100% perfect. It is generally considered to yield an average of 99.96% accuracy. Un-verified data entry yields 96% accuracy (when keyed by professional operators).

Not too much difference you say? Let's do the math. If you have 200 surveys with 50 data columns (strokes or characters), this requires a grand total of 10,000 keystrokes to be entered. When the data is 100% verified you might expect to see about four operator errors at 99.96% accuracy. Compare that to un-verified data entry at 96% accuracy, which results in a whopping 400 mistakes!

Buyers beware of data entry providers who claim to do key-verifying or double punching. Sometimes it is not 100%, but rather portions of the data set (i.e. 20%) or just selected fields which are double keyed. Always ask if all of the data will be verified or double keyed.

Keying directly into a program such as Microsoft Excel, unless the operator is extremely careful and efficient, is not going to yield high quality and cost effective results. While it is possible to program in some checks and balances, spreadsheet applications do not have built-in verification features.

Some companies may achieve 100% verification by keying two separate files for the data set and then matching the two files. This process shows every discrepancy. It is then up to somebody to pull and review the source documents in order to determine which answer is correct. This is a very time consuming process, with no improvement in accuracy over keyverification.

Regardless of the method used, a solid verification process is essential to ensure a high quality dataset for further analysis. To paraphrase Martha Stewart, "Good data is a good thing."

 

This content was provided by DataStar, Inc. Visit their website at www.surveystar.com.

 

 

Other content shared by DataStar, Inc.



Article
Get the Most From Your Employee Research

by DataStar, Inc.

Get the Most From Your Employee ResearchSuccessful companies often share a strong commitment to ongoing efforts to measure employee engagement and satisfaction. In this article, we discuss some ways to boost participation rates and ensure the success of your employee survey programs. Read Article »

Article
Quality vs. Quantity - A Look At The Options For Data Capture

by DataStar, Inc.

Quality vs. Quantity - A Look At The Options For Data CaptureWe are often asked about the pros and cons of scanning data as opposed to traditional data entry. If the quantity of survey forms and the schedule demand it, using an advanced data capture system can make sense, but in the majority of situations we prefer to employ manual data capture methods. Read Article »

Article
What Every Researcher Should Know About Statistical Significance

by DataStar, Inc.

What Every Researcher Should Know About Statistical SignificanceSurvey researchers use significance testing as an aid in expressing the reliability of survey results. We use phrases such as "significantly different," "margin of error," and "confidence levels" to help describe and make comparisons when analyzing data. The purpose of this article is to foster a better understanding of the underlying principles behind the statistics. Read Article »

White Paper
Anatomy of a Crosstab

by DataStar, Inc.

Anatomy of a CrosstabAre you new to market research or do you need a refresher course in basic survey tabulation principles? The following guide will help you better understand how to read and interpret the results of your survey project. Read Article »

Article
Survey Tabulation Basics: Statistics

by DataStar, Inc.

Survey Tabulation Basics: StatisticsThis article provides a summary of the basic descriptive statistics typically shown in crosstabs and other survey data analyses. Included are the basic measures with which all researchers should be comfortable. Read Article »

Article
How to Interpret Standard Deviation and Standard Error in Survey Research

by DataStar, Inc.

How to Interpret Standard Deviation and Standard Error in Survey ResearchStandard Deviation and Standard Error are perhaps the two least understood statistics commonly shown in data tables. The following article is intended to explain their meaning and provide additional insight on how they are used in data analysis. Read Article »