White Paper:

Survey of Analysis Methods Part II

by Rajan Sambandam, TRC
 

This is Part II of a series looking at aspects of practical marketing research: identifying key drivers and developing segments (see Part I). This content describes specific segmentation methods: cluster analysis, neural networks, self-organizing map (SOM), and mixture models. Included is a discussion on ideas for developing good segments.

 

Segmentation analysis has been a part of marketing research for decades. It continues to be useful in a variety of different situations, even when the primary objective of the study is not segmentation. Since segmentation divides the data into comparatively homogenous groups, marketing efforts such as targeting, positioning, retention and product development can be more efficiently performed. While the value of segmentation analysis is rarely questioned, the methods of developing segments have always given rise to considerable debate.

One of the simplest ways of segmenting the data is basic crosstabulation analysis. Respondents can be divided into, say, age or income groups and their differences studied across a variety of questions. This approach of pre-defining the respondent is often referred to as a priori segmentation.

Use of a priori segments, while attractive, is often not sufficient given the need to obtain complex segments based on multiple variables. Therefore, most of the time segments need to be developed after data have been collected. In this article we will consider various segmentation methods, both traditional and recent, that can be used to address marketing research problems. The three methods we will consider are: cluster analysis, neural networks and mixture models.

It should be noted at the outset that regardless of the method used for analysis, the quality of the segmentation scheme is determined by its usefulness to the manager. Even if the statistics indicate that a particular solution is the best one, if it is not useful to the manager then the segmentation analysis should be seen as a failure. This condition is not as harsh as it seems, because not only are many different solutions possible with a given set of variables, but changing the variable set can lead to more solutions. Further, using different analytical methods can also provide new solutions. Finally, there is also the option of dividing some of the segments obtained into sub-segments, if that would make them more actionable.

Next, we will look at each of the segmentation methods mentioned above and how they work. This will be followed by a discussion on ideas for developing good segments.

 

Cluster analysis

Cluster analysis is the traditional method used for segmentation in marketing research. This is actually a family of methods that subsumes many variations and can be broadly classified under two distinct groups: hierarchical and non-hierarchical (or partitioning) methods.

Hierarchical clustering includes methods where the basic idea is to start with each observation as one cluster. Each observation is located on an n-dimensional space where n is the number of attributes used in the analysis. The distances between observations are measured using some form of distance metric such as Euclidean distance. Based on these distances, observations that are closest to one another are joined together to form a new cluster. This process continues until all observations have been merged into a single cluster. The optimal number of clusters can be determined by looking at standard measures of fit (statistics such as the cubic clustering criterion, pseudo-f and pseudo-t2) provided for each cluster solution.

Conversely, it is possible to start with all observations together as one cluster and work backwards until each observation becomes a cluster by itself. With both variants of the hierarchical method, the analyst will have to study the results of the analysis to determine the appropriate number of clusters.

In the non-hierarchical methods (such as k-means clustering), random observations are chosen as seeds (or cluster centers) for a pre-specified number of clusters. Thus, the initial ordering of the data can dictate the formation of clusters. Observations that are closest to a particular seed are assigned to that seed, thus giving rise to clusters. The analyst then obtains the fit statistics for a variety of solutions in order to determine the optimal number of clusters.

Choosing the appropriate number of clusters is never easy even with data sets that are reasonably well behaved. In commonly used methods like k-means clustering, the analyst needs to specify the number of clusters desired. This can be problematic, because the algorithm will assign observations to clusters regardless of whether there are bona fide segments in the data. The fit statistics that indicate the optimal number of clusters are often unclear. Sometimes the optimal number of clusters may not make operational sense. In such cases actionability should be considered before deciding on the optimal number of clusters. Hence, the process of developing segments from data using cluster analysis has a high interpretive content.

 

Neural networks

Artificial neural networks are a recent addition to the variety of techniques used for data analysis. There are two basic types of neural networks: supervised learning and unsupervised learning networks. Supervised learning networks can be used in place of traditional methods like regression and discriminant analysis and were discussed in the previous article in this series. Unsupervised learning networks are the subject of our discussion here.

Unsupervised learning networks are generally used when there are no clear distinctions between dependent and independent variables in the data and when pattern or structure recognition is required. Since pattern recognition is really what is needed in segmentation analysis, unsupervised neural networks can be used for this purpose. The type of unsupervised learning network most appropriate for the problem of segmentation is the self-organizing map (SOM) developed by Teuvo Kohonen.

 

Self-organizing map

A typical SOM consists of an input layer and a grid-like structure known as the Kohonen layer. The input layer contains the variables that are going to be used in the analysis, while the Kohonen layer is a grid of processing elements. Each of the variables in the input layer is connected to each of the processing elements in the Kohonen layer. These connections have random starting weights attached to them before the start of the analysis.

When the information from the first respondent is presented to the network, the processing elements “compete” with each other. By mathematically combining the first respondent’s score on each input variable with the weight of each connection, the processing element with the “winning” score can be determined. Winning implies that this particular processing element is the one that most closely resembles the input scores of the respondent. This processing element is called the “winner.” The weights associated with the winner will then be adjusted to more closely resemble the respondent. The network can be thought of as learning the response pattern of the respondent.

Not only are the weights associated with the winning processing element changed, but the weights of the neighboring processing elements are also changed. In other words an area of the grid is learning the response tendencies of the respondent.

When the second respondent’s data are presented to the network the process is repeated. If the second respondent is similar to the first, then a processing element from the same area of the grid wins. Whether it is the same processing element as the last time will depend on whether the second respondent is exactly similar to the first one. If the second respondent is very different, then a processing element in a different part of the network will win.

At the end of this process the grid will show a two-dimensional representation of the data with different segments showing up as different neighborhoods on the map. Because of the iterative process described above, substantial segments cannot be formed around outliers. This is a clear advantage this method enjoys over traditional k-means cluster analysis.

SOMs also have an advantage in that they were initially developed as not just a data-reduction tool, but also as a data visualization tool. This capability allows the SOM to provide a more intuitive understanding of the relationship between the variables and the segments, hence making the process of developing segments easier. However, some experts feel the reduction of a multidimensional problem to a two-dimensional space for visualization can actually be a disadvantage because of the constraints it may impose on the segmenting process. A further disadvantage in the case of large datasets is the amount of time required to run the analysis as compared to k-means cluster analysis.

 

Mixture models

This is another broad category of segmentation methods. The basic idea linking methods in this category is that the data contain many distributions or segments which are mixed together. The task of the analysis then becomes one of unmixing the distributions and for this reason they are also called unmixing models.

One of the major differences between the cluster methods described previously and mixture models is the prior specification of the number of segments in the data. In non-hierarchical cluster analysis we have to explicitly specify the number of clusters in the data. In hierarchical cluster analysis the results are presented for every possible cluster solution (with the limit being each observation treated as a cluster), thus effectively making the analyst choose the optimal number of clusters. In mixture models, the assumption of underlying distributions allows the use of optimization approaches that can automatically identify the number of segments (distributions) in the data.

Another variation of the mixture model approach to segmentation is known as latent segmentation analysis. While it belongs to the mixture model family, it has some advantages that might be very useful in a marketing research context. For example, latent segmentation analysis makes it possible to simultaneously conduct a segmentation and key driver analysis, where each segment can have its own unique key driver analysis. Thus if a manager is interested in not just identifying segments but also understanding the key drivers of, say, satisfaction within each segment, this would be an appropriate method to use. This process is more efficient than running a segmentation analysis first, followed by separate key driver runs for each segment.

While mixture models can be very useful in creating segments, they also have some disadvantages. The primary disadvantage is with the large amount of time required to run the analysis, especially when compared to k-means cluster analysis. There are also other disadvantages such as sensitivity to the presence of outliers.

 

More than one method

While different types of approaches to segmentation analysis have been discussed here it is not clear that there is one approach that is the best in every situation. Segmentation analysis often involves trying more than one method to obtain the best result. The main reason for this is that unlike key driver analysis, segmentation analysis is quite unstructured. The final solution depends on the number and nature of variables included in the analysis. Changing even one variable can have a strong impact on the results. Without seeing the results, however, it is hard to identify the variables that can be useful in the analysis. This type of circular problem implies that the most important step in a segmentation analysis is the choice of variables to use. The more thought we put into selecting the variables, the more likely it is that the results will useful.

There are a few other steps that can be taken (with any of the methods described here) to increase the chances of developing good segments. These are:

  • eliminating outliers;
  • using as few input variables as possible; and
  • using input variables with low correlation between them.

 

Eliminating outliers not only ensures that segments don’t center on them, they also result in tighter, better-defined segments. Using as few input variables as possible is hard to do, but very important for deriving useful and timely solutions. Beyond the fact that irrelevant variables can sabotage the analysis, using too many variables complicates the analysis, leading to solutions that are not useful. One way of reducing the number of input variables is to remove those that are highly correlated with other input variables. Further, since segmentation methods don’t work as well when there is a collinearity problem in the input variable set, it makes sense to eliminate collinearity as much as possible.

 

This content was provided by TRC. Visit their website at www.trchome.com.

 

 

Other content shared by TRC



White Paper
Better Questions For Segmentation: Use of MAX-DIFF

by Rajan Sambandam, TRC

Better Questions For Segmentation: Use of MAX-DIFFUsing Maximum Difference Scaling as a method in designing surveys may ensure more useful results in your market research. It is a comparative method based on importance that sidesteps the problems associated with traditional importance scales. TRC explains the mechanics behind this method through a detailed example in this white paper. Read Article »

White Paper
Database Scoring with Object Based Segmentation

by Rajan Sambandam, TRC

Database Scoring with Object Based SegmentationSegmentation created from company databases are often lacking the rich segmentation schemes formed by attitudinal surveys. A new approach is Object based segmentation that uses database variables at the basis for forming attitudinal segments, leaving both markets classifiable with clear demographic segments. TRC compares traditional segmentation analysis with Object based. Read Article »

White Paper
Asymmetry in Product Features: Use of the Kano Method

by Rajan Sambandam, TRC

Asymmetry in Product Features: Use of the Kano MethodThe presence or absence of product features strongly affect consumer satisfaction with the design. Comparing these features using asymmetry analysis can help identify satisfiers and dissatisfiers from among the features of a product. The Kano method is similar but results in categorizing each respondent's answers. TRC presents this essential method of deciding new product features in detail. Read Article »

White Paper
Conjoint Analysis versus Self-Explicated Method: A Comparison

by Rajan Sambandam, TRC

Conjoint Analysis versus Self-Explicated Method: A ComparisonDetermining feature importance in a product can be divided into two techniques - top-down methods where a customer evaluates the whole product at once, and bottum-up methods where features are evaluated individually or in sets. The former method, Conjoint Analysis, is more common while the latter method, Self-Explicated Method, is not widely used but has practical advantages. TRC compares the two methods in this white paper. Read Article »

White Paper
Product Configurator

by Rajan Sambandam, TRC

Product ConfiguratorTo help customers purchase the right product, companies often use product configurators - tools that let customers design their purchase before ordering. This method is employed as a market research technique, similar to conjoint analysis but without some of the constrictions. This white paper from TRC explains an appropriate use of the product configurator method. Read Article »

Case Study
Market Segmentation: One Method, Four Examples

by Rajan Sambandam, TRC

Market Segmentation: One Method, Four ExamplesEffective market segmentation requires an understanding of the market and the skilled art of finding the appropriate segments. TRC gives four examples of this method's application with results. Read Article »

White Paper
How to Measure the Value of a Brand

by Rajan Sambandam, TRC

How to Measure the Value of a BrandBrand name evokes an inherent value; finding a way to reliably measure that value is crucial in determining product development. A technique called discrete choice conjoint analysis is described in this paper by TRC. Read Article »

White Paper
Deriving Value from Research: the Use of Conjoint Analysis for Product Development

by Rajan Sambandam, TRC

Marketing research has been used by firms over the last several decades to provide information for decision making. Over time, increasingly sophisticated statistical methods have been developed and deployed in the service of this goal. This article focuses on one such method - conjoint analysis - and its application to product development. Read Article »

White Paper
Cluster Analysis Gets Complicated

by Rajan Sambandam, TRC

Cluster Analysis Gets ComplicatedSegmentation studies using cluster analysis have become commonplace. However, the data may be affected by collinearity, which can have a strong impact and affect the results of the analysis unless addressed. This article investigates what level presents a problem, why it's a problem, and how to get around it. Simulated data allows a clear demonstration of the issue without clouding it with extraneous factors. Read Article »

White Paper
Identifying Feature Importance: A Comparison of Methods

by TRC

Identifying Feature Importance: A Comparison of MethodsUnderstanding what customers want is fundamental to the new product development process as well as to the process of keeping existing products fresh and relevant. To be successful in this area we need to be able to correctly identify what features are important to consumers. Feature importance can be measured using a variety of methods of differing effectiveness. In this paper we will deal with the following methods: Importance Scales, Pick data, Pairwise Comparisons, and Max-Diff. Read Article »

White Paper
Monadic Price Testing vs. Price Laddering

by TRC

Compares two popular pricing methods to understand the difference in take rate information. Read Article »

White Paper
New Product Development: Stages and Methods

by Rajan Sambandam, TRC

New Product Development: Stages and MethodsTRC identifies the best methods for each stage of the product development process, from Idea Generation through Feature Development, Product Development and Product Testing. Read Article »

White Paper
Understand Choice in Banking: Use of Discrete Choice Conjoint Analysis

by TRC

Conjoint analysis provides incentive for survey respondents to determine which features must not be omitted in their final purchase. The method closely mirrors decision-making in the real world, and as shown by TRC in this white paper, is applicable to many situations including how customers choose their bank. Read Article »

White Paper
Want better product ideas? Try smart incentives

by Rajan Samandam, TRC

Want better product ideas? Try smart incentivesIdea generation from survey respondents is strongly dependent on incentive. Introducing competition strengthens the quantity and quality of creative responses. TRC provides examples of smart incentives in this white paper. Read Article »

White Paper
An alternative method of reporting customer satisfaction scores

by Rajan Sambandam and George Hausser of TRC

An alternative method of reporting customer satisfaction scores Though customer satisfaction evaluations are widely used, reporting of these scores has varied from one study to another. This is likely the result of each method’s advantages and disadvantages, as well as the personal preferences and habits of the researcher. In this article we review various reporting methods and outline our method with an example. Read Article »

Service
Identifying the Key Drivers of Brand Image

by TRC

Measuring brand image requires looking at direct effects as well as indirect effects of a company's performance. TRC compares traditional multiple regression with SatiscanTM, a method that can review all possible path models. Read Article »

Case Study
Improving Call Satisfaction: A Case Study

by TRC

Improving Call Satisfaction: A Case StudyTRC presents a case study of analyzing and improving a call center as an on-going data collection process. Read Article »

Case Study
Improving Claim Satisfaction: A Case Study

by TRC

A case study on applying full-service market research to help an insurance company improve their client satisfaction with claim handling. Read Article »

White Paper
Non-Response Bias In Survey Sampling

by TRC

Non-Response Bias In Survey SamplingMarket research accounts for many scenarios to ensure high quality of data. One of the most overlooked problems is non-response bias. TRC describes ways to reduce its effects through survey design and data adjustment in this white paper. Read Article »

White Paper
Segmentation Success

by Michael Sosnowski, TRC

Segmentation SuccessThis paper explains the basic building blocks of the segmentation process and its implementation. Read Article »

White Paper
Survey of Analysis Methods Part I

by Rajan Sambandam, TRC

Survey of Analysis Methods Part IPractical marketing research deals with two major problems: identifying key drivers and developing segments. In this two-part series TRC looks at key driver analysis and segmentation. Read Article »

Service
Validating Satiscan Using A Split Sample Approach

by TRC

TRC's SatiscanTM model is tested for validity using call center data and a split sample approach. This shows that SatiscanTM produces similar models when run on random halves of an energy industry dataset. Read Article »

Service
Satiscan and Regression Analysis: A Comparison

by TRC

The comparison shows the advantages of SatiscanTM, an analytical method from TRC, over regression in identifying the correct and cost efficient action steps. Read Article »

White Paper
TURF: New Methods for Implementation

by Westley Ritz, TRC

TURF: New Methods for ImplementationTURF is a long-established and quite useful marketing research tool, but not everyone is familiar with how it works, or with the latest developments that can make TURF even more effective. The purposes of this paper are twofold: (1) to explain the technique and (2) to describe the latest methods for implementation. Read Article »

White Paper
Product Configuration: A Research Approach for the Times

by Rajan Sambandam & Pankaj Kumar, TRC

The marketplace has shifted in the last decade with the ability of consumers to configure the product they want. This white paper explains the basics of configuration, an approach that mimics the real world of customer driven product design to obtain insight into consumer decision-making. Read Article »

White Paper
Product Configuration: Evidence for Effectiveness

by Rajan Sambandam & Pankaj Kumar, TRC

Product Configuration: Evidence for EffectivenessThis white paper looks at the examples from one product configuration study, the kinds of information that can be derived and the possibilities provided by statistical analysis. Read Article »

Article
New Product Research: A Dynamic Approach to Feature Prioritization

by Pankaj Kumar, Westley Ritz and Rajan Sambandam of TRC

New Product Research: A Dynamic Approach to Feature PrioritizationFeature prioritization is a very common new product research problem. Over the last few years, the most popular technique has been Max-Diff. However, as the number of features increases it becomes difficult to use. Bracket is a tournament-based approach that produces Max-Diff like results and can easily prioritize fifty or more features. Read Article »

Media
Doing More with Less: Getting Greater Value from Mobile Quant

by TRC

Doing More with Less: Getting Greater Value from Mobile QuantWhat “more with less” means with respect to mobile MR, and examples from traditional online studies to challenge existing assumptions about what will and will not work on a mobile device. Read Article »

Media
How to measure the value of a brand?

by TRC

How to measure the value of a brand?Knowing how to price your product that you can optimize your ROI is key. This video explains various ways to measure the value of a brand and talks about a discrete choice conjoint technique as a perfect approach to measuring the value of a brand. Read Article »

Media
Product Configuration with Michael Sosnowski

by TRC

Product Configuration with Michael SosnowskiConsider a person who wants to buy a personal computer. The customer can select exactly the combination desired, subject to a price constraint. Would it be possible to use such a process for research? Read Article »

Media
How to Improve Your Market Segmentation

by TRC

How to Improve Your Market SegmentationBob Hull from TRC talks about a market research technique for market segmentation and ways of improving them. Read Article »

Media
Rich Raquet Market Research Consulting

by TRC

Rich Raquet Market Research ConsultingRich Raquet is introducing TRC, a research & analytics firm, specializing in new product research, conjoint, segmentation, brand equity, sat & loyalty. Read Article »