Information: Statistical Services Centre (The University of Reading)

Approaches to the Analysis of Survey Data

Release date: March 2001

This is one of a series of guides for research and support staff involved in natural resources projects. The subject-matter here is approaches to the analysis of survey data. Other guides give information on allied topics. Your comments on any aspect of the guides would be welcomed.

Part 1: Preparing for the Analysis
1.1. Introduction
1.2. Data Types
1.3. Data Structure
1.4. Stages of Analysis
1.5. Population Description as the Major Objective
1.6. Comparison as the Major Objective
1.7. When Weighting Matters
1.8. Coding
1.9. Ranking & Scoring

Part 2: Doing the Analysis
2.1. Approaches
2.2. One-Way Tables
2.3. Cross-Tabulation: Two-Way & Higher-Way Tables
2.4. Tabulation & the Assessment of Accuracy
2.5. Multiple Response Data
2.6. Profiles
2.7. Looking for Respondent Groups
2.8. Indicators
2.9. Validity
2.10. Summary
2.11. Next Steps

Part 1: Preparing for the Analysis

1.1 Introduction

This guide is concerned with some fundamental ideas of analysis of data from surveys. The discussion is at a statistically simple level; other more sophisticated statistical approaches are outlined in our guide Modern Methods of Analysis. Our aim here is to clarify the ideas that successful data analysts usually need to consider to complete a survey analysis task purposefully.

An ill-thought-out analysis process can produce incompatible outputs and many results that never get discussed or used. It can overlook key findings and fail to pull out the subsets of the sample where clear findings are evident. Our brief discussion is intended to assist the research team in working systematically; it is no substitute for clear-sighted and thorough work by researchers. We do not aim to show a totally naïve analyst exactly how to tackle a particular set of survey data. However, we believe that where readers can undertake basic survey analysis, our recommendations will help and encourage them to do so better.

Part 1 outlines a series of themes, after an introductory example. Different data types are distinguished in Section 1.2. Section 1.3 looks at data structures; simple if there is one type of sampling unit involved, and hierarchical with e.g. communities, households and individuals. In Section 1.4 we separate out three stages of survey data handling - exploration, analysis and archiving - which help to define expectations and procedures for different parts of the overall process. We contrast the research objectives of description or estimation (Section 1.5), and of comparison (Section 1.6) and what these imply for analysis. Section 1.7 considers when results should be weighted to represent the population - depending on the extent to which a numerical value is or is not central to the interpretation of survey results. In Section 1.8 we outline the coding of non-numerical responses. The use of ranked data is discussed in brief in Section 1.9.

In Part 2 we look at the ways in which researchers usually analyse survey data. We focus primarily on tabular methods, for reasons explained in Section 2.1. Simple one-way tables are often useful as explained in Section 2.2. Cross-tabulations (Section 2.3) can take many forms and we need to think which are appropriate. Section 2.4 discusses issues about 'accuracy' in relation to two- and multi-way tables. In Section 2.5 we briefly discuss what to do when several responses can be selected in response to one question.

Cross-tabulations can look at many respondents, but only at a small number of questions, and we discuss profiling in Section 2.6, cluster analysis in Section 2.7, and indicators in Sections 2.8 and 2.9.

29 × 5	+ 243 × 4	+ 117 × 3	+ 86 × 2	+ 25 × 1	= 3.33
--------------------------------------------------------
29	+ 243	+ 117	+ 86	+ 25

Christian	Hindu	Muslim	Sikh	Other
29	243	117	86	25

Hindu	Non-Hindu
243	257

Excellent	Good	Moderate	Poor	Very Bad
29	243	117	86	25

Excellent	Good	Moderate	Poor	Very Bad
5	4	3	2	1

Statistical Good Practice Guidelines

Approaches to the Analysis of Survey Data

Contents

1.1 Introduction