13.5: Analyzing qualitative data

Last updated
Save as PDF

Page ID: 25675

\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

\( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)

( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)

\( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

\( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)

\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

\( \newcommand{\Span}{\mathrm{span}}\)

\( \newcommand{\id}{\mathrm{id}}\)

\( \newcommand{\Span}{\mathrm{span}}\)

\( \newcommand{\kernel}{\mathrm{null}\,}\)

\( \newcommand{\range}{\mathrm{range}\,}\)

\( \newcommand{\RealPart}{\mathrm{Re}}\)

\( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

\( \newcommand{\Argument}{\mathrm{Arg}}\)

\( \newcommand{\norm}[1]{\| #1 \|}\)

\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

\( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)

\( \newcommand{\vectorA}[1]{\vec{#1}} % arrow\)

\( \newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow\)

\( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

\( \newcommand{\vectorC}[1]{\textbf{#1}} \)

\( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)

\( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)

\( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)

\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

\(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)

Learning Objectives

Describe how to transcribe qualitative data
Identify and describe the two types of coding in qualitative research

Analysis of qualitative data typically begins with a set of transcripts of the interviews or focus groups conducted. Obtaining these transcripts requires having either taken exceptionally good notes or, preferably, having recorded the interview or focus group and then transcribed it. Transcribing audio recordings is usually the first step toward analyzing qualitative data. Researchers create a complete, written copy, or transcript, of the recording by playing it back and typing in each word that is spoken, noting who spoke which words. In general, it is best to aim for a verbatim transcript, one that reports word for word exactly what was said in the recording. If possible, it is also best to include nonverbals in a transcript. Gestures made by participants should be noted, as should the tone of voice and notes about when, where, and how spoken words may have been emphasized by participants. Because these are difficult to capture via audio, it is important to have a note-taker in focus groups and to write useful field notes during interviews.

If you have the time (or if you lack the resources to hire others), I think it is best to transcribe your qualitative data yourself. I never cease to be amazed by the things I recall from an interview or focus group when I transcribe it myself. If the researcher who conducted the interview or focus group transcribes it herself, that person will also be able to make a note of nonverbal behaviors and interactions that may be relevant to analysis but that could not be picked up by audio recording. Participants might roll their eyes, wipe tears from their face, and even make obscene gestures. These nonverbals speak volumes about participants’ feelings. Unless you write them down in your field notes or include them in your transcript, those details cannot inform your analysis.

The goal of qualitative data analysis is to reach some inferences, lessons, or conclusions by condensing large amounts of data into relatively smaller, more manageable bits of understandable information. Analysis of qualitative data often works inductively (Glaser & Strauss, 1967; Charmaz, 2006). ^[1] To move from the specific observations a researcher collects to identifying patterns across those observations, qualitative researchers will often begin by reading through transcripts and trying to identify codes. A code is a shorthand representation of some more complex set of issues or ideas. In this usage, the word code is a noun. But it can also be a verb. The process of identifying codes in one’s qualitative data is often referred to as coding. Coding involves identifying themes across qualitative data by reading and rereading (and rereading again) transcripts until the researcher has a clear idea about what themes emerge.

Qualitative researcher and textbook author Kristin Esterberg (2002) ^[2] describes coding as a multistage process. Esterberg suggests that there are two types of coding: open coding and focused coding. To analyze qualitative data, one can begin by open coding transcripts. This means that you read through each transcript, line by line, and make a note of whatever categories or themes jump out to you. At this stage, it is important that you not let your original research question or tentative hypotheses cloud your ability to see categories or themes. It’s called open coding for a reason—keep an open mind. You may have even noted some ideas for coding in your field notes or journal entries.

Open coding will probably require multiple rounds. That is, you will read through all of your transcripts multiple times. As you do, it is likely that you’ll begin to see some commonalities across the categories or themes that you’ve jotted down. Once you have completed a few passes and started noticing commonalities, you might begin focused coding. Focused coding is a multistage process. First, collapse or narrow down themes and categories identified in open coding by reading through the notes you made while conducting open coding. Identify themes or categories that seem to be related, perhaps merging some. Once you come up with a final list of codes, make sure each one has a definition that clearly spells out what the code means. Finally, you recode the dataset using the final list of codes, making sure to apply the definition of the code consistently throughout each transcript.

Defining codes is a way of making meaning of your data and of developing a way to talk about your findings. Researchers must ensure that codes are applied in a uniform way in the entire data set during focused coding. In open coding, new codes and shifts in definitions for codes are common. The researcher should keep an open mind and allow the definitions of codes to emerge from reading (and re-reading) the data. However, once focused coding begins, the definitions should not change for any reason. Any deviation will make the data analysis less trustworthy. If there are pieces of data that do not fit with your definition, then it is important to note those deviant cases in your final report.

Using multiple researchers to code the same dataset can be quite helpful. You may miss something a participant said that another coder catches. Similarly, you may shift your understanding of what a code means and not realize it until another coder asks you about it. If multiple researchers are coding the dataset simultaneously, researchers must come to a consensus about the meaning of each code and ensure that codes are applied consistently by each researcher. We discussed this previously in Chapter 9 as inter-rater reliability. Even if only one person will code the dataset, it is important to work with other researchers. If other researchers have the time, you may be able to have them check your work for trustworthiness and authenticity. We discussed these standards for methodological rigor in Chapter 9. Remember, in qualitative data analysis, the researcher is the measurement instrument, determining what is true, what is connected to what, and what it all means.

As tedious and laborious as it might seem to read through hundreds of pages of transcripts multiple times, sometimes getting started with the coding process is actually the hardest part. If you find yourself struggling to identify themes at the open coding stage, ask yourself some questions about your data. The answers should give you a clue about what sorts of themes or categories you are reading. In their text on analyzing qualitative data, Lofland and Lofland (1995) ^[3] identify a set of questions I find very useful when coding qualitative data. They suggest asking the following:

Of what topic, unit, or aspect is this an instance?
What question about a topic does this item of data suggest?
What sort of answer to a question about a topic does this item of data suggest (i.e., what proposition is suggested)?

Asking yourself these questions about the passages of data that you’re reading can help you begin to identify and name potential themes and categories.

Still feeling uncertain about how this process works? Sometimes it helps to see how qualitative data translate into codes. In Table 13.2, I present two codes that emerged from an inductive analysis of transcripts from interviews with child-free adults. I also include a brief description of each code and a few (of many) interview excerpts from which each code was developed.

Table 13.2 Interview coding example
Code	Code definition	Interview excerpts
Reify gender	Participants reinforce heteronormative ideals in two ways: (a) by calling up stereotypical images of gender and family and (b) by citing their own “failure” to achieve those ideals.	“The woman is more involved with taking care of the child. [As a woman] I’d be the one waking up more often to feed the baby and more involved in the personal care of the child, much more involved. I would have more responsibilities than my partner. I know I would feel that burden more than if I were a man.”
		“I don’t have that maternal instinct.”
		“I look at all my high school friends on Facebook, and I’m the only one who isn’t married and doesn’t have kids. I question myself, like if there’s something wrong with me that I don’t have that.”
		“I feel badly that I’m not providing my parents with grandchildren.”
Resist Gender	Participants resist gender norms in two ways: (a) by pushing back against negative social responses and (b) by redefining family for themselves in a way that challenges normative notions of family.	“Am I less of a woman because I don’t have kids? I don’t think so!”
		“I think if they’re gonna put their thoughts on me, I’m putting it back on them. When they tell me, ‘Oh, Janet, you won’t have lived until you’ve had children. It’s the most fulfilling thing a woman can do!’ then I just name off the 10 fulfilling things I did in the past week that they didn’t get to do because they have kids.”
		“Family is the group of people that you want to be with. That’s it.”

As you might imagine, wading through all these data is quite a process. Just as quantitative researchers rely on the assistance of special computer programs designed to help with sorting through and analyzing their data, so too do qualitative researchers. Where quantitative researchers have SPSS and MicroCase (and many others), qualitative researchers have programs such as NVivo (http://www.qsrinternational.com) and Atlas.ti (http://www.atlasti.com). These are programs specifically designed to assist qualitative researchers with organizing, managing, sorting, and analyzing large amounts of qualitative data. The programs work by allowing researchers to import transcripts contained in an electronic file and then label or code passages, cut and paste passages, search for various words or phrases, and organize complex interrelationships among passages and codes. They even include advanced features that allow researchers to code multimedia files, visualize relationships between a network of codes, and count the number of times a code was applied. Having completed a handwritten coding process as part of a class project with a rather old-school professor, I’m happy I can use qualitative data analysis software to save myself time and hassle.

To summarize, the following excerpt, from my paper analyzing the implementation of self-directed supports for individuals with intellectual and developmental disabilities summarizes how the process of analyzing qualitative data can work:

Transcribed interviews were analyzed using Atlas.ti 7.5 (2014) qualitative data analysis software, a commonly used program in qualitative social science. The researchers approached data analysis from an inductive perspective, allowing themes to emerge from the data. As described by Braun and Clarke (2006), the thematic analysis proceeded along six sequential phases: (a) familiarizing with the data set, (b) generating initial codes, (c) searching for themes, (d) reviewing themes, (e) defining and naming themes, (f) and reporting data. One member of the research team conducted the coding and thematic analysis, consulting with a peer reviewer at the end of each of the three passes of coding and the entire research team after the coding process was complete. The peer reviewer reviewed each phase of coding for consistency, and worked with the primary coder to identify, review, and name themes. At the end of coding, the entire research team reviewed the themes and established a shared meaning that best reflected the narratives of participants, based on a series of dialogues. The themes were organized into a thematic map which was refined through consultation with the research team to ensure homogeneity within each theme and heterogeneity between themes. The analysis contained within this paper used co-occurrence counts as a guideline for the prevalence of themes within the data set. Thus, the analysis is limited to the most prevalent themes that answer each research question, while attending to exceptional or divergent cases. Methodological journaling related to coding and peer review helped to ensure the dependability, confirmability, and trustworthiness of the final research product (DeCarlo, Bogenschutz, Hall-Lande, & Hewitt, in press). ^[4]

Key Takeaways

Open coding involves allowing codes to emerge from the dataset.
Codes must be clearly defined before focused coding can begin, so the researcher applies them in the same way to each unit of data.
NVivo and Atlas.ti are computer programs that qualitative researchers use to help with organizing, sorting, and analyzing their data.

Glossary

Code- a shorthand representation of some more complex set of issues or ideas
Coding- identifying themes across qualitative data by reading transcripts
Focused coding- collapsing or narrowing down codes, defining codes, and recoding each transcript using a final code list
Open coding- reading through each transcript, line by line, and make a note of whatever categories or themes seem to jump out to you
Transcript- a complete, written copy of the recorded interview or focus group containing each word that is spoken on the recording, noting who spoke which words

Image attributions

Compact Cassette by Petr Kvashin CC-0

concept of learning by unknown CC-0

If you would like to learn more about inductive qualitative data analysis, I recommend two titles: Glaser, B. G., & Strauss, A. L. (1967). The discovery of grounded theory: Strategies for qualitative research. Chicago, IL: Aldine; Charmaz, K. (2006). Constructing grounded theory:A practical guide through qualitative analysis. Thousand Oaks, CA: Sage. ↵
Esterberg, K. G. (2002). Qualitative methods in social research. Boston, MA: McGraw-Hill. ↵
Lofland, J., & Lofland, L. H. (1995). Analyzing social settings: A guide to qualitative observation and analysis (3rd ed.) Belmont, CA: Wadsworth. ↵
DeCarlo, M., Bogenschutz, M., Hall-Lane, J., & Hewitt, A. (in press). Implementation of self-directed supports for individuals with intellectual and developmental disabilities in the United States. Journal of disability policy studies.↵

Search

Text Color

Text Size

Margin Size

Font Type