Skip to main content
Social Sci LibreTexts

6.6: Clausal embedding

  • Page ID
    192630
    • Catherine Anderson, Bronwyn Bjorkman, Derek Denis, Julianne Doner, Margaret Grant, Nathan Sanders, and Ai Taniguchi
    • eCampusOntario

    \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)

    ( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\id}{\mathrm{id}}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\kernel}{\mathrm{null}\,}\)

    \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\)

    \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\)

    \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)

    \( \newcommand{\vectorA}[1]{\vec{#1}}      % arrow\)

    \( \newcommand{\vectorAt}[1]{\vec{\text{#1}}}      % arrow\)

    \( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vectorC}[1]{\textbf{#1}} \)

    \( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)

    \( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)

    \( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)

    \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)

    Recursion: Sentences inside sentences

    So far we’ve talked about the organization of words into constituents in a single clause. Consider the sentence in (1), which we saw before in 6.4 Identifying phrases: Constituency tests:

    (1) The students saw a movie about dinosaurs.

    This sentence has 3 noun phrases: [the students], [dinosaurs], and [a movie about dinosaurs]. The noun phrase [dinosaurs] is inside the bigger constituent [a movie about dinosaurs], and they’re linked together by a preposition about—in fact, [about dinosaurs] is a prepositional phrase. We also have a verb phrase [saw a movie about dinosaurs]—the verb and its object (or its objects, if it is ditransitive), will always be part of the same verb phrase constituent.

    Now consider the sentences in (2) and (3):

    (2) Deniz said something.
    (3) Samnang might leave.

    In (2), the object of the verb said is something; together these form a verb phrase. But now consider a sentence like (4):

    (4) Deniz said that [ Samnang might leave ].

    In (4), the entire clause from (3) appears after the verb said, in the same position that something appeared in (2). Also, if we do constituency tests—for example replacement in (5)—we can see that [said that Samnang might leave] is a verb phrase that can be replaced by do (too).

    (5) Keiko said that Samnang might leave, and Deniz did too.

    What we see here is that the complement of a verb can be a whole clause; in this case we call the clause-inside-a-clause an embedded clause.

    What about the word that? The role of that seems to be to introduce the embedded clause. Words that have this function of introducing an embedded clause belong to the category complementizers (called that because they turn clauses into the complements of verbs).

    Like other categories, complementizers create complementizer phrases (CPs). Here the complementizer phrase is [ that Samnang might leave ]; again, you can identify this constituent with tests.

    (6) It was that Samnang might leave that Deniz said __.

    Just as verbs select how many complements they take, they can also select the category of their complement. Some transitive verbs can combine only with noun phrase objects, some only with prepositional phrases, some only with complementizer phrases—and some with any or all of these.

    For example, the verb know can combine with several different categories of complements:

    (7) They know…
    …this fact. (noun phrase)
    …about birds. (prepositional phrase)
    …that birds can fly. (complementizer phrase)

    Other verbs can only take some of these as complements:

    (8) We ate…
    …curry. (noun phrase)
    *…about curry. (*prepositional phrase)
    *…that curry is for dinner. (*complementizer phrase)
    (9) The teacher said…
    …something. (noun phrase)
    *…about chocolate. (*prepositional phrase)
    …that they like chocolate. (complementizer phrase)
    (10) They talked…
    …mythology. (noun phrase)
    …about mythology. (prepositional phrase)
    *…that mythology is interesting. (*complementizer phrase)

    (Some people might not find They talked mythology. totally grammatical; whether talk can take a noun phrase object is something that has changed over time in English.)

    So far the examples of embedded clauses that we’ve seen are all embedded statements. Is that the only kind of embedded clause that exists in English, or in language in general? Are there any complementizers other than that?

    Take a moment to see if you can think of some other verbs that embed whole clauses, and see if you can identify some element in those sentences that looks like another complementizer. You can do this for English, or for another language that you know.

    Questions inside sentences: Embedded interrogative clauses

    We just saw that the English verb know can combine with several different types of complements (complementizer phrases, noun phrases, and prepositional phrases). It also happens to be able to combine with more than one type of embedded clause. Consider the following examples:

    (11) I know…
    …(that) ghosts exist.
    whether ghosts exist.
    if ghosts exist.

    What we see here is that the verb know can combine not only with clauses introduced by that (or nothing), but also ones introduced by whether or if. Another way to write this would be to use { curly braces } to surround the complementizers allowed after know, as in:

    (12) I know [CP {that, ∅, whether, if} ghosts exist ].

    Not all verbs are equally flexible! Some verbs, like believe, only allow that or ∅, not whether or if:

    (13) a. I believe [CP {that, ∅} ghosts exist ].̱
    b. *I believe [CP {whether, if} ghosts exist ].̱

    Other verbs only allow whether or if as complementizers, like wonder:

    (14) a. *I wonder [CP {that, ∅} ghosts exist ].̱
    b. I wonder [CP {whether, if} ghosts exist ].

    What this tells us is that the difference between that or ∅ on the one hand, and whether or if on the other hand, is something that verbs can be sensitive to when it comes to selection.

    What, then, is the difference between these two sets of complementizers?

    We can see the difference if we look at their use with verbs of quotation, comparing embedded clauses with direct quotation (indicated in English writing by using quotation marks).

    Consider the verb say in (15):

    (15) They said that ghosts exist. = They said: “Ghosts exist.

    The embedded clause introduced by that can directly paraphrase a directly quoted statement.

    Now compare the verb ask in (16):

    (16) They asked if ghosts exist. = They asked: “Do ghosts exist?

    In (16) we see that the embedded clause with if corresponds not to a quoted statement, but to a quoted question! This is the difference between that and ∅ versus if and whether:

    • that and ∅: introduce embedded statements
      • they are [–Q] (non-question) complementizers
    • if and whether: introduce embedded questions
      • they are [+Q] (question) complementizers

    In English, all verbs that can select a CP headed by whether can also select a CP headed by if, though some speakers prefer one or the other, or prefer one to the other for certain verbs. If you’re a fluent English speaker, ask yourself if you prefer one of these words to the other one.

    The relationship between that and ∅ is slightly more complex. For the most part, any verb that can select a CP headed by that can also select a CP headed by ∅. But there are a few verbs that strongly prefer an overt that. In my own English, one example is the verb report:

    (17) a. The newspaper reported that there was a demonstration yesterday.
    b. *The newspaper reported ∅ there was a demonstration yesterday.

    When a complementizer phrase occurs in a different position in a sentence that is also often obligatory. For example, when a clause is the subject of a sentence, many people find it ungrammatical to leave that out in English.

    (18) a. [That there was a demonstration yesterday] surprised some people.
    b. *[There was a demonstration yesterday] surprised some people.
    c. It surprised some people [that there was a demonstration yesterday].
    d. *It surprised some people [there was a demonstration yesterday].

    The choice of complementizers is something that varies a lot across different varieties of English. If you or other people in your class don’t share the judgements reported here, see if you can figure out generalizations that describe your own grammar better!

    Embedded nonfinite clauses

    Are complementizer phrases all just questions or statements?

    There’s at least one other distinction in types of clauses that verbs can take as complements. Consider, for example, the verb want:

    (19) a. I want [ghosts to exist].
    b. *I want [{that/∅} ghosts exist].
    c. *I want [{whether/if} ghosts exist].

    In my English, and maybe in yours, the verb want doesn’t allow any of the complementizers we’ve seen so far. Instead, it requires that the clause it embeds have a nonfinite verb. Is a clause like [ ghosts to exist ] a complementizer phrase, or should we identify it as something else?

    There is reason to think that at least some nonfinite clauses can occur in complementizer phrases: while many nonfinite embedded clauses don’t have any overt complementizer, at least some do. Consider the examples in (20).

    (20) a. I want [for ghosts to exist].
    b. I prefer [for my coffee to have milk in it].
    c. I’d like [for you to leave now].

    Not everyone likes for in these examples, but at least some English speakers find it grammatical. Here for appears in the same kind of position we previously saw that, if, or whether in. We can analyze for as a complementizer for nonfinite clauses.

    What makes for possible in (20), for many speakers, is the presence of a subject in the embedded clause. In many languages, including English, it’s possible for nonfinite clauses to lack a subject; we won’t look at such clauses more in this chapter, but in some varieties of English the complementizer for can occur without any subject, resulting in what are called for to infinitives, as in I was happy for to leave. You can read more about this construction, and its distribution across varieties of English, at the Yale Grammatical Diversity Project: for to infinitives.

    Some verbs can take either finite or nonfinite complements. Consider the verb prefer:

    (21) a. I prefer [that cookies have chocolate chips].
    b. I prefer [for cookies to have chocolate chips].

    So just like the verb know can select either a question or a statement as its complement, the verb prefer can select either a finite or a non-finite clause.

    “Nonfinite” and “infinitive” are two words for the same thing—notice that in both cases a negative prefix (non- or in-) attaches to the root finite. Nonfinite verbs in English appear with the infinitive marker to. This marker shows up in the same position occupied by auxiliaries—after the subject but before negation—and it’s in complementary distribution with the modal auxiliaries.


    Check your understanding

    Coming soon!


    Navigation

    If you are following the alternative path through this chapter that interleaves core concepts with tree structures, the previous section was 6.17 Trees: Structural ambiguity and the next section is 6.18 Trees: Embedded clauses.


    References

    Kaplan, Aidan, Eliza Scruton & Jim Wood. 2017. For to infinitives. Yale Grammatical Diversity Project: English in North America. (Available online at http://ygdp.yale.edu/phenomena/for-to-infinitives. Accessed on 2022-08-19). Updated by Katie Martin (2018).


    This page titled 6.6: Clausal embedding is shared under a CC BY-NC-SA 4.0 license and was authored, remixed, and/or curated by Catherine Anderson, Bronwyn Bjorkman, Derek Denis, Julianne Doner, Margaret Grant, Nathan Sanders, and Ai Taniguchi (eCampusOntario) via source content that was edited to the style and standards of the LibreTexts platform.