Skip to main content
Social Sci LibreTexts

5.6: From constituency to tree diagrams

  • Page ID
    199924
    • Catherine Anderson, Bronwyn Bjorkman, Derek Denis, Julianne Doner, Margaret Grant, Nathan Sanders, and Ai Taniguchi
    • eCampusOntario

    \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)

    ( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\id}{\mathrm{id}}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\kernel}{\mathrm{null}\,}\)

    \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\)

    \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\)

    \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)

    \( \newcommand{\vectorA}[1]{\vec{#1}}      % arrow\)

    \( \newcommand{\vectorAt}[1]{\vec{\text{#1}}}      % arrow\)

    \( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vectorC}[1]{\textbf{#1}} \)

    \( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)

    \( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)

    \( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)

    \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)

    In this section we begin to introduce the formal notation of tree diagrams. We use tree diagrams to make specific and testable claims (hypotheses) about the structure of phrases and sentences.

    Thinking back to Section 6.1, one way of thinking about the goal of syntactic theory is that it’s aiming to account for what languages users know about which sentences are grammatical, and which sentences are ungrammatical.

    What constituents do we find inside sentences? Well, we know that a sentence consists of (at least) a subject and a predicate, and that subjects are (usually) noun phrases, and predicates are often verb phrases. We might express this as a rule, known as a phrase structure rule.

    Phrase structure rules were central to the theory of syntax developed in Chomsky 1957, which kickstarted the modern field of generative syntax.
    (1) S → NP VP

    This rule says that wherever you have an S, it is possible for that S to be made up of an NP followed by a VP.

    Tree diagram: [S [NP] [VP] ]
    Figure 6.1 Schematic tree for S containing an NP subject and VP predicate

    Another way to represent the same idea is with a tree diagram, as in Figure 6.1. Tree diagrams can express the same information as phrase structure rules, but can more efficiently express the output of multiple such rules; current syntactic theories are typically expressed in terms of constraints on possible trees, rather than in terms of constraints on phrase structure rules.

    What kind of structure might we expect to find inside the NP subject? Here are some NPs—you might think of a sentence in which some or all of them can occur (remember that you can tell if a string in a sentence is a single NP by using a replacement test to try substituting a pronoun).

    (2) a. robots (N)
    b. some robots (D N)
    c. six robots (Num N)
    d. the six robots (D Num N)
    e. the six small robots (D Num Adj N)
    e. robots from Boston (N PP)
    f. the robots from Boston (D N PP)
    g. the six small robots from Boston (D Num Adj N PP)

    We could abstract across all of these, and write a general phrase structure rule for
    NPs, putting parentheses around all optional elements. A subscript “n” indicates that that element can be repeated any number of times.

    (3) NP → (Det) (Num) (Adj)n N (PP)n

    This can be read as:

    An NP can consist of a determiner (optional), followed by a numeral
    (optional), followed by any number of adjectives (all optional),
    followed by a noun (required), followed by any number of prepositional
    phrases (all optional).

    We could represent the structure of some of the specific NPs in (2) as in Figure 6.2.

    Tree diagrams: [NP [N robots]] [NP [D the] [N robots]] [NP [D the] [Num six] [Adj small] [N robots] [PP from Boston] ]
    Figure 6.2 Tree diagrams for [robots], [the robots], and [the six small robots from Boston]

    The phrase structure rule for NPs referred to prepositional phrases (PP). These phrases have fewer possible shapes than NPs:

    (4) a. from Boston (P NP)
    b. outside (P)
    c. just outside (Deg P)
    d. way beyond my knowledge (Deg P NP)

    We can abstract this in a single phrase structure rule:

    (5) PP → (Deg) P (NP)n N (PP)n

    We could then expand the “NP” symbol using our phrase structure rule for NPs above. That NP might contain another PP inside it—here we’ve encountered recursion again. Figure 6.3 shows tree structures for some of the PPs in (4).

    Tree diagrams: [PP [P from] [NP Boston]], [PP [P outside]], [PP [Deg way] [P beyond] [NP [D my] [N knowledge]]]
    Figure 6.3 Tree diagrams for [from Boston], [outside], and [way beyond my knowledge]

    Now let’s look at some verb phrases (VPs). In the following examples, the VPs are all in [ square brackets ].

    (6) a. The crew [repaired the ship].
    b. The captain [gave the crew orders].
    c. The spaceship [arrived].
    d. The crew [travelled across the galaxy].

    How do we know these are VPs? Well, they come after the subject of the sentence (an NP in all these examples), so that means they are predicates. In one case the predicate is a single word arrived—this word is a verb, so the only thing it could be is a verb phrase. All the other sequences in square brackets could be swapped into the same position as arrived, so they must be phrases of the same type. For example:

    (7) The spaceship [gave the crew orders].

    This sentence might be pragmatically odd unless you assume the spaceship is an artificial intelligence, but it is grammatical. Another test would be the replacement test for VPs, which involves replacement with do (you can review the replacement test in 6.4 Identifying phrases: Constituency tests).

    Based on these tests, we know that a verb by itself (like arrived) can be a VP, and that the object is inside the VP with the preceding verb. We have intransitive VPs with just a verb, transitive VPs with one NP, and ditransitive VPs with two NPs.

    Many ditransitive verbs in English can also appear with an NP and a PP, though some ditransitive verbs, like put, only allow the NP PP version). The alternation between a ditranstive [NP NP] as in (7) and [NP PP] as in (8) is called the dative alternation.

    (8) The spaceship [gave orders to the crew].

    The VPs we’ve seen in this section can be derived with the following phrase structure rules:

    (9) a. VP → Vintrans
    b. VP → Vtrans NP
    b. VP → Vditrans NP NP
    b. VP → Vditrans NP PP

    If we looked at a wider range of VPs we’d also find that adverb phrases can go at the beginning or end of VPs, though not typically in the middle. So we’d end up with the following general phrase structure rule for VP:

    (10) VP → (AdvP) V (NP) (NP/PP) (AdvP)n

    By putting together the structures we’ve proposed for NPs, PPs, and VPs, we’re now in a position to show some trees for full sentences. Figure 6.4 shows a tree for the sentence [The crew from Mars repaired the spaceship].

    Tree diagram: [S [NP [D\\the] [N\\crew] [PP [P\\from] [NP [N\\Mars]] ] ] [VP [V\\repaired] [NP [D\\the] [N\\spaceship] ] ] ]
    Figure 6.4 Tree diagram for The crew from Mars repaired the spaceship.

    There are also some cases where a verb can be followed by an adjective phrase (We are happy.; They seem nice.). There isn’t an easy way to collapse this with our previous phrase structure rule; to account for this we could add a second phrase structure rule for VPs (VP → V AdjP).

    We can also formulate phrase structure rules for various modifier phrases: AdvP, AdjP, and NumP.

    Adverb phrases consist of an adverb (11a), preceded by an optional degree phrase (11b). You can also sometimes get a PP after an adverb (11c).

    (11) a. quickly
    b. very quickly
    c. quite quickly for a sloth

    This is summarized by the phrase structure rule in (12):

    (12) AdvP → (DegP) Adv (PP)n

    Adjective phrases are similar:

    (13) The robot is [AdjP very proud of itself].
    (14) AdjP → (DegP) Adj (PP)n

    Number phrases are also modified by degree phrases!

    (15) a. exactly six
    b. approximately 30
    c. very many
    (16) NumP → (DegP) Numn

    The “objects” of adjectives are almost always expressed by PPs—that is, if there’s something in an AdjP that comes after the adjective, it usually can’t be an NP (or a VP), but instead has to be a PP.

    Adjectives with NP complements

    In English there are a very tiny number of exceptions to the generalization that complements of adjectives are PPs, though the exact number is different for different speakers. The one exception that all English speakers have is the adjective worth. So we can say:

    (17) This object is [ worth a lot of money. ]

    Here the adjective is followed by the NP [a lot of money]. The NP has to be something that expresses a value.

    Some English speakers, including most Canadian English speakers and some in upstate New York and Pennsylvania, have another exception with the deverbal adjectives finished and done:

    (18) a. I am [finished my coffee].
    b. The children are [done their homework].

    If you aren’t from one of those places, you might need to use the verbal perfect (have finished my coffee), or use the preposition with (are finished with their homework).

    Some people allow a few more adjectives in this construction, but they’re all deverbal (that is: derived from verbs). For example, people might allow NP complements with the adjectives started and completed.

    Let’s return to phrase structure rules for whole sentences. We’ve already seen that sentences can consist of an NP followed by a VP:

    (19) [The robot] [repaired the spaceship].

    They can also have an auxiliary (modal or non-modal), and can have adverbs at the beginning or end.

    (20) a. [The robot] will [repair the spaceship].
    b. Maybe [the robot] will [repair the spaceship] tomorrow.

    You can also put negation in a sentence—though negation always requires an auxiliary in English (if there isn’t already an auxiliary, we apply Do-support):

    (21) a. [The robot] didn’t [repair the spaceship].
    b. [The robot] hasn’t [repaired the spaceship].

    From these we can get a full phrase structure rule with several optional elements, but an obligatory NP subject and an obligatory VP predicate:

    (22) S → (AdvP) NP (Aux) (Neg) VP (AdvP)

    Phrase structure rules are useful for describing the sequences that can occur in phrases of different types, but neither these rules nor the trees we’ve seen in this section do more than list the elements that can occur in phrases of different types. In the remaining sections of this chapter, we’ll explore a theory that limits possible tree structures to a few basic configurations, with the goal of explaining not only how languages can vary, but also explaining limits on the variation seen across human languages.


    Check your understanding

    Coming soon!


    If you are following the alternative path through this chapter that interleaves core concepts with tree structures, the previous section was 6.5 Functional categories and the next section is 6.14 Trees: Introducing X-bar theory.

    If your instructor has assigned Appendix 1 instead of the second half of this chapter, you should move on to Appendix 1 now.


    References

    Chomsky, Noam. 1957. Syntactic Structures. The Hague: Mouton.


    This page titled 5.6: From constituency to tree diagrams is shared under a CC BY-NC-SA 4.0 license and was authored, remixed, and/or curated by Catherine Anderson, Bronwyn Bjorkman, Derek Denis, Julianne Doner, Margaret Grant, Nathan Sanders, and Ai Taniguchi (eCampusOntario) via source content that was edited to the style and standards of the LibreTexts platform.