Syntax
In Sections 8.3.1 and 8.3.2, the feature structure and tree structure
declarations presented in Section 8.2 are illustrated for the domain of
syntax. Special conventions are proposed for encoding derivations within
the framework of categorial grammar in Section 8.3.3.
Feature Structures
Consider the following feature structure, which is a fragment of
a feature structure describing the sentence She smashed a
brick.
The analysis appeared in N.N., Feature and Function
Grammars,
in Blorts and Bleeps and Other Bloopers
(New York: Yamaha, 1999), p. 330.
The structure has a name, Test Sentence,
and
the features
- Category (
Test Sentence
is a sentence or S)
- Patterns (the test sentence has the patterns
Subject,
Predicator,
and Direct-Object
, defined by the underlying
linguistic theory)
- Tense (the sentence has Past tense)
- Voice (the sentence has Active voice)
- Subject (which describes the grammatical subject of the sentence
in terms of a nested feature structure)
- Actor (which points at the feature value of Subject, indicating
that the underlying Actor of the sentence is the same as the surface
grammatical subject)
and other features not included here. The feature Subject,
in
turn, has as its value a feature structure containing other features
(some of them the same as those on the sentence itself) and another
nested feature structure. (Note: this feature structure is intended to
be illustrative only; a fully specified feature structure for a sentence
would include much more information.)
__ __
| |
| TEST-SENTENCE |
| Category = S |
| Patterns = (Subject Predicator Direct-Object) |
| Tense = Past |
| Voice = Active |
| __ __ |
| Subject = | | |
| | Category = NP | /1/ |
| | Patterns = (Head) | |
| | Head = __ __ | |
| | | | | |
| | | Category = Pronoun | | |
| | | Gender = Feminine | | |
| | | Lex = she | | |
| | | (etc. ...) | | |
| | |__ __| | |
| | | |
| | (etc. ...) | |
| |__ __| |
| |
| Actor = /1/ |
| |
| (etc. ...) |
|__ __|
Alternate format follows. -Ed.
TEST-SENTENCE
Category = S
Patterns = (Subject Predicator Direct-Object)
Tense = Past
Voice = Active
Subj /1/ =
Category = NP
Patterns = (Head)
Head =
Category = Pronoun
Gender = Feminine
Lex = she
Actor = /1/
(etc. ...)
This feature structure is encoded as follows:
TEST-SENTENCE
Category
S
Patterns
Subj
Pred
DO
Tense
Past
Voice
Active
Subj
Category
NP
Patterns
Head
Head
Category
pronoun
Gender
Feminine
Lex
she
(etc. ...)
(etc. ...)
Actor
etc. ...
]]>
Let us consider a further simple example---the representation of the
German pronoun sie
, which translates as either she
, her
(acc.)
, they
or them (acc.)
. A possible feature
structure representing the relevant information is given below.
__ __
| |
| Lex = sie |
| Cat = NP |
| __ __ __ __ |
| | | | | |
| | Num = Sg | | Num = Pl | |
| | Gender = Fem | OR | Gender = Masc OR Fem OR Neut | |
| |__ __| |__ __| |
| |
| Case = Nom OR Acc |
|__ __|
Lex = sie
Cat = NP
Num = Sg
Gender = Fem
OR
Num = Pl
Gender = Masc OR Fem OR Neut
Case = Nom OR Acc
The encoding of this feature structure is as follows.
Lex
sie
Cat
NP
Num
Sg
Gender
Fem
Num
Pl
Gender
Masc
Fem
Neut
Case
Nom
Acc
]]>
Tree Structures
Consider the following tree structure, which partially describes
the structure of the sentence Who did John persuade Bill to
invite?
The analysis appeared in N.N., Name of
Paper,
in Name of Volume
(New York: Yamaha, 1999), p. 14.
&gamma1
and &alpha26
(or: G1 and A26)
are alternative names
for the tree (used to identify it in discussion and in other tree
diagrams)
and the dotted-line arrow connecting WH
and the terminal
node NP
represents coindexing. (The node labeled NP points to
the node labeled WH).
&gamma1 = &alpha26 =
G1 = A26 = S'
/ \
..>WH S
. / \
. NP VP'
. | / \
. PRO TO VP
. / \
. V NP. .
. | | .
. invite e .
. .
. . . . . . . . . .
who PRO to invite
This tree structure is encoded as follows:
G1
A26
S'
WH
S
NP
PRO
VP'
TO
VP
V
invite
NP
e
]]>
Derivations in Categorial Grammar
In categorial grammar, syntactic categories are either simple categories
(atomic symbols), complex categories formed by recursively combining
atomic symbols using operators, or derived categories---that is,
categories that result from using rules of combination. All categories
have an index in order to allow the alignment of the derivation with the
input string. In the interests of notational economy, input and output
categories are not distinguished in the relevant SGML declarations:
]]>
Two sample encodings are given below. The first example is an instance
of type-raising, a rule that yields a derived category from a single
input category.
NP
========>type-raise
S/(S\NP)
The SGML encoding is as follows:
S
S
NP
type-raise
NP
]]>
The second example is the derivation of a simple sentence:
I cooked and ate the beans
________ _________ ____ _________ _________
NP (S\NP)/NP conj (S\NP)/NP NP
____________________________conjoin
(S\NP)/NP
____________________________>apply
S\NP
_____________________________________>apply
S
The encoding, which corresponds to reading the derivation from the
bottom up, is as follows:
S
apply
NP
S
NP
apply
S
NP
NP
conjoin
S
NP
NP
and
S
NP
NP
NP
]]>
Syntactic and Lexical Ambiguity
This section discusses some of the special problems for linguistic
markup posed by lexical and structural ambiguity, giving as an example a
feature-structure representation of the English sentence Wash
sinks. Each word of this sentence can be considered either a noun
or a verb each with different senses; as a sentence it is structurally
ambiguous, being either a declarative or an imperative sentence; in
addition, the word Wash can also be considered a proper noun.
For purposes of this illustration, we assume that this sentence has four
interpretations: two as a declarative sentence, and two as an imperative
sentence. The four interpretations are paraphrasable as follows.
- Laundry goes to the bottom.
- Someone named Wash goes to the bottom.
- I order you to clean basins used for washing.
- I order you to clean depressions in a land surface.
The actual structure of the example is given in the
f.struct whose id is F1. This f.struct
contains one f.s.OR, which encloses pointers to the
analyses of the sentence as respectively declarative and
imperative. The f.struct for the declarative
interpretation does not itself contain an f.s.OR, but
it does have a pointer to one that does, namely to the
f.struct identified as f3. The f.s.OR in
this f.struct encloses subanalyses for Wash
as a proper noun phrase and as a common noun phrase. Similarly,
the f.struct for the imperative interpretation does
not contain f.s.OR. To find the alternative analyses,
one has to follow a chain of pointers from the
f.struct identified as f2 to f5 and from f5 to f9.
This f.struct contains a f.s.OR which
encloses the two glosses for the noun sinks. Note
that no single f.struct directly reveals the four-way
ambiguity assumed to be associated with the sentence, but the
ambiguity is nevertheless represented by the collection of
f.structs in the markup.The
illustration also makes use of a path on
the f.ptr tag. Its purpose is to express the
disambiguation of an ambiguous subpart of a larger structure.
The basic idea is
that if a f.ptr points to an ambiguous structure (a
structure that has an f.s.OR in it), and one of its
interpretations is to be selected, then a path
attribute is specified for it, whose value is a list of ID
attributes of feature structures within the ambiguous structure.
These IDs indicate which of the
alternative f.structs inside the f.s.OR
are the interpretations of the structure being pointed to by the
enclosing f.ptr.
First the linguistic analysis itself is described, and then the
sample markup is given.
The illustration is broken down into text, a series
of f.structs and alignment.
The markup of the text presumably does not conform to the
recommendations of chapter 6, since I have not checked them for
conformity thereto. --TL
W
a
s
h
␣
s
i
n
k
s
.
Analysis of sentence 'Wash sinks.'
Category
Sentence
Alternative subanalyses
One subanalysis of 'Wash sinks.'
Mood
Indicative
Voice
Stative
Subject
Predicate
Another subanalysis of 'Wash sinks.'
Mood
Imperative
Voice
Active
Subject
Category
NP
Number
Unspecified
Person
2
Predicate
Analysis of VP 'Wash sinks'
Category
VP
Head
Direct Object
Analysis of NP 'wash'
Category
NP
Alternative subanalyses of NP 'wash'
Proper
Number
Singular
Object-type
Person
Mass
Number
Singular
Head
Analysis of VP 'wash'
Category
VP
Head
Analysis of NP 'sinks'
Count
Number
Plural
Head
Analysis of VP 'sinks'
Category
VP
Head
Analysis of Noun 'wash'
Category
Noun
Inflected
Analysis of Verb 'wash'
Category
Verb
Inflected
Alternative subanalyses of Verb 'wash'
Subcat
Intransitive
Subcat
Transitive
Analysis of Noun 'sinks'
Category
Noun
Gloss
basins for washing
depressions in a land surface
Inflected
Number
Plural
Analysis of Verb 'sinks'
Category
Verb
Inflected
Number-of-subject
Singular
Person-of-subject
3
Alternative subanalyses of Verb 'sinks'
Subcat
Intransitive
Subcat
Transitive
]]>