Elaborazione del linguaggio
naturale
Fabio Massimo Zanzotto
FMZ
Part six
Tree Adjoining Grammars
FMZ
Our Aim
Lines of development
Grammatical Representation Power: Build a
formalism/model able to give the possibility
of reducing the unnecessary interpretations
Grammar Use: Build a formalism (and an
associated algorithm) able to represent
partial analysis
FMZ
Our Aim
Lines of development
Grammatical Representation Power:
• CFG (context free grammars)  DCG
• Feature Structures
Grammar Use:
• CYK
• Chart and Early Algorithm
FMZ
Lesson learnt
• Lexicon (i.e. words) is a very important
piece of the Language and of the language
model
• Words carry meaning and govern the
syntactic structure of sentences
FMZ
What we observed
Toy Examples:
... La vecchia porta la sbarra ...
... Il vecchio porta la sbarra ...
... Flying planes can be dangerous ...
... Flying planes is dangerous ...
FMZ
Continuing the observation of the
languages
Some more toy examples:
... Il ragazzo mangia la mela con il coltello ...
... L’uomo guarda il monitor con gli occhi stralunati ...
... Le azioni della acme inc aumentano in tre settimane
da 2 euro a 3 euro ...
How many interpretations are possible for these
sentences?
FMZ
Subcategorisation frames
Necessary subcategorisation frames:
... Il ragazzo mangia la mela con il coltello ...
((NP) mangiare (NP) (PP(con)))
... L’uomo opera il paziente di appendicite ...
((NP) operare (NP) (PP(di)))
... Le azioni della acme inc aumentano in tre settimane
da 2 euri a 3 euro ...
((NP) aumentare (PP(da)) (PP(a)))
FMZ
Modelling Subcat Frames in
CFGs
Target Frame: ((NP) mangiare (NP) (PP(con)))
S  NP VP | NP VP(mangiare)
NP  NP SBAR
VP  VerbX NP | VerbX NP PP
VerbX  Verb | Modal Verb
VP(mangiare)  VerbX(mangiare) NP | VerbX (mangiare)
NP PP(con)
VerbX(mangiare)  Verb(mangiare) | Modal Verb(mangiare)
NP  Art Noun | Art Adj Noun | Noun | Verb Noun | NP PP
FMZ
PP  Prep NP
Observations
Il ragazzo mangia la mela a mezzogiorno con il
coltello
How do we modify those(?):
VP(mangiare)  VerbX(mangiare) NP | VerbX
(mangiare) NP PP(con)
VerbX(mangiare)  Verb(mangiare) | Modal
Verb(mangiare)
FMZ
Summing up
• We understood that subcategorisation can
indicate preferred sentence readings
• We want to build lexicalised rules, that is,
rules governed by lexical elements (words)
• We want to empower the grammar
FMZ
Idea!!!
• Lexicalised rules may be partial tree!
((NP) aumentare (PP(da)) (PP(a)))
S
NP
VP
PP
PP
V
aumentare
IN NP IN NP
da
a
FMZ
Defining better our aim
• We want a lexicalised grammar
– each rule (partial tree) has to at least a lexical item
• We want a grammar equivalent to the a given
grammar
– weak equivalence: equivalence in the language
recognised
– strong equivalence: equivalence in generated trees
with respect to input sentences... remember that the
structure define the “meaning”
FMZ
Operations in CFG
in the derivation, no terminal symbols are
substituted with rewriting rules headed by the
same symbol
may be understood as
tree substitution
no terminal symbols are substituted with trees
headed by the same symbol
is it sufficient to obtain the strong equivalence?
FMZ
Investigating strong equivalence
Given the grammar
and the sequence aaaa, one of the
interpretations is:
FMZ
Let’s build the lexicalised
grammar
Given the tree collection
the interpretation
cannot be obtained!
FMZ
Another example
Given the grammar
it can be lexicalised as follows:
strange rule!!!
FMZ
Another example
The same grammar may be lexicalised also
however, what about this
FMZ
Tree Adjoining Grammars
FMZ
What do we need?
• A new operation!!!
the Tree Adjoining operation
FMZ
Tree Adjoining Example
FMZ
Again the substitution
• The well know operation
FMZ
Substitution example
FMZ
Tree Adjoining again
Does it solve the problem of obtaining the
strong equivalence?
This is the solution to the example problem!!!
FMZ
Tree Adjoining again
successive adjoining of (b4)
FMZ
More examples
may give the interpretation:
FMZ
What is a grammar now?
It is a collection of:
• initial trees, represent the lexicon, e.g.,
• auxiliary trees, represent grammatical rules,
e.g.,
FMZ
Do you remember?
copy-language structures:
Pino, Gino e Rino sono rispettivamente fratello, zio e babbo
di Nino
may be read as:
a1a2a3b1b2b3
Exercise: Find a model in TAG for this problem
FMZ
What we have done?
• we have worked on the representation power of
the grammar
• we introduced:
– lexicalised rules
– the adjoining operation
• where do we pay?
– on the parsing algorithm?
FMZ
Question
Have we resolved the problem of selecting between
different readings (sentence interpretations)?
FMZ
Scarica

Parsing del linguaggio naturale