TOWARDS AN ANNOTATED DATABASE FOR ANAPHORA 
RESOLUTION 


R.DELMONTE, L.CHIRAN, C.BACALU 
Laboratory of Computational Linguistics 
Ca' Garzoni­Moro, San Marco 3417 
Universit "Ca Foscari" 
30124 ­ VENEZIA 
Tel. 39­41­2578464/52/19 
E­mail: delmont@unive.it 

ABSTRACT 

The topic of this paper is the theoretical foundation and the results of work carried to organize a relational 
database on pronominal anaphoric relations, in particular when the anaphor is a morphologically expressed or 
unexpressed clitic. This research was carried out at the University of Venice, Laboratory of Computational 
Linguistic, Department of Linguistics and Language Teaching Theory. 


The basic data used in the research was made available by the GETARUN program developed at the 
University Ca' Foscari of Venezia [1] , which implements a parser in order to determine the morpho -- syntactic 
characteristics of any word from a text. We used a text taken from four Italian journals. The text is composed of 
different parts and contains over 40,000 words. 


The structure of the paper is as follows: Introduction; Section 2 -- Clitics and its Theoretical features; 
Section 3 -- Clitics and Database creation; Section 4 -- Results; Section 5 ­ Conclusions. In a separate Appendix 
we report excerpts from the Anaphoric Relations Database. 


1. INTRODUCTION 

Anaphora and anaphora resolution have received a great deal of attention from research in linguistics, 
computational linguistics and artificial intelligence [2­4]. Regarding this aspect of natural language, many 
theories and solutions have been proposed and implemented with varying degrees of success. However not much 
research has been devoted to the annotation of corpora with the aim of extracting information relevant and useful 
for the study and the solution of the problems related to presence of anaphora in real texts. The work we carried 
out in the last six months was directed towards a preliminary case study of the field by working on real data and 
trying to find solutions and apply them. 


From suggestions offered by research people who had already worked on this topic [M.Poesio, personal 
communication] we decided to limit our scope to only one type of anaphoric pronoun: the morphologically 
expressed or unexpressed class of clitic pronouns. These pronouns are different from the rest of the rich 
inventory available in Italian ­ and in the Romance language Romanian, which will eventually have to be 
compared to the current one ­ because they are monosyllabic, unstressed and fairly commonly used ­ but see the 
figures below. Personal emphatic pronouns, possessive pronouns, deictic pronouns, and quantified expressions ­ 
leaving aside interrogative and relative pronouns which require a completely different treatment in the syntax ­ 
have a different status and may alter the overall framework. So we decided to keep our database as much 
homogeneous as possible: but this notwithstanding, a number of preliminary decisions had to be taken already 
from the start. When we chose clitics as a comprehensive category we had first of all to decide whether we 
wanted to include the empty subject pronoun or not in the list of anaphoric relations to be studied. This was an 
easy decision to take, given the fact that morphologically expressed clitics are the morphological spellout of the 
antecedent together with some semantic information related to animacy and semantic role inherited from the 
governing predicate. So we had to conclude that the little_pro, i.e. the morphologically unexpressed subject 
pronoun used in Italian texts was an empty clitic lacking in one feature (gender) in case the main verb was not a 
compound, because in that case the past participle would have indicated that too. 
However, we had to exclude from the study sentential anaphors expressed in the form of deictic 
pronouns but also clitic pronouns; empty pronominals as placeholders for elliptical material. 

 

2. Clitics and their Theoretical features 


Clitics have been the subject of much discussion in the linguistics literature, as to their definition which is 
still a topic of debate. A possible definition might be the followin one: 
A clitic is a language element with wordlike status or form that resembles a word. 
A clitic usually cannot be used on its own as a word in a construction. Clitics are usually phonologically 
bound to a preceding or a following word. 


. Clitics are usually associated with certain properties such as the fact that they occur in a special position 
within a sentence. The following Italian example shows this: 
. Maria lo vede. / Mario sees it/him 
. In this case, the accusative clitic lo occurs on the left of the finite verb, while usually direct objects occur on 
the right. 
. Clitics cannot occur alone, they need a host to attach to; in most Romance languages this host is represented 
by the verb. 
. Clitics occur in a fixed order and this order is usually different from that of the corresponding full phrases: 
. Maria me lo spiega. / Mario to me it explains 
. In this example me represents a dative clitic and lo an accusative clitic. In Italian, the direct object usually 
precedes the indirect object if they are both represented by full phrases, while the example above shows that 
this is not the case if clitics are present. 
. Another property, which is usually associated with clitics, is that they cannot be stressed. 
. These properties are considered the basic characteristics, which distinguish clitics from other elements. 


A problem which is still open is the one related to the status of clitics, namely the question of whether they 
should be treated as affixes or as independent syntactic forms. This is because clitics have a status which is 
neither obviously that an independent word, nor that of an affix. 
Within early works in generative grammar, such as that of Kayne(1975), the assumption that clitics are 
syntactically independent elements is not questioned. More generally, the problematic status of clitics with 
respect to the interaction of syntax, morphology was to the large extent neglected. It is only with appearance of 
Zwicky (1977) that clitics are looked at from a broader perspective and that a classification of clitic types which 
takes into account their various syntactic, morphological and phonological properties is proposed. 

2.3. Types of clitics for Italian 
As to their case and related syntactic constituency, Italian clitics can be classified as follows: 
. Nominative 
. Accusative 
. Dative 
. Partitive or Genitive 
. Locative or Ablative 
As to their semantic import, Italian clitics can be classified as follows: 
. Impersonal 
. Reflexive 
. Pleonastic 
. Expletive


2.4. The analysis of si 
One particular type of clitic which occurs very frequently in our database is the ``si''­clitic. Its behavior is 
in many respects different from that of the other clitics. ``Si'' can have one of the following interpretations: 
1. Reflexive/reciprocal interpretation 
Example: 
Maria si lava. / Maria is washing herself 
In this case named ``accusative si'', the clitic represents a direct object which is coreferential with the 
subject. 
2. Impersonal interpretation 
Example: 
Si vedono le montagne. / One sees the mountains 
It is understood as a generic quantified subject, therefore it is ``nominative si'' 3rd person plural, 
+human. 
3. Dative­ Ethic ­ Beneficiary si 
Example: 
Maria si lava le mani. / Maria is washing her hands. 
4. Middle interpretation 
Example: 
Le fragole si mangiano spesso. / Strawberries are often eaten 
This construction seems like a passive ­ the property that the direct object functions as the superficial 
subject. In this case, the clitic cannot be interpreted as an argument, but it has a passivizing function. 
5. The ergative interpretation 
Example: 
La bottiglia si e' rotta. / The bottle broke 
The clitic doesn't reduce the subcategorization requirements of the verb, but it is just a marker of the 
ergative verb form. 
6. Inherent reflexive interpretation 
Example: 
Gino si e' arrabbiato. / John got angry 
In this case the clitic doesn't represent an argument of the verb; ``si'' is simply a marker of 
reflexivization. 
An interesting case is the following: 
``Si sono lavati due bambini'' / Two children were being washed ­ Two children were washing/each other 
When the phrase begins with ``si'' there is an ambiguous situation. This clitic can have an impersonal 
interpretation, but also can have a reflexive/reciprocal interpretation. To solve the ambiguity we can prepose the 
subject NP: 
``Due bambini si sono lavati'' 
the ambiguity is still present, ``si'' can have a reciprocal interpretation -- one child washed the other, and a non -- 
reciprocal interpretation -- every child washed himself. 

3. Anaphora and Database creation 

Anaphora resolution in general is one of the most challenging tasks in natural language processing. 
The object of our research was to study pronominal anaphora, from a statistical point of view: trying to 
use an approach that defined some metrics to evaluate the distance between the anaphor and its antecedent. This 
would be turned into a useful information when we wanted to organize automatic procedure to find the 
antecedent of a clitic and also simply to assess the main general charachterizing features of the phenomenon 
under study. 

So we designed a database structure and then we imported the data available in text format into this 
database. In order to mark the relation between anaphor and the corresponding antecedent we built a program 
with a friendly interface. 
The steps in the creating of our database have been: 
text has been split into paragraphs and paragraphs have been split into sentences and words; 
morphological features and POS tags for every word of our text were available from the tagger; 
syntactic structure information related to chunks or phrases into which each sentence was split were also 
available; 
we had to produce the overall design of the database in order to establish the appropriate links. 


3.1 The structure of our database 
Our database is made up of the following relations: 
Tokens -- contains the words of our text in the order of their appearance and information about the place of 
the word in the text and the place of the word in the sentence; 
Sentences -- contains the sentences of our text; 
Anaphora -- is the relation where the anaphora links are stored; 
SnSentences -- contains the syntactic analyses of the sentences of our text ; 
Feats -- contains all the possible morphological features of every word of our text, 
Another two sub­relations have been defined: 
Clit -- contains only those tokens that have the morphological characteristic clit; 
Clitvt -- contains only those tokens that have the morphological tag vt, vsup, vc, vin or vcl, i.e. that of the 
verb with the morphological features to be inherited by the empty subject clitic. 
We envisage to port the database under XML as soon as enough data are available to represent syntactic 
constituency at sentence level. 
Here below we shall present the main relations and joins between relations. 
3.1.1. The relation ``Tokens'' 
The relation ``Tokens'' contains all words of the text. It has the following structure: 
Nfrase represents the number of the sentence in which the token occurs; 
PosSentence represents the token's position in the corresponding sentence, as a number; 
Ntoken is an unique number and represents the token's position in the whole text ; 
Token is a text field and contains the token as a string: 
Tag is the field that contains the morphological characteristic of the token: 
CarSin contains the syntactic characteristic of the token; 

Sentence is a calculated field from the relation ``Sentence'' that gives the whole sentence in which the token 
occurs, as a string; 
Feats contains the morphological features of the token, such as: number, genre, etc. This is obtained, using the 
information from ``Tag'', from the relation ``Feats'' where there are all the possible morphological features for 
this token; 
ConstSint gives the token's syntactic constituent. 
Another important aspect is constituted by the joins between this relation and other relations in the database. So 
there are relationships between ``Tokens'' and ``Serntence'', ''Anaphora'' and ``Feats''. The fields that holds these 
relationships are ``NSentence'', ''Ntoken'' and ``Token''. 
3.1.2. The relation ``Sentence'' 
This relation contains the sentences from our text. 
NSentence contains the unique number of the sentence 
CountWSentence represents the number of tokens that occur in the sentence 
Sentence contains the whole sentence as a string 
3.1.3. The relation ``Anaphora'' 
This relation was projected for the anaphoric relations. 
The meaning of the fields is as follows : 
NAntecedent represents the unique number of the token which is the antecedent for the clitic; 
NAnaphor represents the unique number of the token which is the clitic (anaphor); 
Dist is a calculated field (NAnahor -- NAntecedent) and it gives the distance between the antecedent and the 
anaphor in number of words; 

FDist is also a calculated field using the ``Anaphora'' 's join with ``Sentence'', and it gives the distance between 
the anaphor and its antecedent in number of sentences. 
This relation has two joins with ``Tokens'' . Those joins are : 
3.2. The database scheme 
DataBase { TOKENS (NSentence , Pos Sentence, Ntoken ,Token, Tag, CarSin,Sentence , Feats) 
SENTENCES (NSentence, CountWSentence,Sentence) 
ANAPHORA (NAntecedent, NAnaphor, Dist, FDist ) 
FEATS (Nr, Token, Feat) 
SNSENTENCES (AnalyseFrasi, NSentence) } 
There are another two relations (sub­relation) ``Clit'' and ``Clitv'', which are two projections for ``Tokens'' 
whenever an empty subject clitic has been individuated. They have the same structure, but the criteria of the 
selections were ``Tag = Clit'' and ``(Tag = vcl) or (Tag =vt) or (Tag = vin) or (Tag = vsup) or (Tag = vc) or (Tag 
= ause) or (Tag = ausa)'' where the corresponding categories are the verbal tags which contained the 
morphological features to be associated to the empty clitic. 
Our database has primary keys and external keys: 
Relation ``Tokens'' : the primary key is ``Ntoken'' and it is also an external key, used for the join with 
``Anaphora''; 
Relation `` Sentence '' : the primary key is ``NSentence ''. ``N Sentence '' is also an external key for the joins 
with ``Tokens'' and ``Sn Sentence ''; 
Relation ``Feats'' :''Nr'' is the primary key and ``Token'' is the external key that serves for the join with 
``Tokens''; 

Relation ``SnSentence '' : ``NSentence '' is the primary key and the external key for the join with `` Sentence''. 


3.4 The E­R Model 
Between Tokens and Clit and Clitv there exists a relation and its type is part­of. 

4. Results 

The base text for our research has 40,000 words: however, we have completed the computation of all 
indices for the first 10,000 tokens and are currently working on the remaining 3/4 of it. The sources of the 
completed portion of our corpus are two Italian magazines : ``Genius'' and ``Mondo'' , 1986. It consists of a 
scientific text divided into three different parts: 
About e­mail service; 
An interview with a famous architect; 
About Italian physicists. 
SENTENCES ANAPHORA 
1 n n n 
SNFRASI FEATS 




TOKENS 
TOKENS 
Clit Clitv 

The anaphoric relations have been done non­automatically (by hand), using a simple interface. We have 
found about 240 anaphoric relations. This number shows that the use of clitics is not a very frequent 
phenomenon. 
As said at the beginning, we also studied the case when the subject is not morphologically expressed. 
The null­subject is a common feature of all Romance languages. The number of occurrences of the null subject 
is 91, i.e. it is the most frequent one. 
As it can be seen in the following table the ``impersonal si'' is ranked second; enclitics, represented by 
tag "vcl" is ranked third, and the referential pro­clitics are only 53 overall. 
Morphological 
Characteristic 
Number of 
occurrence 
``vt'',``vin'',``vsup'',``vc'' 91 
Impersonal ``si'' 37 
``vcl'' 33 
``c'',``ci'' 14 
``clitac'' 18 
``clitabl'' 17 
``clitdat'' 10 
Other clitics 20 
For an automatic approach of the clitics problem it is useful to know the distance between the anaphor 
and its antecedent. The following table shows the distance between the sentences in which the two anaphoric 
elements occur and the number of occurrences for any case: we also calculated the probability for the 
occurrences of those distances. 
Distance 
between 
sentences 
Number of 
occurrences 
Probability of 
Occurrence 
0 163 0.68 
1 45 0.187 
2 17 0.056 
3 5 0.016 
4 3 0.01 
5 5 0.056 
6 3 0.016 
7 3 0.016 
8 1 0.003 
9 3 0.016 
12 1 0.003 
As we expected, the case in which the two elements are in the same sentence has the highest rank. This 
happens because there are many ``impersonal si'' and the antecedents for those clitics do not exist. 
 

5. CONCLUSIONS 

The information encoded in the morpho­syntactic annotation as well as the structure of the database have been 
very helpful in the work we carried out. In particular, the presence of morphological features and of a specific 
case marking has proved useful in the determination of the relation antecedent­anaphor. 
We intend to complete our study of antecedent­anaphora relations in our corpus by taking into account the depth 
of embedding of constituentscontaining anaphoric elements, and to use this as a further possible metrics for 
evaluating difficulties in automatic anaphora resolution algorithms. 
As to current available data, the conclusions we may safely draw are two: 
. the phenomenon of referential relations expressed in terms of morphologically expressed and unexpressed 
clitics in a Romance language like Italian is not very prominent even though the total number of sentences 
in the 10,000 tokens sub­corpus is 402: this would make a anaphora/sentence ratio of 1 each 2 sentences 
approximately; 
. however, since most clitics are made up by impersonal "si" which does not require a search for a specific 
antecedent and that there is a certain number of sentences which contain more than one clitic this will reduce 
the number of real referential search at discourse level to less than 1 every 3 sentences; 
. in general, pronominal expressions like clitics search locally, sentence­internally for their antecedent, and 
this applies for 75% of all antecedent­anaphora relations; 
. in addition, at discourse level, the expected distance for anaphora resolution in terms of number of sentences 
is between one and two. 
We report in the Appendix excerpts from our database showing the details of linguistic information contained in 
the text annotation generated from GETARUN. 
7. REFERENCES 
[1] Delmonte R., D.Bianchi(1994), Computing Discourse Anaphora from Grammatical Representation, in 
D.Ross & D.Brink(eds.), Research in Humanities Computing 3, Clarendon Press, Oxford, 179­199. 
[2] Delmonte R., D.Bianchi(1999), Determining Essential Properties of Linguistic Objects for Unrestricted Text 
Anaphora Resolution Proc. Workshop on Procedures in Discourse, Iasi (Romania), Pisa, pp.10­24. 
[3] Aone C.& D.McKee(1994), A Language­Independent Anaphora Resolution System for Understanding 
Multilingual Texts, in Proc.ACL, 156­163. 
[4] Rich E. & S.Luperfoy(1988), An Architecture for Anaphora Resolution, in Proc.2nd Conference on 
Applied Natural Language Processing, 18­24. 
[5] Paola Monachesi(1999), ``A Lexical analysis of italian clitics'', Proceeding of Vextal ­Venezia, 57­65. 
[6] Mate Deliverable D1.1 Supported Coding Schemes, Expressions which may enter into co­specification 
relations. 

APPENDIX 

THE LINGUISTIC CONTENTS OF THE ANAPHORIC 
RELATIONS DATABASE 
A. AMBIGUOUS TOKENS WITH POSITIONAL INDEX 
tl(534, 14, è, [ause, vc], 2, 2005). 
tl(535, 14, infatti, [congf], 1, 24374). 
tl(536, 14, sull, [part], 1, 42601). 
tl(537, 14, efficienza, [n], 1, 42699). 
tl(538, 14, dell, [partd], 1, 39634). 
tl(539, 14, ambiente, [ag, n], 2, 7733). 
tl(540, 14, ',', [punt], 1, nil). 
tl(541, 14, nelle, [part], 1, 34187). 
tl(542, 14, sue, [poss], 1, 42770). 
tl(543, 14, componenti, [ag, n, ppre], 3, 42823). 
tl(544, 14, infrastrutturali, [ag], 1, 24896). 
tl(545, 14, ',', [punt], 1, nil). 
tl(546, 14, organizzative, [ag], 1, 43010). 
tl(547, 14, e, [cong, congf], 2, 5538). 
tl(548, 14, di, [pd, pt], 2, 448). 
tl(549, 14, qualità, [n], 1, 20800). 
tl(550, 14, della, [partd], 1, 9434). 
tl(551, 14, vita, [n], 1, 43079). 
tl(552, 14, ',', [punt], 1, nil). 
tl(553, 14, che, [pk, exc, int, q, rel], 5, 3531). 
tl(554, 14, si, [clit, clitdat], 2, 3791). 
tl(555, 14, determinano, [vt], 1, 43138). 
tl(556, 14, il, [art], 1, 8296). 
tl(557, 14, grado, [n, vt], 2, 43299). 
B. DISAMBIGUATED TOKENS 
i(534­14, ause­ibar­è, è­ause). 
i(535­14, congf­svt­infatti, infatti­[type=exp]). 
i(536­14, part­sp­sull, su­[cat1=prep, p2=il, cat2=art, type=det, num=s]). 
i(537­14, n­sp­efficienza, efficienz­[type=com, gen=f, num=s]). 
i(538­14, partd­spd­dell, di­[cat1=prep, p2=il, cat2=art, type=det, num=s]). 
i(539­14, n­sn­ambiente, ambient­[type=com, gen=m, num=s]). 
i(540­14, punt­fp­',', ','­punt). 
i(541­14, part­sp­nelle, in­[cat1=prep, p2=il, cat2=art, type=det, gen=f, num=p]). 
i(542­14, poss­sp­sue, su­[gen=f, num=p]). 
i(543­14, n­sn­componenti, componenti­n). 
i(544­14, ag­sa­infrastrutturali, infrastruttural­[gen=mf, num=p]). 
i(545­14, punt­fp­',', ','­nil). 
i(546­14, ag­fp­organizzative, organizzativ­[gen=f, num=p]). 
i(547­14, cong­sn­e, e­cong). 

i(548­14, pd­spd­di, di­pd). 
i(549­14, n­sn­qualità, qualità­[type=invar, gen=f]). 
i(550­14, partd­spd­della, di­[cat1=prep, p2=il, cat2=art, type=det, gen=f, num=s]). 
i(551­14, n­spd­vita, vit­[type=com, gen=f, num=s]). 
i(552­14, punt­fp­',', ','­nil). 
i(553­14, pk­fac­che, che­[type=pk]). 
i(554­14, clit­ibar­si, si­(clit­si­[case=nom, num=p])). 
i(555­14, vt­ibar­determinano, determin­[mood=indic, tense=pres, pers=3, num=p, scat=tr]). 
C. FEATURES 
0­sw(1­le­[art, clitac, clitdat]­3­[art­il­[type=def, pred=il, gen=f, num=p], clitac­le­[case=acc, gen=f, num=p], 
clitdat­le­[case=dat, gen=f, num=s]]). 
120­sw(2­infrastrutture­[n]­1­[n­infrastruttur­[type=com, gen=f, num=p]]). 
195­sw(3­come­[avv, ccom, int]­3­[avv­come­[type=r], ccom­come­[type=comp], int­come­[type=int, gen=any, 
num=any]]). 
312­sw(4­fattore­[n, agn]­2­[n­fattor­[type=com, gen=m, num=s], agn­fatt­[type=adj, gen=m, num=s], agn­fatt­ 
[type=adj, gen=m, num=s]]). 
448­sw(5­di­[pd, pt]­2­[[pt, pd]­di­[cat=prep]]). 
498­sw(6­competitività­[n]­1­[n­competitività­[type=invar, gen=f]]). 
567­sw(7­'Angela'­[nh]­1­[nh­'Angela'­[type=hum, gen=f]]). 
626­sw(8­'Airoldi'­[npro]­1­[[pred='Airoldi', feat=proper]]). 
688­sw(9­negli­[part]­1­[part­(in)­[cat1=prep, p2=il, cat2=art, type=det, gen=m, num=p]]). 
6. SENTENCES 
f(1,1,6,[le, infrastrutture, come, fattore, di, competitività, .]). 
f(2,2,3,[di, 'Angela', 'Airoldi', .]). 
f(3,3,51,[negli, ultimi, anni, la, dinamica, dei, fenomeni, economici, è, stata, sempre, più, caratterizzata, dall, 
emergere, di, una, crescente, concorrenza, che, si, è, progressivamente, spostata, dalle, singole, imprese, ai, 
sistemi, economici, e, territoriali, ',', determinando, l, esigenza, di, una, riconsiderazione, dei, rapporti, 
esistenti, tra, soggetti, produttivi, e, ambiente, in, cui, questi, operano, .]) 
f(3,4,37,[il, raggiungimento, e, il, mantenimento, di, posizioni, competitive, sono, sempre, più, il, risultato, 
della, interazione, tra, le, azioni, dei, singoli, soggetti, '(', non, solo, economici, ')', e, la, disponibilità, di, 
risorse, presenti, nel, contesto, socio_economico, di, riferimento, .]).. 
7. SYNTACTIC STRUCTURES 
Oppure, se si vuol parlare di applicazioni, sarà bene ricordare che sono fisici anche tutti coloro che si occupano 
di laser, per materiali per l'elettronica, di superfreddo e superconduttività. Perché i fisici delle particelle sono alla 
ribalta molto più spesso dei loro colleghi che si occupano di laser o di stato solido? "Prima di tutto", risponde 
Roberto Fieschi, docente di fisica dello stato solido dell'Università di Parma, "perché i fisici nucleari e 
subnucleari hanno il loro istituto, l'Infn, agile ed efficiente malgrado le pastoie del parastato. Mentre gli altri 
gruppi di ricercatori fisici sono dispersi negli istituti e nei centri del Cnr, e soffrono tutte le ben note difficoltà di 
questo ente." 
cp­[fc­[cong­Oppure], 
fp­[punt­,], 
fs­[cosu­se, 
f­[ibar­[clit­si, vsup­vuol, vit­parlare], 
compin­[spd­[pd­di, sn­[n­applicazioni]]]]], 

fp­[punt­,], 
f­[ibar­[clit­si, vit­ricorderà], 
compt­[fac­[pk­che, 
f­[ibar­[vc­sono, svc­[sa­[ag­fisici]]], 
sn­[in­anche, qc­tutti, deit­coloro, 
f2­[rel­che, ibar­[clit­si, vt­occupano], 
compin­[ 
coord­[spd­[pd­di, sn­[n­laser]], 
fp­[punt­,], 
sp­[p­per, sn­[n­materiali], 
sp­[p­per, sn­[art­l, n­elettronica]]], 
fp­[punt­,], 
spd­[pd­di, sn­[n­superfreddo]], 
cong­e, 
sn­[n­superconduttività]]], 
f­[punto­.]]]]]]]] 
cp­[cosu­Perché, 
f­[sn­[art­i, n­fisici, spd­[partd­delle, sn­[n­particelle]]], 
ibar­[vc­sono], 
svc­[sp­[part­alla, sn­[n­ribalta]], savv­[in­molto, in­più, avv­spesso], 
spd­[partd­dei, sn­[poss­loro, n­colleghi, 
f2­[rel­che, ibar­[clit­si, vt­occupano], 
coord­[spd­[pd­di, sn­[n­laser]], 
cong­o, 
spd­[pd­di, n­stato, sa­[ag­solido]]], 
f­[puntint­ ?]]]]]]] 
cp­[fp­[par­"], 
sp­[php­prima_di, sa­[avv­tutto]], 
fp­[par­"], 
fp­[punt­,], 
f­[ibar­[vin­risponde], 
sn­[nh­roberto_fieschi], 
fp­[punt­,], 
sn­[n­docente, spd­[pd­di, sn­[n­fisica_dello_stato_solido]], 
spd­[partd­dell, sn­[n­Università, spd­[pd­di, sn­[np­Parma]]]]]], 
fp­[punt­,], 
fp­[par­"], 
fs­[cosu­perché, 
f­[sn­[art­i, n­fisici, 
coord­[sa­[ag­nucleari], 
cong­e, 
sa­[ag­subnucleari]]], 
ibar­[vc­hanno], 
compc­[sn­[art­il, poss­loro, n­istituto], 
fp­[punt­,], 
sn­[art­l, npro­Infn], 
fp­[punt­,], 
coord­[sa­[ag­agile, 
cong­ed, 

sa­[ag­efficiente]]]], 
fs­[cong­malgrado, 
f­[sn­[art­le, n­pastoie, spd­[partd­del, sn­[n­parastato]]]]], 
f­[punto­.]]]] 
fs­[cosu­Mentre, 
f­[sn­[art­gli, ag­altri, n­gruppi, 
spd­[pd­di, sn­[n­ricercatori], sn­[ag­fisici]]], 
ibar­[ause­sono, vppt­dispersi], 
compin­[coord­[ 
sp­[part­negli, sn­[n­istituti]], 
cong­e, 
sp­[part­nei, sn­[n­centri, spd­[partd­del, sn­[npro­Cnr]]]]]]], 
fp­[punt­,], 
fc­[cong­e, 
f­[ibar­[vt­soffrono], 
compt­[sn­[qc­tutte, art­le, in­ben, ag­note, n­difficoltà, 
spd­[pd­di, sn­[dim­questo, n­ente]]]], 
f­[punto­.]]]] 
fp­[par­"],