TextLink: Resource survey compiled by WG1 Inventories
Jiri Mirovsky, Manfred Stede
10.12.15

Inventories
• Lexconn 1.0, 2009
• Charlotte Roze, Laurence Danlos, Philippe Muller (Paris, Toulouse)
• freely available XML
• French
• 328 connective words, phrases
• syntactic type, semantic relation (SDRT/RST)

    <connecteur cat="adv" id="c225" relations="narration" type="coord">
      <forme> ensuite </forme>
    </connecteur>
    <connecteur cat="adv" id="c226" relations="continuation" type="coord">
      <forme> ensuite </forme>
      <exemple> Je trouvais mille prétextes pour ne pas descendre de voiture. D'abord, il n'y avait pas de place pour se garer. Ensuite, ils devaient être en train de dîner quelque part. Des choses comme ça. </exemple>
    </connecteur>

Inventories
• DiMLex 2.0, 2015 (1.0, 1998)
• Manfred Stede, Tatjana Scheffler (Potsdam)
• freely available XML
• German
• 290 connective words, multiwords
• syntactic features, ambiguity info, semantic relation (PDTB3)

    <entry id="k5" word="allerdings">
      <orth type="cont" canonical="1" onr="k5o1">
        <part type="single">allerdings</part>
      </orth>
      <disambi>
        <conn_d>1</conn_d>
        <sem_d>0</sem_d>
      </disambi>
      <focuspart>0</focuspart>
      <non_conn_reading>
        <example type="ADV" tfreq="0">"Hast du das gehoert?" "Allerdings"!</example>
      </non_conn_reading>
      <stts><example type="ADV" tfreq="236">Das neue Cabrio ist auf dem Markt, allerdings ist es so teuer, dass man es sich nicht kaufen sollte.</example></stts>
      <syn><cat>konnadv</cat>
        <ordering><ante>0</ante><post>1</post><insert>0</insert></ordering>
      </syn>
    </entry>

Inventories
• Hebrew Discourse Markers 1.0, 2009
• Yael Maschler (Haifa)
• available in print only
• Hebrew
• textual, interpersonal discourse markers in spoken language
• semantic, pragmatic, prosodic features

Inventories
• Spanish Discourse Particles
• Antonio Briz, Salvador Pons, José Portolés (Valencia)
• dictionary for human users, available online: http://www.dpde.es/
• Spanish
• syntax, semantics, prosody, ..., English/Italian/Portuguese correspondents

Inventories
• TextLink activities so far:
  – Correspondences between French Lexconn and German DiMLex (STSM, Margot Colinet 2015)
  – ... more?
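Both XML inventories above are freely available, so their entries can be read with any standard XML parser. The following is a minimal sketch in Python, using sample strings abridged from the slide examples. The `<lexicon>` wrapper element and the function names `lexconn_relations` and `dimlex_summary` are assumptions made for illustration; they are not part of either resource, and the real files may be organised differently.

```python
import xml.etree.ElementTree as ET

# Lexconn-style entries abridged from the slide. The <lexicon> wrapper is an
# assumption added here so that the fragment has a single root element.
LEXCONN = """
<lexicon>
  <connecteur cat="adv" id="c225" relations="narration" type="coord">
    <forme> ensuite </forme>
  </connecteur>
  <connecteur cat="adv" id="c226" relations="continuation" type="coord">
    <forme> ensuite </forme>
  </connecteur>
</lexicon>
"""

# DiMLex-style entry abridged from the slide.
DIMLEX = """
<entry id="k5" word="allerdings">
  <disambi><conn_d>1</conn_d><sem_d>0</sem_d></disambi>
  <syn><cat>konnadv</cat>
    <ordering><ante>0</ante><post>1</post><insert>0</insert></ordering>
  </syn>
</entry>
"""


def lexconn_relations(xml_text):
    """Map each connective form to the relations it is listed with."""
    relations = {}
    for conn in ET.fromstring(xml_text).iter("connecteur"):
        form = conn.findtext("forme", default="").strip()
        relations.setdefault(form, []).append(conn.get("relations"))
    return relations


def dimlex_summary(xml_text):
    """Collect a few raw fields of one entry, without interpreting them."""
    entry = ET.fromstring(xml_text)
    ordering = entry.find("syn/ordering")
    return {
        "word": entry.get("word"),
        "category": entry.findtext("syn/cat"),
        "conn_d": entry.findtext("disambi/conn_d"),
        "sem_d": entry.findtext("disambi/sem_d"),
        # Names of the ordering slots whose value is "1" in the entry.
        "ordering_slots": [slot.tag for slot in ordering if slot.text == "1"],
    }


if __name__ == "__main__":
    print(lexconn_relations(LEXCONN))
    print(dimlex_summary(DIMLEX))
```

Grouping the Lexconn entries by form shows that one connective (here "ensuite") can be listed once per relation, which is the kind of information a Lexconn/DiMLex correspondence would need to line up.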
Corpora

Corpora: two subgroups
• available and some XML
• Connectives (etc), à la PDTB or similar
  – French Discourse Treebank
  – ? CSTNews Corpus (Brazilian Portuguese)
  – Potsdam Commentary Corpus (German)
  – Prague Dependency Treebank (Czech, English)
  – Penn Discourse Treebank (English)
  – Turkish Discourse Bank
  – RST Signaling Corpus (English)
• Coherence relations à la RST, SDRT, Sanders et al.
  – Annodis (French)
  – Potsdam Commentary Corpus (German)
  – DiscAn (Dutch)
  – RST Signaling Corpus (English)
  – Basque RST Treebank
  – Multilingual RST Treebank (English, Spanish, Basque)