C-ORAL-ROM: Integrated Reference Corpora for Spoken Romance Languages
The C-ORAL-ROM book and DVD provide a unique set of comparable corpora of spontaneous speech for the main Romance languages: French, Italian, Portuguese and Spanish. The corpora are accompanied by comparative linguistic studies, models and standard linguistic measures of spoken language variability. Each corpus is built to the same design using identical sampling techniques, and each is presented in multimedia format, allowing simultaneous access to aligned acoustic and textual information. Each text carries a header with information about provenance, participants, etc., and the transcriptions mark changes of speaker. Speech acts are tagged on the basis of prosodic criteria. Each corpus totals 300,000 words and presents formal and informal speech across a variety of contexts of use, dialogue structures, text genres, semantic domains and speech act typologies. The corpora have great statistical relevance for spoken language structures and can address key issues in human language technology, such as speech recognition in unrestricted discourse, the suitability of speech synthesis for natural prosody, and multilingual applications of the spoken language interface. The work provides new data and innovative theoretical perspectives relevant to corpus linguistics, Romance linguistics, syntactic theory, speech and prosody research, and second language acquisition.
Contents
CHAPTER | 1
APPENDIX | 4
CHAPTER 2 | 71
CHAPTER 3 | 111
CHAPTER 4 | 135
CHAPTER 5 | 163
CHAPTER 6 | 209
Evaluation of consensus on the annotation of terminal and nonterminal | 257
Binary comparison file | 263
Discussion | 272
Other editions
C-ORAL-ROM: Integrated Reference Corpora for Spoken Romance Languages. Emanuela Cresti, Massimo Moneglia. 2005.
C-ORAL-ROM: Integrated Reference Corpora for Spoken Romance Languages, Volume 1. Emanuela Cresti, Massimo Moneglia. 2005.
Common terms and phrases
according adjectives adverbs alignment allows analysis annotation automatic boundaries C-ORAL-ROM Class collected common compared comparison compound concerned considered context conversations coordination corpora corpus corresponding defined definition determiners dialogue discourse distinction distribution domain elements errors evaluation event example expressions fact Figure formal forms four fragmentation French frequency functions given included Italian label Lemma lexical linguistic marked markers means morpho-syntactic multiword names noted nouns occur original percentage performed person phenomena Portuguese PoS tagging position possible present pronouns prosodic break Rank Lemma recorded reference regarding relation relative relevant represented resource respect Romance rules sampling shows simple sound Spanish speaker specific speech spoken language spontaneous speech strategy structure studies syntactic Table tagger tagging tagset terminal transcribed transcription turn types unit utterance variation verbal verbless verbs words written
References to this book
Reduced Parenthetical Clauses as Mitigators: A Corpus Study of Spoken French ... Stefan Schneider. 2007.
Der Imperativ im gesprochenen Italienisch: Form und Funktion im ... Friederike Kleinknecht. 2008.