You're viewing this item in the new Europeana website. View this item in the original Europeana.

Korpus ORAL: sestavení, lemmatizace a morfologické značkování

The ORAL corpus: construction, lemmatization and morphological tagging

The goal of this paper is to provide an overview of the structure and contents of the soon-to-be available ORAL corpus, which combines previously published corpora (ORAL2006, ORAL2008 and ORAL2013) with newly transcribed material into a single conveniently accessible and more richly annotated resource, about 6 million running words in length. The recordings and corresponding transcripts span a dec…