Skip to content
Research data finder
FI|EN

FIN-CLARIN

Followers 0

Search for datasets

368 datasets found
  • Metadata: 2/5

    Open Richly Annotated Cuneiform Corpus, Korp Version, May 2019

    Open Richly Annotated Cuneiform Corpus (Oracc) brings together the work of several Assyriological projects to publish online editions of cuneiform texts. The Korp version of Oracc allows extensive searches on the texts and presents the results as a KWIC concordance list. Korp also offers statistical information and comparison of the search results....
  • Metadata: 1/5

    The Yle MeMAD Media Corpus

    The corpus contains video programs from the archives of Yle, The Finnish Broadcasting Company. Journalistic programs (news, current affairs etc, no drama) have been selected on various topics and from time period ranging from 1966 to 2018. Each browse-quality video file is accompanied with their descriptive metadata and subtitles. Main audio and subtitle...
  • Metadata: 2/5

    Wanca, Korp Version

    The Korp version of Wanca is a collection of web corpora in small Uralic languages. The collection is composed of 29 sentence corpora in different languages. The corpora have been collected from the Internet using the automated system developed in the Finno-Ugric Languages and the Internet project (SUKI) supported by the Kone foundation from their...
  • Metadata: 2/5

    Multimodal Translation with the Blind: Team

    The mutable-team subcorpus is part of the MUTABLE corpus (Multimodal Translation with the Blind), which entails video recordings of the work processes related to audio description as well as of the interaction between sighted and blind participants. The mutable-team subcorpus consists of appr. 25 h of video of authentic teamwork and the respective...
  • Metadata: 2/5

    Multimodal Translation with the Blind: Art

    The mutable-art subcorpus is part of the MUTABLE corpus (Multimodal Translation with the Blind), which entails video recordings of the work processes related to audio description as well as of the interaction between sighted and blind participants. The mutable-art subcorpus consists of appr. 2 h of video of authentic live audio description in art...
  • Metadata: 1/5

    Finnish Parliament original statutes from 1734-2018, downloadable version

    Finnish Parliament original statutes in Finnish from 1734, 1868, 1889, 1895, 1896, 1898, 1901, 1906, 1907 and 1917-2018 and in Swedish from 1920-2018. The statutes are published in the Language Bank's Download service at korp.csc.fi/download in vrt format.
  • Metadata: 1/5

    Finnish News Agency Archive

    The Finnish News Agency Archive corpus comprises newswire articles made public by the Finnish News Agency (STT) during1992 to 2018. The corpora will be available through the corpus interface Korp (korp.csc.fi) as scrambled sentences (CC BY NC) and in the download service as whole texts (CLARIN RES).
  • Metadata: 1/5

    Finnish Supreme and Supreme Administrative Court decisions from 1980-2018 in ...

    Finnish Supreme Court (KKO) decisions in Swedish from 1980-2018 and Supreme Administrative Court (KHO) decisions from 2001-2018 in Swedish. The decisions are available in vrt format. KKO decisions: 5688. KHO decisions: 2603. For most decisions, the language used in court has been Finnish. In that case, the document contains just an abstract in Swedish.
  • Metadata: 1/5

    Finnish Supreme and Supreme Administrative Court decisions from 1980-2018 in ...

    A collection of Finnish Supreme Court (KKO) decisions from 1980-2018 and Supreme Administrative Court (KHO) decisions from 2001-2018. The decisions are in Swedish. The decisions are available in the Korp interface korp.csc.fi. KKO decisions: 5688. KHO decisions: 2603. For most decisions, the language used in court has been Finnish; in that case, there is...
  • Metadata: 1/5

    Finnish Supreme and Supreme Administrative Court decisions from 1980-2018 in ...

    Finnish Supreme Court (KKO) decisions in Finnish from 1980-2018 and Supreme Administrative Court (KHO) decisions from 1987-2018 in Finnish. The decisions are available in vrt format. KKO decisions: 5651. KHO decisions: 7633. For most decisions, the language used in court has been Finnish. In that case, the document contains the whole decision. If the...
  • Metadata: 1/5

    Finnish Parliament original statutes from 1734-2018 in Finnish, Korp version

    Finnish Parliament original statutes in Finnish from 1734, 1868, 1889, 1895, 1896, 1898, 1901, 1906, 1907 and 1917-2018. The statutes are available in the Korp interface korp.csc.fi.
  • Metadata: 1/5

    Finnish Parliament original statutes from 1920-2018 in Swedish, Korp version

    A collection of Finnish Parliament original statutes in Swedish from 1920-2018. The statutes are available in the Korp interface korp.csc.fi
  • Metadata: 1/5

    Finnish Parliament original statutes from 1920-2018, Korp version (Finnish-Sw...

    A collection of Finnish Parliament original statutes in Finnish and Swedish from 1920-2018. The statutes are available in the Language Bank of Finland Korp service korp.csc.fi
  • Metadata: 1/5

    Finnish Supreme and Supreme Administrative Court decisions from 1980-2018 in ...

    Finnish Supreme Court (KKO) decisions from 1980-2018 and Supreme Administrative Court (KHO) decisions from 1987-2018. The decisions are in Finnish. The decisions are available in the Korp interface korp.csc.fi. KKO decisions: 5651. KHO decisions: 7633. For some decisions, the language used in court has been Swedish; in that case the Finnish version...
  • Metadata: 2/5

    Samples of Northern Saami

    The corpus contains audio samples of spoken Northern Saami dialects (Sea Saami, Finnmark Saami and Torne Saami). It is available in LAT (https://lat.csc.fi/). Each audio file contains one interview. The material has been morphologically glossed and the transcripts have been translated into Finnish and English. log 26.11.2018 link...
  • Metadata: 2/5

    Helsinki Corpus TEI-XML Edition (2011), Korp

    Information on the corpus: http://www.helsinki.fi/varieng/CoRD/corpora/HelsinkiCorpus/HC_XML.html The corpus will be made available at https://korp.csc.fi/ For detailed information on the license of the resource see http://urn.fi/urn:nbn:fi:lb-2019061301
  • Metadata: 2/5

    Helsinki Corpus of English Texts (1991)

    The Helsinki Corpus of English Texts is a structured multi-genre diachronic corpus, which includes periodically organized text samples from Old, Middle and Early Modern English. Each sample is preceded by a list of parameter codes giving information on the text and its author. The Corpus is useful particularly in the study of the change of linguistic...
  • Metadata: 2/5

    Yle Swedish News Archive 2012-2018, source

    The corpus, containing the articles from Svenska YLE https://svenska.yle.fi from 2012 onwards up to 2018 inclusive, is available at korp.csc.fi/download The licence is available at http://urn.fi/urn:nbn:fi:lb-2019032201 Change Log 8.4.2019 "motivated application" requirement removed from License
  • Metadata: 2/5

    Yle Finnish News Archive 2011-2018, source

    The corpus, containing the articles from YLE https://yle.fi from 2011-2018, is available at korp.csc.fi/download The licence is available at http://urn.fi/urn:nbn:fi:lb-2019032203 log: 6.3.2019 edited time frame 2017 into 2018 13.3.2019. edited name and shortname 15.3.2019. edited name, license 8.4.2019 "motivated application" requirement removed from...
  • Metadata: 2/5

    Fenno-Ugrica, Kielipankki Version

    The Kielipankki version of Fenno-ugrica (http://urn.fi/urn:nbn:fi:lb-2014073056) is available in Kielipankki - the Language Bank of Finland at http://urn.fi/urn:nbn:fi:lb-2015103001 An updated version of this corpus will be published in Korp (www.korp.csc.fi) in 2017. Change Log: 26.2.2019 - Missing Eastern Mari added to location-URN...