Research data finder

Search for a Dataset

668 datasets found
  • Metadata: 2/5

    Iijoki, Oulun yliopiston Päätalo-kokoelma

    The Iijoki corpus of Kielipankki (the Language Bank of Finland) will be published in the Korp concordance service at korp.csc.fi. The Iijoki corpus is the autobiographical main work of the author Kalle Päätalo (11.11.1919-20.11.2000), deposited in Kielipankki by the University of Oulu. Päätalo can be characterized as a unique depicter of recent Finnish history and work, as well as the Koillismaa dialect's...
  • Metadata: 2/5

    Finnish News Agency Archive 1992-2018, source

    The Finnish News Agency Archive corpus comprises newswire articles in Finnish sent to media outlets by the Finnish News Agency (STT) between 1992 and 2018. The corpus includes about 2.8 million items in total. Most of the material is news articles that vary from short “news flashes” to telegrams and longer articles. News articles are categorized by department...
  • Metadata: 1/5

    Finnish News Agency Archive

    The Finnish News Agency Archive corpus comprises newswire articles made public by the Finnish News Agency (STT) during the 2000s. More detailed information about the time frame will be available when the corpus is published. The corpus will be available through the corpus interface Korp (korp.csc.fi) as scrambled sentences (CC BY NC) and in the...
  • Metadata: 2/5

    Yle Finnish News Archive 2011-2018, source

    The corpus, containing articles from YLE (https://yle.fi) from 2011-2018, is available at korp.csc.fi/download. The licence is available at http://urn.fi/urn:nbn:fi:lb-2019032203. Log: 6.3.2019 edited time frame 2017 into 2018; 13.3.2019 edited name and shortname; 15.3.2019 edited name, license; 8.4.2019 "motivated application" requirement removed from...
  • Metadata: 4/5

    Secondary-aged learners' practices for information seeking and evaluation in ...

    Interviews with students: The data collected in the CogAHealth project, funded by the Academy of Finland, consist of, among other data, transcripts of interviews with secondary school students (Grades 8, 9, and 10) in northern Finland. The interviews (N8 = 17, N9 = 16, N10 = 4) were semi-structured, based on tailored interview frameworks, focusing on either...
  • Metadata: 2/5

    The Morpho-Syntactic Database of Mikael Agricola's Works

    The database (beta) is available through the Korp interface at http://korp.csc.fi/ via http://urn.fi/urn:nbn:fi:lb-201803273. The Morpho-Syntactic Database of Mikael Agricola's Works contains the Finnish parts of Mikael Agricola’s works (Abckiria, Rukouskiria, Se Wsi testamenti, Käsikiria, Messu, Piina, Psaltari, Veisut, Profeetat). The database was...
  • Metadata: 2/5

    Aalto University DSP Course Conversation Corpus 2013-2015, Helsinki Korp Version

    The resource, a variant of Aalto University DSP Course Conversation Corpus 2013-2015, Downloadable Version (http://urn.fi/urn:nbn:fi:lb-2016051604), is available at korp.csc.fi.
  • Metadata: 2/5

    Plenary Sessions of the Parliament of Finland, Kielipankki Korp Version 1.1

    The corpus, containing the first version of the transcriptions of the plenary sessions of the Parliament of Finland from 10.09.2008 to 1.7.2016, is available in Kielipankki, the Language Bank of Finland (Korp service), see Access location.
  • Metadata: 3/5

    University of Oulu Kikosa Collection

    The Kikosa Collection consists of video-recorded everyday interactions among multicultural families and groups of friends. The collection is housed at the University of Oulu Department of Languages and Literature, and it can be used for studies of language and interaction.
  • Metadata: 2/5

    Finnish Corpus (Literature) (UHLCS)

    The corpus is available in Kielipankki - the Language Bank of Finland (taito-shell.csc.fi, access rights instructions: http://www.kielipankki.fi/access). Contents: HKV corpus: consists of samples of Finnish literature representing various text types. The corpus is documented in the following publication: Auli Hakulinen & Fred Karlsson &...
  • Metadata: 2/5

    The Downloadable Version of the Finnish Text Collection

    The resource is available in Kielipankki - the Language Bank of Finland at http://urn.fi/urn:nbn:fi:lb-2014052719. For more information, see http://urn.fi/urn:nbn:fi:lb-201403268. Corpus location instructions: https://www.kielipankki.fi/support/corpus-location/ (in Finnish: https://www.kielipankki.fi/tuki/aineiston-sijainti-kielipankissa/).
  • Metadata: 2/5

    The "Hallituskausi 2011–2015" Translation Memory

    The "Hallituskausi 2011–2015" translation memory is intended for those translating administrative texts between Finnish and English. It includes key policy reports published by the Finnish ministries on their websites during the ongoing electoral period. The memory features some 11,000 Finnish-to-English translation segments. The translation memory runs...
  • Metadata: 2/5

    The "Hallituskausi 2007–2011" Translation Memory

    The "Hallituskausi 2007–2011" translation memory is intended for those translating administrative texts between Finnish and English. It includes key policy reports published by the Finnish ministries on their websites. The memory features some 58,000 Finnish-to-English translation segments. The tmx format requires an SDL Trados Studio programme, whereas...
  • Metadata: 2/5

    The FinINTAS corpus of spontaneous and read-aloud Finnish speech

    The FinINTAS corpus consists of two subcorpora: FinDialogue, ten spontaneous dialogues between friends, duration 45-55 minutes each; and FinRead, read-aloud speech from the same speakers. The speakers were native Finns from the capital city region in Finland. Ten speakers were 20 to 30 years of age, whereas the rest of the speakers were between 45-65...
  • Metadata: 2/5

    FinDe Corpus

    This contrastive language corpus contains German and Finnish literature and press texts and their respective translations into the other language. Log: 25.11.2018 link islrn.org/resources/026-780-830-030-5 removed.
  • Metadata: 2/5

    Corpus of Age-related Voice Disguise

    This corpus includes normal and age-related disguised speech uttered by 60 native Finnish speakers (31 females and 29 males). The speakers were asked to read the same text fragments several times, in their modal voice and in two disguised voices, first pretending to be an elderly speaker and then pretending to be a child. The texts consisted of the...
  • Metadata: 2/5

    Samples of Northern Saami

    The corpus contains audio samples of spoken Northern Saami dialects (Sea Saami, Finnmark Saami and Torne Saami). It will be published in LAT (https://lat.csc.fi/). Each audio file contains one interview. The material has been morphologically glossed and the transcripts have been translated into Finnish and English. log 26.11.2018 link...
  • Metadata: 2/5

    The Advanced Finnish Learners’ Corpus, Downloadable Version

    The resource, which is the downloadable version of The Advanced Finnish Learners’ Corpus, is available at http://urn.fi/urn:nbn:fi:lb-2018050402. For more information, see http://urn.fi/urn:nbn:fi:lb-201407167. The purpose of the resource use must be outlined in a research plan. Distribution of copies is not allowed. If the resource is used as material...
  • Metadata: 3/5

    The HS.fi News and Comments Corpus

    The corpus is available in the Language Bank's Korp service (http://urn.fi/urn:nbn:fi:lb-2015050503). The HS.fi News and Comments Corpus contains the domestic news of the Helsingin Sanomat website and their comments from 5.9.2011 to 4.9.2012. The corpus starts with the first news item of 5.9.2011 and ends with a news item published in the morning on 3.9.2012 and...
  • Metadata: 3/5

    Academic publisher costs in Finland 2010–2017

    This dataset includes academic publisher costs paid by Finnish research organizations to publishers and suppliers during the years 2010–2017. The dataset includes the total costs of license contracts made with individual publishers or suppliers, as well as information on the different materials and types the contracts included. Also included is the...