Research data finder
IMPORTANT INFORMATION ABOUT ETSIN! The old Etsin (etsin.avointiede.fi) will be migrated to the new Etsin (etsin.fairdata.fi) at the end of June 2019. After the migration, all PUBLISHED datasets will be visible in the new Etsin.
Describing datasets in Etsin will not be possible after 12 June 2019. Instead, datasets will be described in the new metadata tool, Qvain, which will be launched at the beginning of July 2019.
Note! Remember to publish your dataset if you want it to be migrated to the new Etsin.

Search for a Dataset

6 datasets found
  • Metadata: 2/5

    Corpus of Historical American English - Kielipankki Korp version 2017H1

    The corpus is available in Kielipankki - the Language Bank of Finland (korp.csc.fi). The Corpus of Historical American English (COHA) contains about 385 million words and 115 000 texts from the years 1810-2009. Each decade has roughly the same balance of fiction, popular magazine, newspaper, and non-fiction books. Access and license: This version of the...
  • Metadata: 2/5

    Corpus of Contemporary American English - Kielipankki Korp version 2017H1

    The corpus is available in Kielipankki - the Language Bank of Finland (korp.csc.fi). The Corpus of Contemporary American English (COCA) contains about 440 million words and 190 000 texts from the years 1990-2012. The corpus is evenly divided into spoken, fiction, magazine, newspaper, and academic genres (~88 million words each). Access and license: This...
  • Metadata: 2/5

    Corpus of Global Web-Based English - Kielipankki Korp version 2017H1

    The corpus is available in Kielipankki - the Language Bank of Finland (korp.csc.fi). The Corpus of Global Web-Based English (GloWbE) contains about 1.8 billion words and 1 800 000 texts from web pages in the United States, Great Britain, Australia, India, and 16 other countries. About 60% of the texts come from blogs. Access and license: This version of the...
  • Metadata: 2/5

    Corpus of Contemporary American English - Kielipankki download version 2017H1

    The corpus is available in Kielipankki - the Language Bank of Finland for download. The Corpus of Contemporary American English (COCA) contains about 440 million words and 190 000 texts from the years 1990-2012. The corpus is evenly divided into spoken, fiction, magazine, newspaper, and academic genres (~88 million words each). License details: Researchers in...
  • Metadata: 2/5

    Corpus of Contemporary American English - Kielipankki

    The Corpus of Contemporary American English (COCA) contains about 440 million words and 190 000 texts from the years 1990-2012. The corpus is evenly divided into spoken, fiction, magazine, newspaper, and academic genres (~88 million words each). License details: The corpus is available for searching by logged-in staff and students of the FIN-CLARIN member...
  • Metadata: 2/5

    Corpus of Global Web-Based English - Kielipankki download version 2017H1

    The corpus is available in Kielipankki - the Language Bank of Finland for download. The Corpus of Global Web-Based English (GloWbE) contains about 1.8 billion words and 1 800 000 texts from web pages in the United States, Great Britain, Australia, India, and 16 other countries. About 60% of the texts come from blogs. License details: Researchers in the...