Skip to content
Research data finder
FI|EN

IMPORTANT INFORMATION ABOUT ETSIN! Old Etsin (etsin.avointiede.fi) will be migrated into new Etsin (etsin.fairdata.fi) at the end of June 2019. After the migration all PUBLISHED datasets will be visible in new Etsin.
Describing the datasets in Etsin will not be possible after 12th June 2019. Instead, describing the datasets will be done in new metadata tool, Qvain, which will be launched at the begin of July 2019.
Note! Remember to publish your dataset if you want it to be migrated into new Etsin.

Search for a Dataset

3 datasets found
  • Metadata: 2/5

    Helsinki Corpus of Swahili 2.0 (HCS 2.0)

    Helsinki Corpus of Swahili 2.0 is now available for research purposes in Kielipankki - the Language Bank of Finland. The corpus contains about 25 million words of written text, and it is available in two formats. The annotated version contains morphological and syntactic annotation as well as glosses in English. The not annotated version contains plain...
  • Metadata: 2/5

    Helsinki Corpus of Swahili 2.0 (HCS 2.0) Annotated Version

    The Helsinki Corpus of Swahili 2.0 Annotated Version containing about 25 million words is available in Kielipankki - the Language Bank of Finland in Korp (https://korp.csc.fi/) for academic use. This means that students and staff of universities can use the corpus by simply logging in with their university credentials. Alumni have the option to apply for...
  • Metadata: 2/5

    Helsinki Corpus of Swahili 2.0 (HCS 2.0) Not Annotated Version

    The Helsinki Corpus of Swahili 2.0 Not Annotated Version, consisting of plain text without linguistic codes, contains about 25 million words. The corpus is available in Kielipankki, the Language Bank of Finland, download location: http://urn.fi/urn:nbn:fi:lb-2016042801 Preparation of the material Most of the corpus material was retrieved from the Web....