Skip to content
Research data finder
FI|EN

IMPORTANT INFORMATION ABOUT ETSIN! Old Etsin (etsin.avointiede.fi) will be migrated into new Etsin (etsin.fairdata.fi) at the end of June 2019. After the migration all PUBLISHED datasets will be visible in new Etsin.
Describing the datasets in Etsin will not be possible after 12th June 2019. Instead, describing the datasets will be done in new metadata tool, Qvain, which will be launched at the begin of July 2019.
Note! Remember to publish your dataset if you want it to be migrated into new Etsin.

Search for a Dataset

14 datasets found
  • Metadata: 2/5

    The Tampere Bilingual Corpus of Finnish and English

    The Tampere Bilingual Corpus of Finnish and English consists of: a) a fiction sub-corpus, meaning long extracts (15,000 words/50 pages) from 16 English novels (plus their Finnish translations) and similar extracts 16 Finnish novels (plus their English translations); b) a non-fiction sub-corpus, meaning long extracts (and their translations) from...
  • Metadata: 2/5

    Statistics Finland Translation Memory Finnish-English

    Translation memory used by English translators at Statistics Finland. The resource will be made available for download at https://kielipankki.fi/download/
  • Metadata: 2/5

    JRC-Acquis Multilingual Parallel Corpus

    The Acquis Communautaire (AC) is the total body of European Union (EU) law applicable in the the EU Member States. This collection of legislative text changes continuously and currently comprises selected texts written between the 1950s and now. As of the beginning of the year 2007, the EU had 27 Member States and 23 official languages. The Acquis...
  • Metadata: 4/5

    DigiSami Conversational Speech

    Introduction The DigiSami project (www.helsinki.fi/digisami/) aims to study the effect of digitalisation on small Finno-Ugric language communities and to support visibility and revitalisations of the endangered languages by creating digital content as well as developing language and speech technology tools, resources, and applications that can be used for...
  • Metadata: 4/5

    DigiSami Read Speech

    Introduction The DigiSami project (www.helsinki.fi/digisami/) aims to study the effect of digitalisation on small Finno-Ugric language communities and to support visibility and revitalisations of the endangered languages by creating digital content as well as developing language and speech technology tools, resources, and applications that can be used for...
  • Metadata: 2/5

    Total Suspended Matter, FI-LAKES, 2012-06-20

    Spatial distribution of quantitative water quality parameters, generated by the Modular Inversion and Processing System (MIP) of EOMAP / Germany. Product type: Total Suspended Matter (TSM) in [mg/l] or related Turbidity in [NTU]; Flags: greyvalue 0 land or not classified; greyvalue 250-254 unreliable pixel; 255 cloud;...
  • Metadata: 2/5

    Total suspended matter, FI-SOUTH, 2009-09-15

    Spatial distribution of quantitative water quality parameters, generated by the Modular Inversion and Processing System (MIP) of EOMAP / Germany. Product type: Total Suspended Matter (TSM) in [mg/l]; Flags: greyvalue 0 land or not classified; greyvalue 250-254 unreliable pixel; 255 cloud; Calculation of concentrations...
  • Metadata: 4/5

    Finnish First Encounter

    Introduction The research material consists of the Finnish first encounters dialogue corpus collected as part of the NOMCO project, a Nordic cooperation project. The project aim for developing and analyzing multi-modal spoken language corpora in the Nordic countries, and to compare communication strategies in three closely related languages (Danish,...
  • Metadata: 4/5

    Estonian First Encounter

    Introduction Within the project MINT (Multimodal Interaction – intercultural and technological aspects of video data collection, analysis, and use) we have collected a corpus of Estonian First Encounter dialogs. The goals of the MINT project are: to create Estonian multi-modal video corpus on various conversational activities, to provide analysis and...
  • Metadata: 3/5

    WordNet

    WordNet is a large lexical database of English. Nouns, verbs, adjectives and adverbs are grouped into sets of cognitive synonyms (synsets), each expressing a distinct concept. Synsets are interlinked by means of conceptual-semantic and lexical relations. This RDF version of WordNet 3.1 incorporates direct links to the previous W3C WordNets, UBY,...
  • Metadata: 4/5

    Halias Bird Observation Data as Linked Data

    This dataset contains systematic bird observations made at the Hanko Bird Observatory (Halias) during ca. 30 years. The data is provided originally by the Helsingin Seudun Lintutieteellinen Yhdistys Tringa r.y. The data is integrated with weather observation data for the same time period. The weather data comes from the Finnish Meteorological Institute....
  • Metadata: 4/5

    Six Degrees of Francis Bacon

    Six Degrees of Francis Bacon is a digital reconstruction of the early modern social network (EMSN) that scholars and students from all over the world will be able to collaboratively expand, revise, curate, and critique. Historians and literary critics have long studied the way that early modern people associated with each other and participated in various...
  • Metadata: 4/5

    FBTEE: The French Book Trade in Enlightenment Europe

    The French Book Trade in Enlightenment Europe (FBTEE) project is a digital humanities project of international significance mapping the production, marketing, dissemination, policing, and reception of books (and hence ideas) in the late eighteenth century. It aims to bring together and make interoperable and publicly available in a single digital resource...
  • Metadata: 3/5

    Dynamic system of scots pine needles.

    This dataset has no description