4 2026-01-26 fi

Registry Contact Details

Contact Name: Registry support
Email address: support@bigroom.eco

Label Generation Rules for Finnish

Overview

This document specifies a set of Label Generation Rules (LGR) for the Finnish language using a language-specific repertoire for the second level domain or domains identified above. The format of this file follows [RFC 7940]. This LGR is adapted from the “Reference LGR for the Second Level for the Finnish Language” [Ref-LGR-fi-Latn]; for details, see Change History below.

Standalone LGR: This LGR is designed to be used in a zone that does not cater to IDNs other than those valid under this LGR. This LGR lacks features that would allow its use in the context of another LGR in the same zone, and it may contain other features incompatible with such use.

Repertoire

Most references converge on 30 Latin code points.

The .fi ccTLD's list of IDN characters includes the core Finnish alphabet as well as extensions for the Sami languages written in Finland.

Excluded code points

Letters documented in some references but not included:

U+00E0 LATIN SMALL LETTER A WITH GRAVE
U+00E7 LATIN SMALL LETTER C WITH CEDILLA
U+00E8 LATIN SMALL LETTER E WITH GRAVE
U+00E9 LATIN SMALL LETTER E WITH ACUTE
U+00EA LATIN SMALL LETTER E WITH CIRCUMFLEX
U+00EB LATIN SMALL LETTER E WITH DIAERESIS
U+00ED LATIN SMALL LETTER I WITH ACUTE
U+00EE LATIN SMALL LETTER I WITH CIRCUMFLEX
U+00EF LATIN SMALL LETTER I WITH DIAERESIS
U+00F0 LATIN SMALL LETTER ETH

The following code points are not included in the current repertoire; they are extensions needed to write Lule Sami, Skolt Sami, and Northern Sami, minority languages in Finland.

U+010D LATIN SMALL LETTER C WITH CARON
U+0111 LATIN SMALL LETTER D WITH STROKE
U+014B LATIN SMALL LETTER ENG
U+0167 LATIN SMALL LETTER T WITH STROKE
U+01E5 LATIN SMALL LETTER G WITH STROKE
U+01E7 LATIN SMALL LETTER G WITH CARON
U+01E9 LATIN SMALL LETTER K WITH CARON
U+01EF LATIN SMALL LETTER EZH WITH CARON
U+0292 LATIN SMALL LETTER EZH

Note: the code point U+00E1 LATIN SMALL LETTER A WITH ACUTE (Northern Sami, Inari Sami) is already part of the core repertoire for other uses, and the code point U+00F5 LATIN SMALL LETTER O WITH TILDE (Skolt Sami) is part of the set of extended code points.
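
As an illustration of how the Sami exclusions listed above affect a candidate label, the following Python sketch reports which of those code points a label contains. The function name `excluded_in` and the constant `EXCLUDED_SAMI` are hypothetical names chosen for this example; they are not part of the LGR. The sketch assumes the label should first be normalized to NFC.

```python
import unicodedata

# Sami extension code points listed above as outside the current repertoire.
EXCLUDED_SAMI = {0x010D, 0x0111, 0x014B, 0x0167, 0x01E5,
                 0x01E7, 0x01E9, 0x01EF, 0x0292}


def excluded_in(label: str) -> list[str]:
    """Return 'U+XXXX NAME' for each excluded code point in the label, in order."""
    # Assumption: compare precomposed forms, so normalize to NFC first.
    normalized = unicodedata.normalize("NFC", label)
    found = []
    for ch in normalized:
        if ord(ch) in EXCLUDED_SAMI:
            found.append(f"U+{ord(ch):04X} {unicodedata.name(ch)}")
    return found
```

A label that yields a non-empty list contains at least one code point outside the repertoire and would not be valid under this LGR.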

Extended code points

A number of letters not considered essential to writing the core vocabulary of the language are nevertheless in common use. Where they have not been added to the core repertoire, they are flagged as “extended-cp” in the table of code points. A context rule is provided that by default will prohibit labels with such extended code points. To support extended single code points or code point sequences, delete the context “extended-cp” from their repertoire definition.
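
The default behavior described above can be sketched in Python as follows. This is a simplified model, not the LGR's actual rule syntax: `CORE` is a hypothetical stand-in for the core repertoire (the authoritative list is the table of code points), `EXTENDED` holds only U+00F5, which the text above names as an extended code point, and `enabled_extended` models deleting the “extended-cp” context from selected entries.

```python
# Hypothetical stand-in for the core repertoire; the authoritative list is
# the table of code points in the LGR itself.
CORE = set("abcdefghijklmnopqrstuvwxyz") | {"\u00e4", "\u00f6"}

# U+00F5 is named in the text as an extended code point.
EXTENDED = {"\u00f5"}


def is_valid(label: str, enabled_extended=frozenset()) -> bool:
    """Accept a label only if every code point is core or an enabled extended one."""
    for ch in label:
        if ch in CORE:
            continue
        # Extended code points are prohibited unless explicitly enabled,
        # mirroring the default effect of the "extended-cp" context rule.
        if ch in EXTENDED and ch in enabled_extended:
            continue
        return False
    return True
```

With the default arguments, any label containing U+00F5 is rejected; passing `enabled_extended={"\u00f5"}` corresponds to a registry that has chosen to support that code point.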

Variants

No variants are applicable when using the LGR in a standalone fashion.

Character Classes

This LGR does not define named character classes.

Rules

Common Rules

By default, the LGR includes the rules and actions to implement the following restrictions mandated by the IDNA protocol. They are marked with ⍟.

Hyphen Restrictions — restrictions on the allowable placement of hyphens (no leading/ending hyphen and no hyphen in positions 3 and 4). These restrictions are described in Section 4.2.3.1 of RFC 5891 [150]. They are implemented here as a context rule on U+002D (-) HYPHEN-MINUS.
Leading Combining Marks — restrictions on the allowable placement of combining marks (no leading combining mark). This rule is described in Section 4.2.3.2 of RFC 5891 [150].
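
The two common rules above can be approximated with the following Python sketch. It is a plain reading of Sections 4.2.3.1 and 4.2.3.2 of RFC 5891 applied to a Unicode label, not the LGR's own XML rules; both function names are hypothetical.

```python
import unicodedata


def violates_hyphen_restrictions(label: str) -> bool:
    """True if the label starts or ends with '-' or has '--' in positions 3 and 4."""
    if label.startswith("-") or label.endswith("-"):
        return True
    # Positions 3 and 4 (1-based) are indexes 2 and 3 (0-based).
    return label[2:4] == "--"


def has_leading_combining_mark(label: str) -> bool:
    """True if the first code point has a General_Category of Mark (Mn, Mc, Me)."""
    return bool(label) and unicodedata.category(label[0]).startswith("M")
```

A label failing either check would be invalid regardless of whether all of its code points are in the repertoire.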

Actions

This LGR includes the default actions for LGRs as well as the action needed to invalidate labels with misplaced combining marks. They are marked with ⍟. For a description see [RFC 7940].

Methodology and Contributors

The LGR in this document has been adapted from the corresponding Reference LGR for the Second Level. The Second Level Reference LGR for the Finnish Language was developed by Michel Suignard and Asmus Freytag, including input by Michael Everson, Nicholas Ostler, and Wil Tan, and based on multiple open public consultations.

Changes from Version Dated 10 October 2016

Language tag has been updated.

Changes from Version Dated 18 May 2021

Unicode Version has been updated.

Changes from Version Dated 25 October 2024

Adopted from the Second Level Reference LGR for the Finnish Language [Ref-LGR-fi-Latn] without normative changes.

References

General references for the language:

Finska språkbyrån. 1992. In Icelandic Council for Standardization. 1992. Nordic cultural requirements on information technology. Reykjavík: Staðlaráð Íslands. ISBN 9979-9004-3-1
Wikipedia: “Finnish orthography”, https://en.wikipedia.org/wiki/Finnish_alphabet
Omniglot: Finnish (suomi) https://www.omniglot.com/writing/finnish.htm

Other references cited in this document:

[RFC 7940]: Davies, K. and A. Freytag, “Representing Label Generation Rulesets Using XML”, RFC 7940, August 2016, https://www.rfc-editor.org/info/rfc7940
[Ref-LGR-fi-Latn]: ICANN, Second Level Reference Label Generation Rules for the Finnish Language (fi-Latn), 25 October 2024 (XML) https://www.icann.org/sites/default/files/packages/lgr/lgr-second-level-finnish-language-25oct24-en.xml non-normative HTML presentation: https://www.icann.org/sites/default/files/packages/lgr/lgr-second-level-finnish-language-25oct24-en.html
[Unicode 11.0.0]: The Unicode Consortium. The Unicode Standard, Version 11.0.0, (Mountain View, CA: The Unicode Consortium, 2018. ISBN 978-1-936213-19-1) https://www.unicode.org/versions/Unicode11.0.0/

In the listing of the repertoire by code point, references starting from [0] refer to the version of the Unicode Standard in which the corresponding code point was initially encoded. Other references (starting from [100]) document usage of code points. Entries in the table may have multiple source reference values. In the listing of whole label evaluation and context rules, reference [150] indicates the source for common rules. For more details, see the Table of References below.

Table of References

The Unicode Consortium. The Unicode Standard, Version 1.1
Internetstiftelsen i Sverige (IIS), “IDN Reference table for Finnish language” https://github.com/dotse/IDN-ref-tables/blob/master/language-tables/finnish-lang-ref-table.txt accessed on 2016-10-16
RFC 5891, Internationalized Domain Names in Applications (IDNA): Protocol https://tools.ietf.org/html/rfc5891
ISO/IEC 6937 Third Ed. 2001-12-17 - Information technology — Coded graphic character set for text communication — Latin alphabet: Table D.1 (p 35) Use of Latin alphabetic characters.
Everson, Michael. The Alphabets of Europe: “Finnish” https://www.evertype.com/alphabets/finnish.pdf
The Unicode Consortium, Common Locale Data Repository (CLDR) Version 28 (2015-09-16) - Locale Data Summary for Finnish [fi] https://www.unicode.org/cldr/charts/28/summary/fi.html
Finska språkbyrån. 1992. In Icelandic Council for Standardization. 1992. Nordic cultural requirements on information technology. Reykjavík: Staðlaráð Íslands. ISBN 9979-9004-3-1
ISO/IEC 646:1991 — Information technology — ISO 7-bit coded character set for information interchange
Wikipedia: “Latin Alphabets” https://en.wikipedia.org/wiki/Latin_alphabets accessed 2015-10-31
Wikipedia: “Finnish orthography” https://en.wikipedia.org/wiki/Finnish_orthography
Finnish Communications Regulatory Authority, (fi-domain) “Native language characters in domain names” https://domain.fi/info/en/index/hakeminen/mitavoihakea/aakkoset.html