What is an ISO Language Code?

17th December 2014

Emily Robertshaw

The first standardised language code system ISO 639, was introduced by the International Organisation for Standardisation with the aim of representing language and language groups. The main purpose for creating these codes was to ensure that languages and language families, could be recognised internationally regardless of a person’s native language.

The ISO 639 codes now consist of 6 parts and are either 2, 3 or 4 letters long depending on which ISO code is used.

Where would you use ISO codes?

Using ISO codes to represent languages, allows for easier referencing across the world. The standardisation of languages is used for bibliographical referencing in libraries whilst also having a great use in modern technology. Websites use ISO codes to represent different language versions of a website. This allows websites to create different language versions that are optimised for different locations/languages, and display those based on the location of the user.

Different types of ISO code

ISO 639-1 – This is the Alpha-2 code. This was the first ISO code created and features a two letter code to represent a language. There are 136 IS0 639-1 codes representing languages all over the world. Examples of ISO 639-1 codes include:

Code	Language
en	English
de	German
fr	French
es	Spanish

We have also created a handy ISO-639-1 guide, which features a more comprehensive list!

ISO 639-2 – This is a 3 letter code. The 3 letter code offers more possible language combinations and there are 464 codes. The ISO 639-2 coding system was developed as a result of the first ISO coding system not allowing for the formation of groups of languages. This coding system allows for grouping of collective languages; for example:

Code	Language Group
bat	Baltic Languages
cel	Celtic Languages
gem	Germanic Languages
kor	Korean Languages

ISO 639-3 – Like the ISO 639-2, the ISO 639-3 code system is also made up of 3 letters, however this coding system also includes extinct and ancient languages as well as languages still around today. This code doesn’t take into consideration ‘group’ languages, which has led to many codes in ISO 1 and 2 being referred to as ‘macro-languages’ which can be seen to have been broken down into micro languages in ISO 3. This coding system has helped to address the issue of the digital divide, allowing for more languages to be optimised for the web and giving more opportunity for languages to be included online.

ISO 639-4 – This refers to the general principles of language coding and presents guidelines for using the ISO 639 codes.

ISO 639-5 – This code provides a 3 letter reference for language families and groups. This code further expands on the grouping carried out in ISO 639-2.

ISO 639-6 – This is the most recent coding system developed by the ISO and is a 4 letter code that allows us to distinguish between variants of languages and language families. This is the most comprehensive code and attempts to distinguish between as many enumerations of languages as possible including extinct, living, ancient, major, minor languages and even includes all written, spoken and signed languages. The ISO 6 system establishes a hierarchical framework which allows us to create a historical timeline of languages as the code links each language to its principle ancestor. For example;

Code	Languages
eng	English Language
emen	Early Modern English
emse	Early Midland and South Eastern Middle English
menge	Middle English
ango	Anglo Saxon
angl	Anglic
nsea	North Sea
gmcw	Germanic West
grmc	Germanic
ineu	Indo-European
wrld	World