Characteristics of the Turkish language in AX Semantics

Fundamentals

Turkish has no genders for nouns. There are two numbers: singular and plural. Additionally, Turkish has six cases for nouns.

grammatical namevaluesexamples
numbersingulareski araba
(the old car)
pluraleski arabalar
(the old cars)
cases (noun)nominativeev
(the house)
accusativeEvi boyuyorum.
(I paint the house.)
dativeEve yeni bir renk kattım.
(I put new color on the house.)
genitiveevin rengi
(the house's color)
ablativeBir evden diğerine yürüdüm.
(I walked from the house to another.)
locativeEvde kalıyorum.
(I stay at house/home.)
adjectives (noun)before nounkırmızı ev
(red house)
verb tensespresent (imperfective)o satın alıyor
(he is buying)
present (aorist)o satın alır
(he buys)
pasto satın aldı
(he bought)

The standard order of a noun phrase in Turkish is the following:

determiner + numeral + adjective + noun + preposition

Note that in the below example, the noun (kitap) stays in singular form because it is proceeded by a cardinal numeral:

bu             üç     popüler  kitap         hakkında
these[pl,dem]  three  popular  book[sg,nom]  about
DET            NUM    ADJ      NOUN          PREP
"about these three popular books"

Lexicon

Nouns

Turkish nouns are inflected for number and case. Lexicon entries for nouns may also be necessary for inflecting determiners, adjectives and pronouns correctly. They are omitted, if a lexicon entry is required, but missing.

Examples

The basic lexicon entry for ev (house) contains:

  • inflection table for case and number:
SingularPlural
Nominativeevevler
Accusativeevievleri
Dativeeveevlere
Genitiveevinevlerin
Ablativeevdenevlerden
Locativeevdeevlerde

Note

If you need lexicon entries for countries, write to the support about that and you will get them for Turkish with automatic handling of prepositions.

Adjectives

Turkish adjectives do not inflect, therefore no lexicon entries for them need to be created. For adjective position, the default is "before noun".

Verbs

Turkish verbs inflect for person, number, and tense. The most common verbs are encoded in our software. If a verb inflects incorrectly, you should add it to the lexicon.

Container settings

Determiner

The AX NLG platform supports the following determiners for Turkish: indefinite, demonstrative, possessive, and quantifier (every).

Possessive determiners are expressed by adding certain possessive suffixes to the possessor (noun), which are put between the plural suffix and the case suffix. The AX NLG platform will automatically inflect nouns with a possessive suffix, when a possessive determiner is chosen. Take ev (house) for example:

evim [possessive 1st sg] -> "my house"
evl-er-im-i [pl, possessive 1st sg, acc] -> "my houses"

Additionally, when using the indefinite determiner "bir", the word order will change. The AX NLG platform changes the word order automatically, for example:

bu güzel günde [det adj noun] -> "this nice day"
güzel bir gün [adj det noun] -> "a nice day"

Numerals

The noun will automatically agree with the numeral number when a numeral variable is used. Four types of numerals are possible on the AX NLG platform: cardinal, cardinal as digit, ordinal, and ordinal as digit.

As the example below shows, note that nouns (e.g, gün) stay in singular form even when the cardinal numeral is more than 1:

cardinalordinal
textdokuz gün
(nine days)
dokuzuncu gün
(the ninth day)
digit9 gün
(9 days)
9. gün
(the 9th day)

For Turkish, both cardinal and ordinal numerals are written out until 100 on the platform, otherwise (above 100) the output is in digit form. Take cardinal numerals for example:

yüz araba
(one hundred cars)
vs.
101 araba
(101 cars)

Preposition/Postposition

Turkish rather uses postpositions than prepositions. Therefore, the AX NLG platform automatically puts prepositions behind the noun in a container for this language. Prepositions function as postpositions despite being called prepositions in the container, since they are analogous to prepositions in English (both are adpositions).

araba ile [noun postposition] -> "with the car"

Vowel harmony

The AX platform applies vowel harmonies automatically. Vowel harmony means that many inflection suffixes have different variants, that are used depending on what kind of vowels a word contains.

For Turkish vowel harmony depends on whether the last vowel of the word‘s stem or the preceding suffix is a front or back vowel and whether it is rounded or unrounded.

Take Turkish noun pluralization, for example, where the suffix can be either -ler or -lar: if the last vowel of the stem is an undotted/back vowel (a, ı, o, u ), the plural suffix -lar is used. Vice-versa, if the last vowel of the stem is a dotted/front vowel (e, i, ö, ü), -ler is used as the plural suffix.

SingularPlural
göz (eye)gözler (eyes)
kuş (bird)kuşlar (birds)

Consonant Assimilation

The AX platform also applies consonant assimilation automatically. This involves that consonants change from unvoiced consonants (e.g. k/p/ç) to voiced consonants (e.g. ğ/b/c). Consonant assimilation is not applicable for one-syllable words, foreign words and proper nouns.

The following example shows a consonant change from p to b, when adding the possessive suffix to the noun kitap (book):

kitap (book) -> Benim kitabım (my book)

Buffer letters

Turkish makes use of three buffer letters [n/y/s] to keep vowels apart when adding suffixes. The AX platform automatically adds them as need during inflection.

The following example shows a buffer letter being added to a word ending in a vowel, when putting it into the accusative form:

kedi (cat, lemma) -> kediyi (cat, acc. case)