Characteristics of the Turkish language in AX Semantics
Turkish has no genders for nouns. There are two numbers: singular and plural. Additionally, Turkish has six cases for nouns.
(the old car)
(the old cars)
(I paint the house.)
|dative||Eve yeni bir renk kattım.|
(I put new color on the house.)
(the house's color)
|ablative||Bir evden diğerine yürüdüm.|
(I walked from the house to another.)
(I stay at house/home.)
|adjectives (noun)||before noun||kırmızı ev|
|verb tenses||present||o satın alır|
|past||o satın aldı|
The standard order of a noun phrase in Turkish is the following:
determiner + numeral + adjective + noun + preposition. Note that in the below example, the noun (kitap) stays in
singular form because it is proceeded by a cardinal numeral:
bu üç popüler kitap hakkında these[pl,dem] three popular book[sg,nom] about DET NUM ADJ NOUN PREP "about these three popular books"
Turkish nouns are inflected for number and case. Lexicon entries for nouns may also be necessary for inflecting determiners, adjectives and pronouns correctly. They are omitted, if a lexicon entry is required, but missing.
The basic lexicon entry for ev (house) contains:
- inflection table for case and number:
If you need lexicon entries for countries, write to the support about that and you will get them for Turkish with automatic handling of prepositions.
Turkish adjectives do not inflect, therefore no lexicon entries for them need to be created. For adjective position, the default is "before noun".
Turkish verbs inflect for person, number, and tense. The most common verbs are encoded in our software. If a verb inflects incorrectly, you should add it to the lexicon.
The AX NLG platform supports the following determiners for Turkish: indefinite, demonstrative, possessive, and quantifier (every).
Possessive determiners are expressed by adding certain possessive suffixes to the possessor (noun), which are put between the plural suffix and the case suffix. The AX NLG platform will automatically inflect nouns with a possessive suffix, when a possessive determiner is chosen. Take ev (house) for example:
evim [possessive 1st sg] -> "my house" evl-er-im-i [pl, possessive 1st sg, acc] -> "my houses"
Additionally, when using the indefinite determiner "bir", the word order will change. The AX NLG platform changes the word order automatically, for example:
bu güzel günde [det adj noun] -> "this nice day" güzel bir gün [adj det noun] -> "a nice day"
The noun will automatically agree with the numeral number when a numeral variable is used. Four types of numerals are possible on the AX NLG platform: cardinal, cardinal as digit, ordinal, and ordinal as digit.
As the example below shows, note that nouns (e.g, gün) stay in
singular form even when the cardinal numeral is more than 1:
(the ninth day)
(the 9th day)
For Turkish, both cardinal and ordinal numerals are written out until 100 on the platform, otherwise (above 100) the output is in digit form. Take cardinal numerals for example:
yüz araba (one hundred cars) vs. 101 araba (101 cars)
Turkish rather uses postpositions than prepositions. Therefore, the AX NLG platform automatically puts prepositions behind the noun in a container for this language. Prepositions function as postpositions despite being called prepositions in the container, since they are analogous to prepositions in English (both are adpositions).
araba ile [noun postposition] -> "with the car"
The AX platform applies vowel harmonies automatically. Vowel harmony means that many inflection suffixes have different variants, that are used depending on what kind of vowels a word contains.
For Turkish vowel harmony depends on whether the last vowel of the word‘s stem or the preceding suffix is a front or back vowel and whether it is rounded or unrounded.
Take Turkish noun pluralization, for example, where the suffix can be either
-lar: if the last vowel of the stem is an undotted/back vowel (
a, ı, o, u ), the plural suffix
-lar is used. Vice-versa, if the last vowel of the stem is a dotted/front vowel (
e, i, ö, ü),
-ler is used as the plural suffix.
|göz (eye)||gözler (eyes)|
|kuş (bird)||kuşlar (birds)|
The AX platform also applies consonant assimilation automatically. This involves that consonants change from unvoiced consonants (e.g. k/p/ç) to voiced consonants (e.g. ğ/b/c). Consonant assimilation is not applicable for one-syllable words, foreign words and proper nouns.
The following example shows a consonant change from
b, when adding the possessive suffix to the noun kitap (book):
kitap (book) -> Benim kitabım (my book)
Turkish makes use of three buffer letters [n/y/s] to keep vowels apart when adding suffixes. The AX platform automatically adds them as need during inflection.
The following example shows a buffer letter being added to a word ending in a vowel, when putting it into the accusative form:
kedi (cat, lemma) -> kediyi (cat, acc. case)