Characteristics of the Latvian language in AX Semantics
In Latvian, you need to know the gender and number of a noun to form the accompanying adjectives, determiners and pronouns correctly.
Latvian has two gender for nouns: masculine and feminine. There are two numbers: singular and plural. Additionally, Latvian has seven cases for nouns.
|number||singular||viena veca automašīna|
(one old car)
|plural||piecas vecas automašīnas|
(five old cars)
|cases (noun)||nominative||suns (dog)|
|dative||Es dodu bumbu sunim. |
(I give the ball to the dog.)
|accusative||Es redzu suni.|
(I see the dog.)
|instrumental||Neredzīgais cilvēks staigā kopā ar suni.|
(The blind man walks with a dog.)
|locative||Kauls bija sunī.|
(The bone was in the dog.)
|vocative (mainly for noun)||Ei, suni!|
|adjectives (noun)||before noun||sarkanais ābols|
|verb tenses||present||viņš pērk|
The standard order of a noun phrase in Latvian is the following:
preposition + determiner + numeral + adjective + noun. See for example:
par šīm trim populārajām grāmatām about these three popular books[dat,pl] PREP DET NUM ADJ NOUN "about these three popular books"
Latvian nouns are inflected for number and case. When the lexicon entry is missing, the NLG platform will try to find the most probable gender based on heuristics. However, nouns should be added to the lexicon with their grammatical gender if they do not inflect regularly.
Lexicon entries for nouns may also be necessary for inflecting determiners, adjectives and pronouns correctly. They are omitted, if a lexicon entry is required, but missing.
The basic lexicon entry for draugs (friend) contains:
- inflection table for case and number:
If you need lexicon entries for countries, write to the support about that and you will get them for Latvian with automatic handling of prepositions.
In the lexicon, the inflection table encodes definiteness, case, number and gender. For adjective position, the default is "before noun".
Additionally, Latvian adjectives not only agree with the noun, but also express definiteness. Take the adjective
vecs (old) for example:
Viņa nopirka [vecu] māju. (She bought [an old] house.) vs. Viņa nopirka [veco] māju. (She bought [the old] house.)
Besides, if adjectives stay after possessive and demonstrative determiners, the definite form will also be used. See for example:
Viņa nopirka šo [veco] māju. (She bought this [old] house.) Viņas [veco] māju. (Her [old] house.)
Latvian verbs inflect for person, number, and tense. The most common verbs are encoded in our software. If a verb inflects incorrectly, you should add it to the lexicon.
The AX NLG platform supports the following determiners for Latvian: demonstrative, distal, and possessive. Although there are no articles in Latvian, definiteness can be indicated by the endings of adjectives. Therefore, "definite" still can be chosen in the determiners setting.
The noun will automatically agree with the numeral number when a numeral variable is used. Four types of numerals are possible on the AX NLG platform: cardinal, cardinal as digit, ordinal, and ordinal as digit.
(the ninth day)
(the 9th day)
For Latvian, both cardinal and ordinal numerals are written out until 20, otherwise (above 20) the output is in digit form. For example:
divdesmit automašīnas (twenty cars) 21 automašīna (21 cars)
Note that when the cardinal is above 9, the noun (+ adjective) should be genitive case:
|1-9||septiņi veci studenti[nom, m]|
(seven old students)
piecas grāmatas[nom, f]
|above 9||Divdesmit septiņi vecu studentu[gen, m]|
(twenty-seven old students)
desmit grāmatu[gen, f]
The AX NLG platform will automatically apply consonant shift for Latvian nouns. Consonant shift refers to the phenomenon that consonants in the beginning or middle of a word change depending on the grammatical context (e.g.
For example, the plural nominative case for the noun
brālis (brother) is formed with the suffix
-i. With pluralization a consonant shift in the form of palatalization happens when the
l in the middle of the noun changes to
ļ, and thus
brālis (brother) becomes
brāļi(brothers). More examples can be seen below:
upe (river) -> upju (gen.pl.) = iotation "p → pj" lasis (salmon) -> lašu (gen.pl.) = palatalization "s → š"