Characteristics of the Croatian language in AX Semantics
In Croatian, you need to know the gender of a noun to form (together with number and case) the accompanying adjectives, determiners, numerals, and pronouns correctly.
Croatian has three genders for nouns: masculine (animate and inanimate), feminine and neuter. There are two numbers: singular and plural. Additionally, Croatian has seven cases for nouns.
|gender||masculine inanimate||stari auto|
(the old car)
|masculine animate||stari doktor|
(the old doctor)
(the old woman)
(the old theater)
(the old car)
(the old cars)
(I see the dog.)
|dative||Dajem loptu psu.|
(I give the ball to the dog.)
(the dog's bell)
|vocative||Ovo je bilo dobro, psu.|
(This was good, dog.)
|locative||Ptica sleti na psu.|
(The bird lands on the dog.)
|instrumental||Slijepac hoda sa psom.|
(The blind man walks with a dog.)
|adjectives (noun)||before noun||crvena kuća|
|verb tenses||present||on radi|
|past (imperfect)||on rađaše|
|past participle passive||rađen|
|past participle active||on je radio|
The standard order of a noun phrase in Croatian is the following:
preposition + determiner + numeral + adjective + noun. See for example:
o ove tri popularne knjige about these three popular book PREP DET NUM ADJ NOUN "about these three popular books"
Croatian nouns are inflected for number and case. Lexicon entries for nouns may also be necessary for inflecting determiners, adjectives and pronouns correctly. They are omitted, if a lexicon entry is required, but missing.
The basic lexicon entry for kuća (house) contains:
- gender: feminine
- inflection table for case and number:
If you need lexicon entries for countries, write to the support about that and you will get them for Croatian with automatic handling of prepositions.
In the lexicon, the inflection table encodes gender/animacy, case, number and definiteness. For adjective position, the default is "before noun".
Croatian verbs inflect for person, number, tense, and in some cases gender. The most common verbs are encoded in our software. If a verb inflects incorrectly, you should add it to the lexicon.
The future tense in Croatian can be formed via present tense of
htjeti + infinitive verb. Thus, you can easily use present tense in verb containers to express the future tense. Take the verb
jesti (to eat) for example:
jesti (to eat [inf.]) Ja ću jesti. (I will eat. [future])
Participles in Croatian can be used in a particular voice: active or passive. See the examples with the verb
izgraditi (pf, to build) below:
Maria je izgradila kuću. = Maria has built the house. (active) Kuća je izgrađena. = The house was built. (passive)
The AX NLG platform supports the following determiners for Croatian: definite, indefinite, demonstrative, medial, distal, and possessive.
The noun will automatically agree with the numeral number when a numeral variable is used. Four types of numerals are possible on the AX NLG platform: cardinal, cardinal as digit, ordinal, and ordinal as digit.
(the first day)
(the 1st day)
In Croatian, case and number for nouns and adjectives change based on numerals (if no other case than nominative or accusative is set). See for example:
|Numeral||case / number||example|
|1||Nominative Singular||1 crvena kuća|
(1 red house)
|2-4||Genitive/Singular||3 crvene kuće|
(3 red houses)
|>= 5||Genitive/Plural||7 crvenih kuća|
(7 red houses)
For Croatian, both cardinal and ordinal numerals are written out until 20, otherwise (above 20) the output is in digit form. Take cardinal numerals for example:
dvadeset automobila (twenty cars) 21 automobil (21 cars)
If users configure prepositions in the container, they are automatically adapted, if vocalization needs to happen. As the below example shows,
sa when the next word starts with a sibilant or similar consonant (e.g.,
s autom (with the car) sa školom (with the school)
On the AX NLG platform, the settings for the container Njemačka (Germany) are:
u Njemačkoj (in Germany)
For the below sentence with a different place (i.e. Island (Iceland)), the platform settings are the same as above (
preposition="u", case="loc"), but the lexical information changes the preposition and shows a switch from
na Islandu (in Iceland)
"Fleeting a" describes the phenomenon of a short
a appearing and disappearing in the inflected forms of certain nouns. See the noun
bórac (fighter) for example:
bórac [nom. sg.] bórca [gen. sg.] bȏrācā [gen. pl.]
Palatalization is a sound change that happens when the inflection suffix starts with the vowel
e and the word stem ends with
k, g or h (which change to
č, ž and š). The AX NLG platform applies palatalization automatically, for example:
pȅći (to bake) → pèčēm (I bake) vrag (devil) → vraže (devils) jùnāk (hero) → jȕnāče [voc. sg.]