Characteristics of the Macedonian language in AX Semantics

Fundamentals

In Macedonian, you need to know the gender, case, and number of a noun to form the accompanying adjectives, determiners, pronouns, and numerals correctly.

Macedonian has three genders for nouns: masculine, feminine, and neuter. There are two numbers: singular and plural. Additionally, Macedonian has two cases for nouns.

grammatical namevaluesexamples
gendermasculineнов текст
(new text)
feminineнова книга
(new book)
neuterново списание
(new magazine)
numbersingularголема куќа
(big house)
pluralголеми куќи
(big houses)
casenominativeмачка
(cat)
vocativeЕј, мачко!
(Hey, cat!)
adjectives (noun)before nounцрвено јаболко
(red apple)
verb tensespresentтој гледа
(he sees)
pastтој гледаше
(he saw)

The standard order of a noun phrase in Macedonian is the following:

preposition + determiner + numeral + adjective + noun

See for example:

за     овие    три    популарни книги
about  these   three  popular   books
PREP   DET     NUM    ADJ       NOUN
"about these three popular books"

Lexicon

Nouns

Macedonian nouns inflect for case, number, gender, and definiteness. Nouns should be added to the lexicon if they are not regular.

Lexicon entries for nouns may also be necessary for inflecting determiners, adjectives, numerals and pronouns correctly. They are omitted, if a lexicon entry is required, but missing.

Examples

The basic lexicon entry for текст (text) contains:

  • gender: masculine
  • inflection table for case and number:
SingularPlural
Nominativeтексттекстови
Vocativeтекстутекстови

Adjectives

In Macedonian, the default position for an adjective is "before noun". In the inflection table case, number, gender, and definiteness can be encoded.

If there are adjectives modifying the noun, only the first adjective is marked for definiteness (i.e., то in the below example), while the other adjectives and nouns are not marked. See the examples below:

кучето (the dog)
vs.
убаво куче (beautiful dog)
vs.
убавото куче (the beautiful dog)
vs.
старото и убаво куче(the old and beautiful dog)

Verbs

Macedonian verbs inflect for person, number, and tense. The most common verbs are encoded in our software. If a verb inflects the wrong way, you should add it to the lexicon.

Container settings

Determiner

The AX NLG platform supports the following determiners for Macedonian: definite, demonstratives (distal + proximal + medial) and possessives.

The definite article in Macedonian is mostly expressed by a suffix on the noun. However, if adjectives or numerals exist in the phrase, the definite article is suffixed to the adjective or numeral instead of the noun. For example:

куче (dog)
vs.
кучето (the dog)
vs.
старото куче (the old dog)
vs.
првото куче (the first dog)

Pronoun

The AX NLG platform supports the following pronouns for Macedonian: personal, demonstratives (distal + proximal), and possessives.

Numerals

Four types of numerals are possible on the AX NLG platform: cardinal, cardinal as digit, ordinal, and ordinal as digit. Take ден (day) for example:

cardinalordinal
textдевет дена
(nine days)
деветтиот ден
(the ninth day)
digit9 дена
(9 days)
9. ден
(the 9th day)

For Macedonian, ordinal numerals are written out until 10, otherwise (above 10) the output is in digit form. Cardinal numerals are written out until 12, otherwise (above 12) the output is in digit form. Take cardinal numerals for example:

дванаесет кучиња
(twelve dogs)
vs.
13 кучиња
(13 dogs)

Preposition switch

On the AX NLG platform the settings for the container Германија (Germany) are: preposition="во" and case="nom".

во Германија
(in Germany)

For the below sentence where only the country (i.e. the Philippines) is different, the platform settings are the same as above (preposition="во" and case="nom"), but the lexical information changes the result:

на Филипините
(in the Philippines)

This example shows a preposition switch from во to на.

Determiner Switch

Determiners can be switched according to lexical information. If you set the determiner switch for a specific noun in the lexicon, it will automatically switch to another determiner when you add the determiner you intend to switch in the container. The container setting in the first example is the following: preposition="до", determiner is unset (blank) and case="nominative".

до Берлин
(to Berlin)

As for the second example, the container settings for Филипини (Philippines) are still: preposition="до", determiner is unset (blank), and case="nominative". However, the determiner switches from none to definite, because it is configured in the lexicon entry for Филипини.

до Филипините
(to the Philippines)

Note

If the lexicon entry of a country includes a switch from none to definite, there is still a way to use the country without the article (e.g., just "Philippines"). The determiner will always remain None by setting determiner=None in the container. Only an unset determiner (blank) triggers the switch from none to another determiner.