Characteristics of the Bulgarian language in AX Semantics
Fundamentals
In Bulgarian, you need to know the gender of a noun to form (together with number and case) the accompanying adjectives, determiners, numerals, and pronouns correctly.
Bulgarian has three genders for nouns: masculine, feminine, and neuter. There are two numbers: singular and plural. Additionally, Bulgarian has two cases (nominative and vocative) for nouns, but personal pronouns have accusative and dative additionally.
grammatical name | values | examples |
---|---|---|
gender | masculine | старият лекар (the old doctor) |
feminine | старата жена (the old woman) | |
neuter | старото село (the old village) | |
number | singular | старият автомобил (the old car) |
plural | старите автомобили (the old cars) | |
cases (noun) | nominative | профе́сор (a professor) |
vocative | Как сте, професоре? (How are you, professor?) | |
cases (pronoun) | nominative | Той ми помага. (He helps me.) |
accusative | Родителите му водеха него и негов приятел на кино. Те го харесват. (His parents took him and a friend to the movies. They like him.) | |
dative | Непознатият му дава подарък. (The stranger gave him a gift.) | |
adjectives (noun) | before noun | червена къща (red house) |
verb tenses | present | той работи (he works) |
past (aorist) | той работи (he worked) | |
imperfect | рабо́теше (worked) | |
past participle | той е работил (he has worked) | |
future | той ще работи (he will work) |
The standard order of a noun phrase in Bulgarian is the following:
preposition + determiner + numeral + adjective + noun
See for example:
за тези три популярни книги
about these three popular book
PREP DET NUM ADJ NOUN
"about these three popular books"
Lexicon
Nouns
Bulgarian nouns are inflected for number (singular/plural), case (nominative/vocative), and definiteness (indefinite/definite). When the lexicon entry is missing, the NLG platform will first try to find the most probable gender based on heuristics. However, nouns should be added to the lexicon with their grammatical gender if they do not inflect regularly.
Lexicon entries for nouns may also be necessary for inflecting determiners, adjectives and pronouns correctly. They are omitted, if a lexicon entry is required, but missing.
Examples
The basic lexicon entry for се́ло (village) contains:
- gender: neuter
- inflection table for case and number:
Singular | Plural | |
---|---|---|
Nominative | се́ло | села́ |
Vocative | се́лое | села́и |
Definiteness is expressed by a definite article that is suffixed to the noun. Take стол (student)
for example:
стол (chair)
столът (the chair [subject])
стола (the chair [object])
Note
If you need lexicon entries for countries, write to the support about that and you will get them for Bulgarian with automatic handling of prepositions.
Adjectives
In the lexicon, the inflection table encodes gender, case, number, and definiteness. For adjective position, the default is "before noun".
The definite article is postfixed to the first adjective if there is one, otherwise it is added to the noun. For example:
добър ученик (a good student)
добрият ученик (the good student)
Verbs
Bulgarian verbs inflect for person, number, tense, and in some cases gender. The most common verbs are encoded in our software. If a verb inflects incorrectly, you should add it to the lexicon.
The future tense in Bulgarian can be formed via the auxiliary verb ще
+ the present form of the main verb. Thus, to express the future tense, you can put ще
as plain text and then use a verb container with present tense for the main verb. Take the verb играя (to play [inf.])
for example:
Те ще играят.
(They will play. [future])
Аз ще играя.
(I will play. [future])
Past participles in Bulgarian can be used for present and past perfect tense. The present perfect tense is formed via the present tense of съм
+ the past participle of the main verb. Additionally, the past perfect tense is composed of the imperfect tense of съм
+ the past participle of the main verb. You need to configure two verb containers for both perfect tenses. See the examples with the verb питам (impf, to ask)
below:
той е питал [present perfect]
(He has asked.)
той беше поискал питал [past perfect]
(He had asked.)
Container settings
Determiner
The AX NLG platform supports the following determiners for Bulgarian: definite, demonstrative, distal, relative, and possessive.
In Bulgarian, definiteness (definite/indefinite) is expressed by a definite article (e.g., то
in the below example) which is suffixed to the noun, adjective, or numeral. Note that the definite article is suffixed to the first adjective/numeral if there is one, otherwise it is added to the noun.
село (village[indef.])
vs.
селото (the village[def.])
vs.
старото село (the old[def.] village)
vs.
старото красиво село(the old[def.] beautiful village)
vs.
първото красиво село (the first[def.] beautiful village)
Numerals
The noun will automatically agree with the numeral number when a numeral variable is used. Four types of numerals are possible on the AX NLG platform: cardinal, cardinal as digit, ordinal, and ordinal as digit.
cardinal | ordinal | |
---|---|---|
text | девет дни (nine days) | деветия ден (the ninth day) |
digit | 9 дни (9 days) | 9. ден (the 9th day) |
For Bulgarian, ordinal numerals are written out until 20 on the platform, otherwise (above 20) the output is in digit form. Cardinal numerals are written out for 1-20, 30, 40, …, and 100, otherwise the output is in digit form. Take cardinal numerals for example:
деветдесет автомобила
(ninety cars)
vs.
101 автомобила
(101 cars)
Additionally, cardinal numerals take the special ending -ма
, when referring to men, see два (two)
for example:
два стола
(2 chairs)
двама братя
(2 brothers)
Prepositions
If users configure prepositions in the container, they are automatically adapted, if needed (в
→във
/ с
→ със
). As the below example shows, в
becomes във
when the next word starts with a similar consonant (e.g., в
or ф
).
пристигам в хотела
(arrive at the hotel)
във Франция
(in France)
Preposition switch
On the AX NLG platform, the settings for the container Германия (Germany) are: preposition="в"
and case="nom"
.
в Германия
(in Germany)
For the below sentence with a different place (i.e. Филипините (the Philippines)), the platform settings are the same as above (preposition="в", case="nom"
), but the lexical information changes the preposition and shows a switch from в
to на
:
на Филипините
(in the Philippines)