Characteristics of the Italian language in the AX NLG platform
Fundamentals
In Italian, you need to know the gender of the noun in order to form determiners, adjectives and numerals correctly. There are two genders: masculine and feminine. And there are two numbers: singular and plural. As for other Romance languages there is only one case for nouns: nominative. But for personal pronouns Italian has additionally accusative, genitive and dative.
grammatical name | values | examples |
---|---|---|
gender | feminine | la casa nera (the black house) |
masculine | il porto nero (the black port) | |
number | singular | una grande casa (a big house) |
plural | due grandi porti (two big harbors) | |
cases (noun) | nominative | casa (house) |
cases (pronoun) | nominative | Mario sta costruendo una casa. Lui sta costruendo una casa. (Mario is building a house. He is building a house.) |
accusative | Maria sta costruendo una casa. Maria la costruisce. (Maria is building a house. Maria builds it.) | |
genitive | C'è del formaggio? Ne voglio una fetta. (Is there any cheese? I want a slice of it.) | |
dative | Maria regala un libro a Luigi. Maria gli regala un libro. (Maria gives Luigi a book. Maria gives him a book.) | |
adjective positions | pre-noun | grandi porti (large ports) |
post-noun (default) | casa rossa (red house) | |
verb tenses | present | lui parla (he speaks) |
imperfect | lui parlava (he talked) | |
gerund | parlando (speaking) | |
future | lui parlerà (he will speak) |
The standard order of a noun phrase in Italian is the following: preposition + determiner + numeral + noun + adjective
.
See for example:
su questi tre libri popolari
about these three books[pl] popular
PREP DET NUM NOUN ADJ
"about these three popular books"
Lexicon
Nouns
For Italian nouns, the lexicon needs to encode gender, preposition changes and if a determiner is present. If the lexicon entry is missing, the NLG platform tries to find the most probable gender based on heuristics.
Examples
The lexicon entry for porto (port) contains
- gender: masculine
- inflection table for case and number:
Singular | Plural | |
---|---|---|
Nominative | porto | porti |
The lexicon entry for Australia contains:
- gender: female
- inflection table for case and number
- remove the definite determiner for the preposition in
And the lexicon entry for Malta contains:
- remove all definite determiners
- inflection table for case and number
- replace preposition in with a
note
If you need lexicon entries for countries please write the support about that and you will get them for Italian with automatic handling of determiners.
Adjectives
For adjectives the lexicon can encode gender for singular and plural and the adjective position (before the noun or after the noun).
For position, the default is after the noun. When an adjective should be before the noun, this has to be encoded in the lexicon.
The most important cases (e.g. grande, buono, etc.) are already annotated as before nouns, but others may need a lexicon entry.
Verbs
The most common verbs are encoded in our software. If a verb inflects the wrong way, you should add it to the lexicon.
Container settings
Numerals
The noun will automatically agree with the numeral number when you use a numeral variable. There is no need to add additional branches for numeral. Four types of numerals are possible on the AX NLG platform: cardinal, cardinal as digit, ordinal, and ordinal as digit.
cardinal | ordinal | |
---|---|---|
text | Nove topi nella mia stanza. (Nine mice in my room.) | Il noni giorno di scuola. (The ninth day at school.) |
digit | 9 topi nella mia stanza. (9 mice in my room.) | Il 9i giorno di scuola. (The 9th day at school.) |
For Italian, ordinal numerals are written out until 20, otherwise (above 20) the output is in digit form. Cardinal numerals are written out until 30, otherwise (above 30) the output is in digit form. Take cardinal numerals for example:
trenta auto
(thirty cars)
31 auto
(31 cars)
Prepositions and determiners: contractions
On the AX NLG platform prepositions are automatically joined with determiners for the following prepositions: a, da, di, in and su.
Determiners and preposition may also be changed separately. For example the preposition a changes to ad before vowels like in ad Andorra and determiners like la or una are contracted with an apostrophe like in l'amica.
Examples
Contraction
In the sentence
Stavano viaggiando dall'Australia a Parigi.
(They were traveling from Australia to Paris.)
the preposition da is joined with the female definite article la and the article is adapted to the following vowel a to dall‘.
On AX NLG platform, when the settings for the container dall‘Australia are: preposition="da"
and determiner=definite
, the lexical information about gender and the vowels following each other (dalla A -> dall'A) results automatically in correct inflection.
Determiner switch
When Australia is used with the preposition in no article is used. For example:
Vivono in Australia.
(They live in Australia.)
On the AX NLG platform, the noun container is used the same way as in the first example: preposition="in"
and determiner=definite
. The lexical information about the removal of the determiner when the preposition is in is automatically applied by using the lexicon. The determiner=definite
is important for other nouns that could be inserted here, i.e. if the inserted noun is always plural (plurale tantum), then in+definiteness is changed to nelle
Vivono nelle Emirati Arabi Uniti.
(They live in the United Arab Emirates.)
Preposition switch
Another case here is the preposition switch from in to a for (mostly) islands. Let's use Malta as an example here. The sentence would be:
Vivono a Malta.
(They live in Malta.)
so the preposition changes to a and there is no determiner. So Malta has no gender in the lexicon, but two switches: replace preposition in with a and they don't use any determiners with prepositions.
Defaults: Some best practices for Italian nouns with switches
- If there are definite determiners (e.g., for country names) possible for a noun phrase they should be activated for the container and then the determiner is removed via the lexicon where appropriate. This assumes "with definite article" is the default.
- The default for the preposition switch (only used for in -> a in Italian) is in as in most other languages.