Characteristics of the Chinese language in AX Semantics


Chinese has three genders for nouns: masculine, feminine and neuter, and there are two numbers: singular and plural. Often the gendered distinction exists in the case of the pronouns "他[ta]" (he) and "她[ta]" (she).

Chinese grammar intensively relies on particles. Verb tenses are configured by particles instead of morphology/inflection.

grammatical namevaluesexamples
gendermasculine高的男人 (tall-高的/man-男人)
feminine高的女人 (tall-高的/woman-女人)
neuter高的树 (tall-高的/tree-树)
numbersingular一栋红的房子 (one+classifier-一栋/red-红的/house-房子)
plural兩栋红的房子 (two+classifier-兩栋/red-红的/houses-房子)
adjectivebefore noun (big)
verb tensepresent (go)
past去了 (went)

The standard order of a noun phrase in Chinese is the following:

preposition + determiner + numeral + adjective + noun

See for example:

关于    这     三本    受欢迎的  书
about  these  three  popular  book
PREP   DET    NUM    ADJ      NOUN
"about these three popular books"



For Chinese nouns, the lexicon needs to encode gender and number.


The lexicon entry for 孩子 [hai zi] (child) contains:

  • gender: neuter
  • inflection table for case and number:
Nominative孩子[hai zi] (child)孩子[hai zi men] (children)

The lexicon entry for 女孩 [nu hai] (girl) contains:

  • gender: feminine
  • inflection table for case and number:
Nominative女孩[nu hai] (girl)女孩[nu hai men] (girls)

Alternative: Chinese number can be changed by adding particles.

For example, by adding the particles "很多[hen duo]"(many) to "孩子[hai zi]"(child), it makes singular nouns become plural.


For Chinese verbs, the lexicon only needs to encode tense. The container will generate the most common tense particle, "了[le]". Therefore, you can make use of the lexicon when you need a different tense particle. For example, you might want to have a different tense particle "过[guo]" for the verb "玩[wan]"(play).


The lexicon entry for 玩[wan] (play) contains

  • past tense: 玩过[wan guo] (played)

Container settings


The AX NLG platform supports the following determiners for Chinese: demonstrative and possessive.


Nouns need a classifier when cardinal numerals are used. If a cardinal is used in a container, the AX NLG platform will add the most common classifier, "个[ge]", like in the following example:

  • A classifier "个[ge]" is added between "三[san]"(three) and "人[ren]"(people).
 三   "个" 人
[san "ge" ren]
(three-三"个" people-人)

However, if different classifiers are needed, they have to be added manually between number, like in the example below:

  • A classifier "栋[dong]" is added between "两[liang]"(two) and "房子[fangzi]"(houses).
 两     "栋"   房子
[liang "dong" fangzi]
(two-两"栋" houses-房子)

Additionally, four types of numerals are possible on the AX NLG platform: cardinal, cardinal as digit, ordinal, and ordinal as digit:

(nine girls)
(the ninth girl)
(9 girls)
(the 9th girl)

For Chinese, both cardinal and ordinal numerals are written out until 100 on the platform, otherwise (above 100) the output is in digit form. Below you can see an example with a cardinal numeral, which also includes a classifier:

[yibai liang che]
(one hundred:一百 classifier:辆 cars:车)
[yibai ling yi liang che]
(101 classifier:辆 cars:车)


The adjectives in Chinese are very regular. As shown in the following example, a particle "的[de]" is put between adjective and noun. The container will automatically deal with that, so adding adjectives in the lexicon is unnecessary.


[da "de" fangzi]
(big-大"的" house-房子)
[Hong de maozi]
(red-红"的" hat-帽子)


In Chinese, verb tense is expressed via particles. The AX NLG platform includes the most common tense particles and automatically adds the tense particles right after the verb stem. If you want to add different particles after the verb stem, you can add them in the verb lexicon. If you need to change the position of tense particles, please see the following advanced examples.

Common Examples

Past tense particle "了[le]" is added right after the verb stem (e.g. "去[qu]"(go) and "吃[chi]"(eat)).

[qu/qu "le"]
[chi/chi "le"]

Advanced examples

If you need particles in a different position, you have to add them manually or create branches for different tense particles. For example, present progressive tense uses the particle "正[zheng]" to indicate tense, but does not add particle right after the verb.

[Tamen "zheng" cong bali fei wang sai she er]
(They-他们 particle-"正" from-从 Paris-巴黎 are flying to-飛往 the seychelles-塞舌尔)

Additionally, past tense uses the particle "了[le]", but does not add particle right after the verb.

[Tamen cong bali fei wang sai she er "le"]
(They-他们 from-从 Paris-巴黎 flew to-飛往 the seychelles-塞舌尔 particle-"了")

As for future tense, it uses the particle "将[jiang]", but does not add particle right after the verb.

[Tamen "jiang" cong bali fei wang sai she er]
(They-他们 particle-"将" from-从 Paris-巴黎 will fly to-飛往 the seychelles-塞舌尔)

Alternative: You can also clarify tense directly with the time expression (like today, tomorrow, yesterday) as part of the sentence.

A time expression is usually addressed at the beginning of the sentence, like in the following example.

[zuótiān wǒ chī zhū ròu]
(Yesterday-昨天 I-我 ate-吃 pork-猪肉)

Language Variants

The AX NLG platform offers 5 variants of the Chinese language:

  • China (simplified Chinese)
  • Hong Kong (Cantonese)
  • Macao (Cantonese)
  • Taiwan (traditional Chinese)
  • Singapore (simplified Chinese)

Firstly, they differ in their characters. Simplified Chinese characters have easier structure than their traditional ones. For instance, the character, "叶[Ye]" meaning ‘leaf’ has notably less embellishments than the traditional version, "葉[Ye]". This means that also determiners and pronouns differ between variants. For example, the Cantonese demonstrative pronoun is 呢[ne], whereas in simplified Chinese it is 该[gai].

Secondly, there are some phrases specific to each language. For example, "鬼佬[Gwai Lo]" in Cantonese refers to a foreigner. In Mandarin, the phrase is "老外[Lao Wai]".