All the English content of this site moved to
www.semantic-knowledge.com



Click here to go to
www.semantic-knowledge.com


We will remove this outdated content soon,
so do not bookmark these pages.

Semantic engine

How Tropes operates

To process a text, Tropes operates in 6 stages:

  1. sentences and propositions hashing,
  2. ambiguity solving (regarding the words of the text),
  3. identification of Equivalent classes,
  4. statistics, detection of Bundles and Episodes,
  5. detection of the Most characteristic parts of text,
  6. layout and display of the result.

Results of first three stages are used by Zoom Semantic Search Engine.

Words are grouped together in several main Word categories. Among these, six are of interest to us:

To achieve an analysis, the software carries out a complex processing aiming at: assigning all the significant words to the above categories; analyzing their distribution into subcategories (Word categories, Equivalent classes, see below); examining their occurrence order, both within the propositions (Relations, Actants and Acted) and throughout the text (Distribution graph, Bundles, Episodes, Most characteristic parts of text, see below).

Propositional hashing

To simplify the analysis, Tropes divides the text into propositions (simple sentences). This first stage is based on a scrutiny of the punctuation, and on complex syntax analysis functions, which will not be detailed here. Thus, you obtain co-occurrence statistics (Relations) of high reliability, since it is not possible for two words to fit into the same grammatical proposition if they are not closely connected. Propositional hashing is bound to involve errors (propositions that are either too short or too long), but this does not alter the results.

Ambiguity solving

The automatic interpretation of words in any living language, either written or spoken, requires the solving of numerous ambiguities:

One of the main functions of this software is to solve these ambiguities by means of several problem-solving algorithms. Though a perfect result is impossible to achieve, the error rate is low enough to guarantee an accurate analysis of your text.

Word categories

The verbs are either:

Connectors (coordinating and subordinating conjunctions, conjunctive phrases) link together various parts of the discourse through concepts of:

Personal pronouns are displayed in gender ("I", "You", "He", etc.) and in number ("They", "We", etc.) The middle/old English "Thou" form is also detected here.

Modalities (adverbs or adverbial phrases) enable the speaker to get involved in what he says, or to locate what he says in time and space, through concepts of:

Adjectives are either:

Other word categories include pronouns, articles, prepositions and non-qualifying adjectives. Do not take these categories into account, because they are used only for ambiguity solving.

Use of Word categories

Broadly speaking, we can say that:

Statistical, probabilistic and cognitive analyses

Tropes carries out different sorts of text analyses:

Among other things, statistics are used to build the graphs and to lay out the results.

The Frequent word categories and the Text Style are obtained by comparing the distribution of the occurrence frequency of the categories observed in the text with linguistic production norms. These norms have been elaborated after studying a great number of different texts. They are stored into specific in-built tables.

Equivalent classes and Relations between equivalents

The Equivalent classes group together closely related References (common nouns, proper nouns, trademarks) appearing frequently throughout the text. For example, "father" and "mother" are grouped together into the "family" class.

The Reference fields group together the words comprising the Equivalent classes in order to enable the software to elaborate a representation of the context. To achieve this, the Semantic equivalents dictionary of Tropes is composed of three different classification levels. At the lower level are the References, which are next merged more broadly into Reference fields 2, which, in turn, are merged into Reference fields 1.

In the example below, the word "Lord Chancellor" belongs to the "minister" Reference, included in the "government" field 2, which is part of the "politics" field 1. The "politics" field 1 includes broader concepts, such as "political system", "foreign policy", etc.

Fields 1 Fields 2 References Words
Politics Political system Communism Communism
Politics Political system Communism Marxism
Politics Political system Democracy Democracy
Politics Political system Democracy Republic
Politics Government Federal government Federal government
Politics Government Head of Government Head of Government
Politics Government Head of Government Prime Minister
Politics Government Minister Lord Chancellor
Politics Government Minister Minister
Politics Government Minister Secretary of State
Politics Government Government Government

Bundles and Episodes

Tropes carries out two different tools to study the chronology of a discourse. It is based on two notions, Bundles and Episodes:

Most characteristic parts of text

The contraction of the text reveals the Most characteristic parts of text. These are "propositions introducing main themes or characters, expressing events that are essential to the progression of the story (causal attributions of consequences, results, aims)".

To extract these propositions, Tropes carries out a complex Cognitive-Discursive Analysis processing (CDA). To simplify matters, let us say that each proposition of the text is allotted a score, depending on its relative weight, its occurrence order and its argumentative role. The propositions are then sorted out according to their respective scores. To enable you to control the amount of displayed propositions, and to insure that the result obtained reflects the text analyzed, Tropes provides the means to adjust the contraction rate of the text.

Text Style

The software makes a diagnosis of the Text Style and of its Setting according to the statistical indicators retrieved during the analysis.

Here are the possible Styles:

Style Explanation:
Argumentative the speaker involves himself, argues, explains or analyzes in order to try to convince the interlocutor
Narrative a narrator states a series of events, happening at a given time, and in a given place
Enunciative the speaker and the interlocutor establish a mutual relation of influence, make their standpoints known
Descriptive a narrator describes, identifies or classifies something or somebody

 


This is the first page of this section This is the first page of this section Next Last page

Copyright ACETIC 2004
http://www.acetic.fr