|
All the English content of this site moved to www.semantic-knowledge.com ![]() Click here to go to www.semantic-knowledge.com |
We will remove this outdated content soon,
so do not bookmark these pages.
To process a text, Tropes operates in 6 stages:
Results of first three stages are used by Zoom Semantic Search Engine.
Words are grouped together in several main Word categories. Among these, six are of interest to us:
To achieve an analysis, the software carries out a complex processing aiming at: assigning all the significant words to the above categories; analyzing their distribution into subcategories (Word categories, Equivalent classes, see below); examining their occurrence order, both within the propositions (Relations, Actants and Acted) and throughout the text (Distribution graph, Bundles, Episodes, Most characteristic parts of text, see below).
To simplify the analysis, Tropes divides the text into propositions (simple sentences). This first stage is based on a scrutiny of the punctuation, and on complex syntax analysis functions, which will not be detailed here. Thus, you obtain co-occurrence statistics (Relations) of high reliability, since it is not possible for two words to fit into the same grammatical proposition if they are not closely connected. Propositional hashing is bound to involve errors (propositions that are either too short or too long), but this does not alter the results.
The automatic interpretation of words in any living language, either written or spoken, requires the solving of numerous ambiguities:
One of the main functions of this software is to solve these ambiguities by means of several problem-solving algorithms. Though a perfect result is impossible to achieve, the error rate is low enough to guarantee an accurate analysis of your text.
The verbs are either:
Connectors (coordinating and subordinating conjunctions, conjunctive phrases) link together various parts of the discourse through concepts of:
Personal pronouns are displayed in gender ("I", "You", "He", etc.) and in number ("They", "We", etc.) The middle/old English "Thou" form is also detected here.
Modalities (adverbs or adverbial phrases) enable the speaker to get involved in what he says, or to locate what he says in time and space, through concepts of:
Adjectives are either:
Other word categories include pronouns, articles, prepositions and non-qualifying adjectives. Do not take these categories into account, because they are used only for ambiguity solving.
Broadly speaking, we can say that:
Tropes carries out different sorts of text analyses:
Among other things, statistics are used to build the graphs and to lay out the results.
The Frequent word categories and the Text Style are obtained by comparing the distribution of the occurrence frequency of the categories observed in the text with linguistic production norms. These norms have been elaborated after studying a great number of different texts. They are stored into specific in-built tables.
The Equivalent classes group together closely related References (common nouns, proper nouns, trademarks) appearing frequently throughout the text. For example, "father" and "mother" are grouped together into the "family" class.
The Reference fields group together the words comprising the Equivalent classes in order to enable the software to elaborate a representation of the context. To achieve this, the Semantic equivalents dictionary of Tropes is composed of three different classification levels. At the lower level are the References, which are next merged more broadly into Reference fields 2, which, in turn, are merged into Reference fields 1.
In the example below, the word "Lord Chancellor" belongs to the "minister" Reference, included in the "government" field 2, which is part of the "politics" field 1. The "politics" field 1 includes broader concepts, such as "political system", "foreign policy", etc.
Tropes carries out two different tools to study the chronology of a discourse. It is based on two notions, Bundles and Episodes:
The contraction of the text reveals the Most characteristic parts of text. These are "propositions introducing main themes or characters, expressing events that are essential to the progression of the story (causal attributions of consequences, results, aims)".
To extract these propositions, Tropes carries out a complex Cognitive-Discursive Analysis processing (CDA). To simplify matters, let us say that each proposition of the text is allotted a score, depending on its relative weight, its occurrence order and its argumentative role. The propositions are then sorted out according to their respective scores. To enable you to control the amount of displayed propositions, and to insure that the result obtained reflects the text analyzed, Tropes provides the means to adjust the contraction rate of the text.
The software makes a diagnosis of the Text Style and of its Setting according to the statistical indicators retrieved during the analysis.
Here are the possible Styles:
Copyright
ACETIC 2004
Use of Word categories
Statistical, probabilistic and cognitive analyses
Equivalent classes and Relations between equivalents
Fields 1
Fields 2
References
Words
Politics
Political system
Communism
Communism
Politics
Political system
Communism
Marxism
Politics
Political system
Democracy
Democracy
Politics
Political system
Democracy
Republic
Politics
Government
Federal government
Federal government
Politics
Government
Head of Government
Head of Government
Politics
Government
Head of Government
Prime Minister
Politics
Government
Minister
Lord Chancellor
Politics
Government
Minister
Minister
Politics
Government
Minister
Secretary of State
Politics
Government
Government
Government
Bundles and Episodes
Most characteristic parts of text
Text Style
Style
Explanation:
Argumentative
the speaker involves himself, argues, explains or analyzes in order to try to convince the interlocutor
Narrative
a narrator states a series of events, happening at a given time, and in a given place
Enunciative
the speaker and the interlocutor establish a mutual relation of influence, make their standpoints known
Descriptive
a narrator describes, identifies or classifies something or somebody
![]()
![]()
![]()
![]()
http://www.acetic.fr