Beadavi asked 7 months ago

I would like to know whether assigning a reference to a cluster based on keyword co-occurrences (author keywords and ISI keywords) is relevant, even though the number of occurrences of a term in a document is always 0 or 1, since the keywords are not free text.
Lionel Staff replied 7 months ago

Dear Béatrice,
If I have understood correctly: it is not a question of 0 or 1, nor of whether the terms come from full text or from indexed keywords supplied by the data provider.
A document is represented by a specific set of keyword combinations (co-occurrences), regardless of whether these keywords are author keywords or are extracted from full text.

Summed over all the documents of the corpus, these co-occurrences build a matrix, and the relationships between keywords are calculated with the chosen proximity measure. CorText Manager classifies these co-occurrences based on their relationships to build the clusters.
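
To make the point about 0/1 data concrete, here is a small Python sketch of how presence/absence keyword lists still add up to a weighted co-occurrence matrix across a corpus. It is only an illustration, not CorText Manager's code: the toy corpus, the function names, and the cosine-style normalisation (co-occurrences divided by the square root of the product of the two keywords' frequencies) are all assumptions standing in for whichever proximity measure is chosen.

```python
from collections import Counter
from itertools import combinations
from math import sqrt

# Hypothetical toy corpus: each document is only a set of keywords,
# so every keyword is either present (1) or absent (0) in a document.
docs = [
    {"graphene", "battery", "anode"},
    {"graphene", "battery"},
    {"battery", "anode"},
    {"policy", "innovation"},
    {"policy", "innovation", "battery"},
]

def cooccurrence_counts(documents):
    """Count keyword frequencies and keyword-pair co-occurrences over the corpus."""
    occ = Counter()   # number of documents containing each keyword
    cooc = Counter()  # number of documents containing both keywords of a pair
    for keywords in documents:
        occ.update(keywords)
        for a, b in combinations(sorted(keywords), 2):
            cooc[(a, b)] += 1
    return occ, cooc

def cosine_proximity(occ, cooc):
    """Illustrative proximity: c_ij / sqrt(n_i * n_j). An assumed measure."""
    return {(a, b): n / sqrt(occ[a] * occ[b]) for (a, b), n in cooc.items()}

occ, cooc = cooccurrence_counts(docs)
prox = cosine_proximity(occ, cooc)

for pair, value in sorted(prox.items(), key=lambda kv: -kv[1]):
    print(pair, round(value, 3))
```

Even though each document contributes at most 1 per pair, the summed counts differ from pair to pair, and that variation is what a clustering step can work with.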

To project a document onto the clusters, CorText Manager only measures which clusters are closest to the document, by comparing the keywords listed for the document with the keywords of each cluster and their relationships. If “Assign a unique cluster to each record (best match)” is set to “yes”, only the best match is kept.
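
The projection step can be pictured with a similar sketch. Again, this is a simplified assumption rather than the actual CorText Manager scoring: clusters are reduced to plain keyword sets, the relationships between keywords are ignored, and the match score is a Jaccard overlap. The names `cluster_scores`, `assign` and `unique` are hypothetical; `unique` only mimics the effect of the “Assign a unique cluster to each record (best match)” option.

```python
def cluster_scores(doc_keywords, clusters):
    """Score each cluster by Jaccard overlap with the document's keywords."""
    scores = {}
    for name, cluster_keywords in clusters.items():
        shared = doc_keywords & cluster_keywords
        if shared:
            scores[name] = len(shared) / len(doc_keywords | cluster_keywords)
    return scores

def assign(doc_keywords, clusters, unique=True):
    """Return the best-matching cluster, or all matching clusters ranked by score."""
    scores = cluster_scores(doc_keywords, clusters)
    if not scores:
        return []  # the document shares no keyword with any cluster
    ranked = sorted(scores, key=scores.get, reverse=True)
    return ranked[:1] if unique else ranked

# Hypothetical clusters, reduced to keyword sets for illustration.
clusters = {
    "materials": {"graphene", "battery", "anode"},
    "governance": {"policy", "innovation"},
}
doc = {"battery", "policy", "innovation"}

print(assign(doc, clusters, unique=True))
print(assign(doc, clusters, unique=False))
```

Here the document shares one keyword with the first cluster and two with the second, so the second is the best match, although the document only carries 0/1 information about each keyword.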

I hope it helps!