Help! Issues with Terms Extraction

CorText Manager Q&A forum, Category: Text processing
Mate Foldi asked 6 years ago

Good morning everyone! 
Unfortunately, there seems to be an issue with running the Terms Extraction script on my corpus. 
Earlier in the week I performed multiple terms extractions successfully. However, when I tried to run them again (following the same steps, only increasing the list number), the results came back empty, with no terms list created. Once the script had finished, the results were blank. I even tried running the script again with the same list number as before, but that yielded the same result. 
My corpus is in a CSV format. 
Is there some kind of problem with the program? 
Thank you and kind regards, 
Máté

Jean-Philippe Cointet Staff replied 6 years ago

Nothing has changed in how the script functions. Is the log information also empty?

Mate Foldi replied 6 years ago

My log information says that it was a success, but there is no option to view any of the results. The corpus being analysed is also still the same one. Below is what my log says:

2018-11-26 11:52:03 INFO : Script Terms Extraction Started
2018-11-26 11:52:03 INFO :
Lexical extraction parameters:
Textual Fields:
– Text
Minimum Frequency: ‘3.’
List length: ‘250’
Monogramms are forbidden: true
Maximal length (max number of words): ‘3’
Lexical extraction advanced settings: false
grammatical criterion: noun phrase
Optionnaly you can name the new indexation that will be generated: ”
Dynamics:
Choose Original Timescale: Standard Periods
Number of time slices: ‘1’
time slices distribution: homogeneous

2018-11-26 11:52:03 INFO : Starting Extraction
2018-11-26 11:52:03 INFO : Period: [‘0_0’]
2018-11-26 11:52:03 INFO : Linguistic Pre-processing
2018-11-26 11:53:52 INFO : corpus POS/TAGed
2018-11-26 11:54:15 INFO : coprus chunked
2018-11-26 11:54:15 INFO : Word Frequency computed
2018-11-26 11:54:15 INFO : Ngrams counted
2018-11-26 11:54:16 INFO : Corpus Stemmed
2018-11-26 11:54:16 INFO : C-value computed
2018-11-26 11:54:16 INFO : Checking hyphenation
2018-11-26 11:54:16 INFO : Computing specificty score
2018-11-26 11:54:17 INFO : End of the Extraction
2018-11-26 11:54:17 INFO : Filtering extracted terms
2018-11-26 11:54:17 INFO : Indexing terms
2018-11-26 11:54:17 INFO : Script Corpus Terms Indexer Started
2018-11-26 11:54:20 INFO : 7412 occurrences indexed
2018-11-26 11:54:20 INFO : csv file read
2018-11-26 11:54:20 INFO : New table created: ISIterms (will simply be shown as Terms in the script parameter forms)
2018-11-26 11:54:20 INFO : Corpus successfully indexed
2018-11-26 11:54:20 INFO : List generated 250 terms included
2018-11-26 11:54:20 INFO : Script Term Extraction ended successfully
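
For anyone who wants to check a log like this without reading it line by line, here is a minimal sketch in Python. It assumes the log has been copied into a string with one entry per line in the timestamp/level/message format shown above. The names `summarize_log`, `first_int` and `SAMPLE_LOG` are hypothetical helpers for illustration and are not part of CorText Manager.

```python
import re
from datetime import datetime

# Matches lines such as "2018-11-26 11:54:20 INFO : 7412 occurrences indexed".
LOG_LINE = re.compile(r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}) (\w+) :\s*(.*)$")


def first_int(pattern, messages):
    """Return the first integer captured by `pattern` in any message, or None."""
    for msg in messages:
        found = re.search(pattern, msg)
        if found:
            return int(found.group(1))
    return None


def summarize_log(text):
    """Collect a few key facts from a pasted log (format assumed from the post)."""
    entries = []
    for line in text.splitlines():
        match = LOG_LINE.match(line.strip())
        if match:
            stamp = datetime.strptime(match.group(1), "%Y-%m-%d %H:%M:%S")
            entries.append((stamp, match.group(2), match.group(3)))

    messages = [msg for _, _, msg in entries]
    elapsed = None
    if entries:
        elapsed = (entries[-1][0] - entries[0][0]).total_seconds()

    return {
        "ended_ok": any("ended successfully" in msg for msg in messages),
        "occurrences_indexed": first_int(r"(\d+) occurrences indexed", messages),
        "terms_in_list": first_int(r"List generated (\d+) terms included", messages),
        "elapsed_seconds": elapsed,
    }


# A few lines copied from the log in this thread.
SAMPLE_LOG = """\
2018-11-26 11:52:03 INFO : Script Terms Extraction Started
2018-11-26 11:54:20 INFO : 7412 occurrences indexed
2018-11-26 11:54:20 INFO : List generated 250 terms included
2018-11-26 11:54:20 INFO : Script Term Extraction ended successfully
"""

if __name__ == "__main__":
    print(summarize_log(SAMPLE_LOG))
```

A summary that reports a successful end and a non-zero term count, while the interface shows no results, points to a display or storage problem rather than a failed extraction.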

1 Answer
Philippe Breucker Staff answered 6 years ago

I checked, and this does seem to be a new bug. It is confirmed: we are working on it and will tell you as soon as it is fixed.

Mate Foldi replied 6 years ago

Thank you very much! Roughly how long will it take to fix?

Philippe Breucker Staff replied 6 years ago

It should be OK now. Could you test again?

Mate Foldi replied 6 years ago

Yes, it’s working again! Thank you!