Data Processing

In this category, you will find instructions for preparing and parsing your original raw data. Data Slicer is specifically dedicated to transform numeric data into binned categories. Finally  querying facilities are explained to extract sub-corpora or enrich existing tables in your corpus.

Data processing documentation

Data formats

CorText Manager proposes a full ecosystem of modeling and exploratory tools for analyzing data, which can be more or less calibrated. A wide range of data formats can be imported into CorText Manager, leaving you great flexibility in terms of data type that can be processed.   CorText Manager is particularly adapted for processing text:...

Upload corpus

Once you have collected data (see Data formats section for more information about data supported by CorText Manager), you should first zip the file(s) that compose your corpus into a single zip archive before you can upload it into CorText Manager. The zip archive should be performed, regardless of the format of your original data...

Data Parsing

After uploading your corpus, the first mandatory step to be able to apply analysis scripts in CorText Manager is to run the “Data Parsing” script. Data parser transforms your corpus in a sqlite database (see “what does the parsing step?” paragraph below for more details about the database structure). “Data Parsing” script can only process...

Query Corpus

This script allows you to query your corpus and build a subcorpus or create new fields with sql-like queries. Two modes of querying are proposed, sql begin the standard one. Querying your corpus in a sql-like mode The principle is to allow users to directly perform sql-like queries on their corpus. Query type Choose sql...

Data Slicer

Data Slicer simply slices numeric data (provided that they are integer values) into any given number of quantiles (to be chosen in the form). For example, if one has a database compiling information about individuals including their age, it may be useful to transform this field in bins of various significant ages. In turn, it...

Latest questions in the Q&A forum on data processing

Filter:AllOpenResolvedClosedUnanswered
AnsweredJacopo asked 3 weeks ago • 
101 views7 answers0 votes
AnsweredNathalie KAKPO asked 3 months ago • 
163 views1 answers0 votes
Answeredevelyne lhoste asked 5 months ago • 
207 views1 answers0 votes
AnsweredHannah asked 2 years ago • 
557 views3 answers0 votes
AnsweredChristophe Gauld asked 3 years ago • 
2984 views2 answers0 votes
Answeredalbertoc asked 3 years ago • 
1052 views3 answers0 votes
Answeredorianabras asked 1 year ago • 
428 views1 answers0 votes
AnsweredEmma Bogler asked 2 years ago • 
388 views2 answers0 votes
Answeredleo zhang asked 2 years ago • 
512 views3 answers0 votes