I’m trying to run a Contrast Analysis opposing two targets, but I get many “useless terms” such as “http”, “les”, etc.
Is there any way to get rid of them? They drown out the more meaningful words.
In a previous post, someone asked more or less the same question and was told to use “categorical” instead of “text”, but that still doesn’t work for me.
Any ideas?
Thanks a lot
As long as your field is made of textual content, you should probably stick to “text” and avoid “categorical”, in my opinion.
You could try the alternative tokenizer, which is slower but may give better results: it may not split URLs into pieces such as “http”, and it may handle accents and other special characters better.
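
If the tokenizer alone doesn't help, another option might be to clean the text field yourself before importing it. Below is a minimal sketch in plain Python, not something built into the tool: the `clean_text` function and the `STOP_WORDS` list are hypothetical names made up for this example, and the list should be adapted to whatever terms pollute your own results.

```python
import re

# Hypothetical stop list: extend it with the terms that pollute your results.
STOP_WORDS = {"http", "https", "www", "les", "la", "le", "de", "des", "et"}

# Matches whole URLs so they are removed in one piece instead of
# being split into fragments such as "http" or "com".
URL_RE = re.compile(r"https?://\S+|www\.\S+", re.IGNORECASE)

# \w is Unicode-aware for str patterns in Python 3, so accented
# letters stay inside their word.
TOKEN_RE = re.compile(r"\w+")


def clean_text(text):
    """Strip URLs, lowercase, and drop stop words from a text value."""
    without_urls = URL_RE.sub(" ", text)
    tokens = TOKEN_RE.findall(without_urls.lower())
    return " ".join(t for t in tokens if t not in STOP_WORDS)


print(clean_text("Les élèves aiment http://example.com/page le café"))
```

You would run this over the text column and then load the cleaned column as a “text” field as before.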