troubles parsing json files from ISTEX

CorText Manager Q&A forumCategory: Data processingtroubles parsing json files from ISTEX
Déborah Abhervé asked 3 years ago

Hi there ! We are trying to analyse a corpus of articles from the French journal “La Houille Blanche”, using the web service ISTEX : https://dl.istex.fr/?q=host.title%3A%22la+houille+blanche%22+&extract=enrichments%5Bmulticat%2Cnb%2Crefbibs%2Cteeft%2Cunitex%5D%3Bfulltext%5Bpdf%2Czip%5D&size=5841&rankBy=qualityOverRelevance&archiveType=zip&compressionLevel=9&sid=istex-dl&usage=1

We downloaded 100 articles with text files and json metadata. Now CorText won’t recognise the json files (“log file not found”) when attempting to parse. Here are the options we used : https://photos.app.goo.gl/oMgBDPFJXC3BoN2G8 Here’s the file used : https://drive.google.com/file/d/1Z62eRiXZNaE-Rg1SA5yOVEUV65lDgeTb/view?usp=sharing

Any idea ? Thanks in advance.

6 Answers
Déborah Abhervé answered 3 years ago

Thank you so much Lionel !
It finally works !
Déborah