TRTH download issues

We have some issues with TRTH v2 api. We have different jobs stuck. I've canceled them using the api, but they are still stuck there (screenshot below).


The real issue I have is with the download:
- It's an on-demand extraction of about 700mb csv file. The extraction has successfully completed and the file is ready for the download. I use the url

https://hosted.datascopeapi.reuters.com/RestApi/v1/Extractions/RawExtractionResults('0x05c9d0

26dafb3026')/$value

to download the stream to a local file.

The problem I have with a job like this one is that the download stops at 177mb, always!

Is there a download limit?

Can we also run 10-20 parallel downloads?

P.S: since yesterday night the username/password stopped to work (invalid username/password). I'm calling reuters..

Best Answer

  • alvise.susmel
    alvise.susmel Explorer
    Answer ✓

    The way I made the compression and the download work properly is avoiding to use "requests" library but using urllib3 python lib instead.

    With it is possible to ask to don't "decode" the content (if it gzipped it would uncompress it)

    headers = {
    'content-type': 'text/plain',
    'Authorization': "Token " + token,
    'Prefer': 'respond-async'
    }

    http = urllib3.PoolManager(timeout=urllib3.util.Timeout(connect=60.0, read=60.0))
    r = http.request("GET",url,headers=headers,preload_content=False,decode_content=False)

    # r is a stream

Answers