question


Retrieving TRIT relevance & confidence scores within Data Fusion

Can you point out how to retrieve these three scores using the Data Fusion API?

TRIT output in the viewer provides three different scores.

Relevance 80%

Continuous Relevance 85%

score 0.5898915

In the TRIT RDF the scores are output in two blocks.

</rdf:Description>
<rdf:Description rdf:about="http://d.opencalais.com/er/company/ralg-oa/4295897734">
  <rdf:type rdf:resource="http://s.opencalais.com/1/type/er/Company"/>
  <c:docId rdf:resource="http://d.opencalais.com/dochash-1/22067b0c-b3e7-3ab5-8511-ef6631deae1d"/>
  <c:name>BURBERRY GROUP PLC</c:name>
  <c:permid>4295897734</c:permid>
  <c:primaryric>BRBY.L</c:primaryric>
  <c:ispublic>true</c:ispublic>
  <c:commonname>Burberry Group</c:commonname>
  <c:score>0.5898915</c:score>
  <!--Burberry-->
  <c:subject rdf:resource="http://d.opencalais.com/comphash-1/af06f35f-41bd-3065-beb4-5f65223bdb35"/>

.
<rdf:Description rdf:about="http://d.opencalais.com/dochash-1/22067b0c-b3e7-3ab5-8511-ef6631deae1d/Relevance/2">
  <rdf:type rdf:resource="http://s.opencalais.com/1/type/sys/RelevanceInfo"/>
  <c:docId rdf:resource="http://d.opencalais.com/dochash-1/22067b0c-b3e7-3ab5-8511-ef6631deae1d"/>
  <c:subject rdf:resource="http://d.opencalais.com/comphash-1/af06f35f-41bd-3065-beb4-5f65223bdb35"/>
  <c:relevance>0.8</c:relevance>
  <c:relevancecont>0.85</c:relevancecont>
</rdf:Description>
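Since the two blocks share the same c:subject resource, one way to bring the three scores together outside Data Fusion is to parse the RDF and join on that value. This is only a rough sketch: it does not use the Data Fusion API, the function name collect_scores is made up for illustration, the c: namespace URI is assumed to be http://s.opencalais.com/1/pred/, and the sample is a trimmed copy of the snippet above wrapped in an rdf:RDF root so it parses.

```python
import xml.etree.ElementTree as ET

RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
C = "http://s.opencalais.com/1/pred/"  # assumption: namespace bound to the c: prefix

# Trimmed copy of the two blocks from the question, wrapped in a root element.
SAMPLE = f"""<rdf:RDF xmlns:rdf="{RDF}" xmlns:c="{C}">
  <rdf:Description rdf:about="http://d.opencalais.com/er/company/ralg-oa/4295897734">
    <rdf:type rdf:resource="http://s.opencalais.com/1/type/er/Company"/>
    <c:name>BURBERRY GROUP PLC</c:name>
    <c:score>0.5898915</c:score>
    <c:subject rdf:resource="http://d.opencalais.com/comphash-1/af06f35f-41bd-3065-beb4-5f65223bdb35"/>
  </rdf:Description>
  <rdf:Description rdf:about="http://d.opencalais.com/dochash-1/22067b0c-b3e7-3ab5-8511-ef6631deae1d/Relevance/2">
    <rdf:type rdf:resource="http://s.opencalais.com/1/type/sys/RelevanceInfo"/>
    <c:subject rdf:resource="http://d.opencalais.com/comphash-1/af06f35f-41bd-3065-beb4-5f65223bdb35"/>
    <c:relevance>0.8</c:relevance>
    <c:relevancecont>0.85</c:relevancecont>
  </rdf:Description>
</rdf:RDF>"""


def collect_scores(rdf_xml):
    """Merge score, relevance and relevancecont per c:subject resource."""
    root = ET.fromstring(rdf_xml)
    merged = {}
    for desc in root.iter(f"{{{RDF}}}Description"):
        subject = desc.find(f"{{{C}}}subject")
        if subject is None:
            continue  # not an entity or relevance block
        key = subject.get(f"{{{RDF}}}resource")
        entry = merged.setdefault(key, {})
        name = desc.findtext(f"{{{C}}}name")
        if name is not None:
            entry["name"] = name
        for field in ("score", "relevance", "relevancecont"):
            text = desc.findtext(f"{{{C}}}{field}")
            if text is not None:
                entry[field] = float(text)
    return merged


scores = collect_scores(SAMPLE)
for subject, values in scores.items():
    print(subject, values)
```

Joining on c:subject rather than on the rdf:about URI is a choice based on the sample above, where it is the one value both blocks have in common.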


1 Answer


Based on http://www.opencalais.com/wp-content/uploads/folder/ThomsonReutersOpenCalaisAPIUserGuideR11_0.pdf

c:score: A confidence score on a scale of 0 to 1. The value indicates the probability that the topic is indeed discussed in the text and also how centric the topic is to the text. The higher the value, the higher the probability.

The TRIT connector has a setting for this and ignores anything with a c:score under 0.50.

relevancecont: A more granular relevance score. This attribute is relevant only to Company entities.

We mostly ignore relevancecont and focus on relevance instead:

relevance: The relevance score. The higher the score, the greater the relevance of the entity to the document.

The TRIT connector automatically writes this relevance value into the edge as a score. We also ignore anything with relevance under 0.20.
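To make the two cutoffs concrete, here is a small sketch of the filtering as described above. It is not the connector's actual code: the function and constant names are hypothetical, and it assumes both checks must pass and that values exactly at the cutoff are kept.

```python
# Cutoffs taken from the description above; the names are illustrative only.
SCORE_MIN = 0.50      # entities with c:score below this are ignored
RELEVANCE_MIN = 0.20  # entities with relevance below this are ignored


def passes_thresholds(score, relevance,
                      score_min=SCORE_MIN, relevance_min=RELEVANCE_MIN):
    """Return True if an entity would survive both cutoffs (assumed AND logic)."""
    return score >= score_min and relevance >= relevance_min


# The Burberry entity from the question: c:score 0.5898915, relevance 0.8
print(passes_thresholds(0.5898915, 0.8))
```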