question

Upvotes
1 1 1 2

Retrieving TRIT relevance & confidence scores within Data Fusion

Can you point out how to retrieve these three scores using the Data Fusion API?

TRIT output in the viewer provides three different scores.

Relevance 80%

Continuous Relevance85%

score 0.5898915

In the TRIT RDF the scores are output in two blocks.

</rdf:Description> <rdf:Description rdf:about="http://d.opencalais.com/er/company/ralg-oa/4295897734">; <rdf:type rdf:resource="http://s.opencalais.com/1/type/er/Company"/>; <c:docId rdf:resource="http://d.opencalais.com/dochash-1/22067b0c-b3e7-3ab5-8511-ef6631deae1d"/>; <c:name>BURBERRY GROUP PLC</c:name> <c:permid>4295897734</c:permid> <c:primaryric>BRBY.L</c:primaryric> <c:ispublic>true</c:ispublic> <c:commonname>Burberry Group</c:commonname> <c:score>0.5898915</c:score> <!--Burberry--> <c:subject rdf:resource="http://d.opencalais.com/comphash-1/af06f35f-41bd-3065-beb4-5f65223bdb35"/>;

. <rdf:Description rdf:about="http://d.opencalais.com/dochash-1/22067b0c-b3e7-3ab5-8511-ef6631deae1d/Relevance/2">; <rdf:type rdf:resource="http://s.opencalais.com/1/type/sys/RelevanceInfo"/>; <c:docId rdf:resource="http://d.opencalais.com/dochash-1/22067b0c-b3e7-3ab5-8511-ef6631deae1d"/>; <c:subject rdf:resource="http://d.opencalais.com/comphash-1/af06f35f-41bd-3065-beb4-5f65223bdb35"/>; <c:relevance>0.8</c:relevance> <c:relevancecont>0.85</c:relevancecont> </rdf:Description>

newsdata-fusionintelligent-tagging
icon clock
10 |1500

Up to 2 attachments (including images) can be used with a maximum of 512.0 KiB each and 1.0 MiB total.

Asked Tomasz Adamusiak, Ola Bildtsen and Vladimir Kubatin to take a look into this.

1 Answer

· Write an Answer
Upvotes
1.2k 8 11 8

Based on http://www.opencalais.com/wp-content/uploads/folder/ThomsonReutersOpenCalaisAPIUserGuideR11_0.pdf

c:score: A confidence score on a scale of 0 to 1. The value indicates the probability that the topic is indeed discussed in the text and also how centric the topic is to the text. The higher the value, the higher the probability.

TRIT connector has a setting

and would ignore anything with c:score under .50.

relevancecont: A more granular relevance score. This attribute is relevant only to Company entities.

which we mostly ignore and focus on

relevance: The relevance score. The higher the score, the greater the relevance of the entity to the document.

as a score into the edge automatically in the TRIT connector. We're also ignoring anything with relevance under .20.


icon clock
10 |1500

Up to 2 attachments (including images) can be used with a maximum of 512.0 KiB each and 1.0 MiB total.

Write an Answer

Hint: Notify or tag a user in this post by typing @username.

Up to 2 attachments (including images) can be used with a maximum of 512.0 KiB each and 1.0 MiB total.