SocialTag: constrict analytics of text

Hello, I'm testing text POSTs to https://api.thomsonreuters.com/permid/calais. The text I send contains some html tags and plenty of punctuation marks. I understand I could write some regex to pull out the tags and punctuation before POST, but I was asked to see if there is a way to pass in some switches to disregard html and punctuation in text analytics. If there are other possibilities, I'd love to hear about it. Thank you for your time!

Find more posts tagged with

intelligent-tagging

intelligent-tagging-api

open-calais-api

semantic-metadata-tagging

Accepted answers

Eran S.

Hello,

You may try to use the header content-type=text/html and check the results.

Regards.

All comments

unknown

The issue is because of text containing html and punctualion, one of the Social Tags returned is "HTML" with importance 1 and another is "Punctuation" with importance 1. In a best case scenario for what we're thinking of doing, it would be best to constrict text analytics in a way to ignore html and punctuation.

Eran S.

Hello,

You may try to use the header content-type=text/html and check the results.

Regards.

unknown

Thank you Eran, that fixed my issue

EXPLORE OUR SITES