Discover Refinitiv
MyRefinitiv Refinitiv Perspectives Careers
Created with Sketch.
All APIs Questions & Answers  Register |  Login
Ask a question
  • Questions
  • Tags
  • Badges
  • Unanswered
Search:
  • Home /
  • Calais /
avatar image
REFINITIV
Question by jirapongse.phuriphanvichai · Feb 23, 2018 at 01:23 AM · webinartritwebinar tritsolr

Could you explain a little more about how I would use relevance or importance to boost ranking of those documents? (from TRIT Webinar Feb 8)

I really liked the SOLR query on the content aware search – could you explain a little more about how I would use relevance or importance to boost ranking of those documents. You mentioned some sort of custom ranking – how do I get to that from BM25?

People who like this

0 Show 0
Comment
10 |1500 characters needed characters left characters exceeded
▼
  • Viewable by all users
  • Viewable by moderators
  • Viewable by moderators and the original poster
  • Advanced visibility
Viewable by all users

Up to 2 attachments (including images) can be used with a maximum of 512.0 KiB each and 1.0 MiB total.

1 Reply

  • Sort: 
avatar image
REFINITIV
Best Answer
Answer by jirapongse.phuriphanvichai · Feb 23, 2018 at 03:56 AM

Basically BM25 is just a complicated math formula that assigns a score to a document for a particular query. This does work great but if you read about it, it really only takes you so far when it comes to a quality result.

You need to bring in other factors to the score, like in the page rank example of links to the document and links out from the document. This doesn't apply for everyone use case but its a very good digestible example and let's say number of clicks from other users in the past month or some form of rate of decay over time for usage. I am going to over simplify this next part. During the scoring process you can add these three features together to come up with your new score.

This is a great start but you aren't done yet because not all things are created equal you don't want to treat your features with the same weight. Depending on how much money and time you have (Also domain experts are important) you can come up with various approaches to finding the right weights to assign to each of these features before you add them up to a score for the document. Some use gold data, meaning for 100 queries for example (you will want to do a lot more) categorize the documents within the results you get back and use a support vector machine to have it come up with the weights you should use.

Another way to do this is looking from a demo like UI, make tweaks to the weights and iterate till its good enough. Doing the latter approach can be useful for a POC to internally prove the value of something, but you will have to be very careful if you introduce that to your external customers unless you very clearly set their expectations.

This question was first asked in How to Enhance Your Search Platforms webinar (8th of Feb). Please see here: How to Enhance Your Search Platforms

Comment

People who like this

0 Show 0 · Share
10 |1500 characters needed characters left characters exceeded
▼
  • Viewable by all users
  • Viewable by moderators
  • Viewable by moderators and the original poster
  • Advanced visibility
Viewable by all users

Up to 2 attachments (including images) can be used with a maximum of 512.0 KiB each and 1.0 MiB total.

Watch this question

Add to watch list
Add to your watch list to receive emailed updates for this question. Too many emails? Change your settings >
5 People are following this question.

Related Questions

If I want to handle all possible output from TRIT is there a full taxonomy that I could look at to understand more about Topics or Social Tags or Relationships say? (from TRIT Webinar Feb 8)

How long does it take to work with large corpuses – how long do documents take to tag typically? (from TRIT Webinar Feb 8)

What is the max file size I can send to TRIT? (from TRIT Webinar Feb 8)

Do you have any integration of TRIT with Salesforce? (from TRIT Webinar Feb 8)

Do you have any integration of TRIT with Dynamics CRM or Sharepoint? (from TRIT Webinar Feb 8)

  • Feedback
  • Copyright
  • Cookie Policy
  • Privacy Statement
  • Terms of Use
  • Careers
  • Anonymous
  • Sign in
  • Create
  • Ask a question
  • Spaces
  • Alpha
  • App Studio
  • Block Chain
  • Bot Platform
  • Calais
  • Connected Risk APIs
  • DSS
  • Data Fusion
  • Data Model Discovery
  • Datastream
  • Eikon COM
  • Eikon Data APIs
  • Elektron
    • EMA
    • ETA
    • WebSocket API
  • Legal One
  • Messenger Bot
  • Messenger Side by Side
  • ONESOURCE
    • Indirect Tax
  • Open PermID
    • Entity Search
  • Org ID
  • PAM
    • PAM - Logging
  • ProView
  • ProView Internal
  • Product Insight
  • Project Tracking
  • Refinitiv Data Platform
    • Refinitiv Data Platform Libraries
  • Rose's Space
  • Screening
    • Qual-ID API
    • Screening Deployed
    • Screening Online
    • World-Check One
    • World-Check One Zero Footprint
  • Side by Side Integration API
  • TR Knowledge Graph
  • TREP APIs
    • CAT
    • DACS Station
    • Open DACS
    • RFA
    • UPA
  • TREP Infrastructure
  • TRIT
  • TRKD
  • TRTH
  • Thomson One Smart
  • Transactions
    • REDI API
  • Velocity Analytics
  • Wealth Management Web Services
  • World-Check Data File
  • Explore
  • Tags
  • Questions
  • Badges