Legacy Product

Fusion 5.10
    Solr MoreLikeThis Stage

    This stage uses the content of the current document to query for similar documents, using Solr’s MoreLikeThis component.

    This stage provides content-based recommendations. For collaborative recommendations, use the Recommend Items for Item stage.

    See Recommendations and Boosting for more information.

    Tips

    • The incoming query must include an id field in order to get recommendations from this stage. The stage returns documents similar to the one specified by this field.

    • Since these secondary queries tend to be large, this stage can impact search performance. You can improve performance by first clustering your documents, then configuring this stage to query a specific document cluster instead of all documents.

    Configuration

    When entering configuration values in the UI, use unescaped characters, such as \t for the tab character. When entering configuration values in the API, use escaped characters, such as \\t for the tab character.
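    The difference between the two forms comes from JSON string escaping. The short sketch below uses only Python's standard `json` module to show how a value typed as `\t` in the UI becomes `\\t` inside a JSON request body. It illustrates the escaping rule only and does not call any Fusion API.

```python
import json

# The value as typed in the UI: two characters, a backslash followed by "t".
ui_value = r"\t"

# JSON encoding escapes the backslash, which gives the form used in an API body.
api_value = json.dumps(ui_value)

print(ui_value)   # \t
print(api_value)  # "\\t"

# Decoding the JSON text recovers the original two-character value.
assert json.loads(api_value) == ui_value
```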

    Returns results similar to a given item using Solr's MoreLikeThis component. Provide parameters for the component here. For more information, see https://cwiki.apache.org/confluence/display/solr/MoreLikeThis

    skip - boolean

    Set to true to skip this stage.

    Default: false

    label - string

    A unique label for this stage.

    <= 255 characters

    condition - string

    Define a conditional script that must result in true or false. This can be used to determine if the stage should process or not.

    useQueryParser - boolean, required

    Specifies whether to use the MLT Query Parser. If you enable this option, you must specify both a document ID to run the MLT query on and a field to run the MLT query against.

    Default: true

    moreLikeThisFields - array[string]

    Specifies the names of the fields to run the MLT query against. If you do not supply any fields, the stage defaults to the body field.

    docId - string

    Specifies the name of the ID field that identifies the document for which similar documents are found.

    Default: id
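    For orientation, Solr's MLT Query Parser takes the field as a `qf` local parameter and the document ID as the query value, for example `{!mlt qf=body}doc-42`. The sketch below builds such a string. The function name `build_mlt_query` and the sample ID are hypothetical, and whether this stage produces exactly this query text internally is an assumption, not something this page states.

```python
def build_mlt_query(doc_id_value, field="body", mintf=None, mindf=None):
    """Return an MLT Query Parser string such as {!mlt qf=body mindf=2}doc-42.

    Hypothetical helper for illustration; it is not part of Fusion or Solr.
    """
    if not doc_id_value:
        raise ValueError("the MLT Query Parser needs a document ID")
    if not field:
        raise ValueError("the MLT Query Parser needs a field")

    # Local parameters go inside {!mlt ...}; the document ID follows the brace.
    params = [f"qf={field}"]
    if mintf is not None:
        params.append(f"mintf={mintf}")
    if mindf is not None:
        params.append(f"mindf={mindf}")
    return "{!mlt " + " ".join(params) + "}" + str(doc_id_value)


print(build_mlt_query("doc-42", mindf=2))  # {!mlt qf=body mindf=2}doc-42
```

    The two `ValueError` checks mirror the requirement that both a document ID and a field are present when the query parser is used.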

    mindf - integer

    Specifies the minimum document frequency. Words that do not occur in at least this many documents are ignored.

    Default: 2

    maxdf - integer

    Specifies the maximum document frequency. Words that occur in more than this many documents are ignored.

    Default: 10000

    count - integer

    Specifies the number of similar documents to be returned for each result.

    mintf - integer

    Specifies the minimum term frequency. Terms that occur fewer than this many times in the source document are ignored.

    minwl - integer

    Sets the minimum word length. Shorter words are ignored by the MoreLikeThis component.

    Default: 3

    maxwl - integer

    Sets the maximum word length. Longer words are ignored.

    maxqt - integer

    Sets the maximum number of query terms included in any generated query.
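    The term filters above (mintf, mindf, maxdf, minwl, maxwl, maxqt) can be pictured with the following minimal sketch. It is a simplified illustration, not Solr's implementation: the function `select_terms`, the sample tokens, and the document-frequency table are hypothetical, and terms are ranked by raw term frequency here, whereas the real component scores terms differently.

```python
from collections import Counter


def select_terms(source_tokens, doc_freq, mintf=None, mindf=2, maxdf=10000,
                 minwl=3, maxwl=None, maxqt=None):
    """Pick candidate query terms from one source document (illustrative only)."""
    term_freq = Counter(source_tokens)
    kept = []
    for term, tf in term_freq.items():
        if mintf is not None and tf < mintf:
            continue  # too rare in the source document
        df = doc_freq.get(term, 0)
        if df < mindf or df > maxdf:
            continue  # too rare or too common across the index
        if len(term) < minwl:
            continue  # word too short
        if maxwl is not None and len(term) > maxwl:
            continue  # word too long
        kept.append((term, tf))

    # Simplification: order by term frequency, then alphabetically.
    kept.sort(key=lambda pair: (-pair[1], pair[0]))
    terms = [term for term, _ in kept]
    return terms if maxqt is None else terms[:maxqt]


tokens = ["solr", "search", "solr", "of", "index", "rare"]
doc_freq = {"solr": 50, "search": 20000, "of": 90000, "index": 300, "rare": 1}

print(select_terms(tokens, doc_freq))           # ['solr', 'index']
print(select_terms(tokens, doc_freq, maxqt=1))  # ['solr']
```

    In this sample, "search" and "of" exceed maxdf, "rare" falls below mindf, and the remaining terms pass every filter.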

    maxntp - integer

    Sets the maximum number of tokens to parse in each example document that is not stored with term vector (TV) support.

    boost - boolean

    Specifies whether the query is boosted by the relevance of the interesting terms.

    matchInclude - boolean

    Specifies whether the response should include the matched document.

    matchOffset - integer

    Specifies an offset into the main query results to find the document on which the MoreLikeThis query should operate. The default is 0.

    interestingTerms - string

    Controls how the MoreLikeThis component presents the interesting terms. Three settings are supported: list returns the terms, none returns no terms, and details returns the terms with their boosts.

    Allowed values: list, none, details
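    Putting the properties together, the sketch below assembles a configuration object from the property names and defaults listed on this page and checks the two documented constraints (label length and the allowed interestingTerms values). The helper `make_stage_config` is hypothetical, the sample values are arbitrary, and the stage's type identifier and any surrounding pipeline structure are left out because this page does not give them.

```python
import json

ALLOWED_INTERESTING_TERMS = {"list", "none", "details"}


def make_stage_config(**overrides):
    """Build a stage configuration dict (hypothetical helper, for illustration)."""
    # Defaults as documented on this page; properties without a documented
    # default are simply omitted unless the caller supplies them.
    config = {
        "skip": False,
        "useQueryParser": True,
        "docId": "id",
        "mindf": 2,
        "maxdf": 10000,
        "minwl": 3,
    }
    config.update(overrides)

    label = config.get("label")
    if label is not None and len(label) > 255:
        raise ValueError("label must be 255 characters or fewer")

    terms = config.get("interestingTerms")
    if terms is not None and terms not in ALLOWED_INTERESTING_TERMS:
        raise ValueError("interestingTerms must be one of list, none, details")

    return config


config = make_stage_config(
    label="similar-docs",
    moreLikeThisFields=["body"],
    count=5,
    interestingTerms="details",
)
print(json.dumps(config, indent=2))
```

    Validating before sending keeps obviously invalid values, such as an unsupported interestingTerms setting, from reaching the API.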