Legacy Product

Fusion 5.10
    Fusion 5.10

    Jobs Configuration

    These reference topics provide complete information about configuration properties for the Spark jobs that are enabled with a Fusion AI license.

    For conceptual information and instructions for configuring and scheduling jobs, see Jobs and Schedules.

    Additional jobs are available as part of the basic Fusion Server feature set.

    • ALS Recommender

      Use this job when you want to compute user recommendations or item similarities using a collaborative filtering recommender. You can also implement a user-to-item recommender in the advanced section of this job’s configuration UI. This job uses SparkML’s Alternating Least Squares (ALS).

    • Cluster Labeling

      Use this job when you already have clusters or well-defined document categories, and you want to discover and attach keywords to see representative words within those existing clusters. (If you want to create new clusters, use the Document Clustering job.)

    • Co-occurrence Similarity

      Use this job when you only want to compute item-to-item similarities. This method is more lightweight than the generic Recommendations job.

    • Collection Analysis

      Use this job when you want to compute basic metrics about your collection, like average word length, phrase percentages, and outlier documents (with very many or very few documents).

    • Ground Truth

      Estimate ground truth queries using click signals and query signals, with document relevance per query determined using a click/skip formula.

    • Head/Tail Analysis

      Perform head/tail analysis of queries from collections of raw or aggregated signals, to identify underperforming queries and the reasons. This information is valuable for improving overall conversions, Solr configurations, auto-suggest, product catalogs, and SEO/SEM strategies, in order to improve conversion rates.

    • Item Similarity Recommender

      Use this job when you want to compute user recommendations based on pre-computed item similarities.

    • Levenshtein Spell Checking

      Compute the edit distance between all values in a field.

      Levenshtein Spell Checking job is deprecated as of Fusion AI 4.1.0. Use the Token and Phrase Spell Correction job instead.
    • Logistic Regression Classifier Training

      Train a regularized logistic regression model for text classification.

    • Matrix Decomposition-Based Query-Query Similarity

      Train a collaborative filtering matrix decomposition recommender using SparkML’s Alternating Least Squares (ALS) to batch-compute query-query similarities.

    • Outlier Detection

      Use this job when you want to find outliers from a set of documents and attach labels for each outlier group.