跳转至

Similarity score(相似度分数(Similarity score))

Supported in: Batch

Returns the similarity score of two embedding vectors.

Expression categories: Distance measurement, Numeric

Declared arguments

  • Left embedded vector: The left embedded vector.
    Expression\
  • Right embedded vector: The right embedded vector.
    Expression\
  • Similarity metric: The similarity metric for comparing the left and right embeddings.
    Enum\

Type variable bounds: T accepts Array\

Output type: Double

Examples

Example 1: Base case

Description: Cosine similarity of the Ada embeddings for the word 'palantir' and 'foundry'.

Argument values:

  • Left embedded vector: leftEmbeddedVector
  • Right embedded vector: rightEmbeddedVector
  • Similarity metric: COSINE_SIMILARITY
leftEmbeddedVector rightEmbeddedVector Output
[ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... [ -0.0046147984, -0.014344796, -0.022795992, -0.035806388, -0.028467191, 0.026243191, -0.028161392, ... 0.7814455755180517

Example 2: Base case

Description: Cosine similarity between the Ada embeddings for the word 'palantir'.

Argument values:

  • Left embedded vector: leftEmbeddedVector
  • Right embedded vector: rightEmbeddedVector
  • Similarity metric: COSINE_SIMILARITY
leftEmbeddedVector rightEmbeddedVector Output
[ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... [ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... 1.0

Example 3: Base case

Description: Dot product of the Ada embeddings for the word 'palantir' and 'foundry'.

Argument values:

  • Left embedded vector: leftEmbeddedVector
  • Right embedded vector: rightEmbeddedVector
  • Similarity metric: DOT_PRODUCT
leftEmbeddedVector rightEmbeddedVector Output
[ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... [ -0.0046147984, -0.014344796, -0.022795992, -0.035806388, -0.028467191, 0.026243191, -0.028161392, ... 0.7814455030932973

Example 4: Base case

Description: Dot product of the Ada embeddings for the word 'palantir'.

Argument values:

  • Left embedded vector: leftEmbeddedVector
  • Right embedded vector: rightEmbeddedVector
  • Similarity metric: DOT_PRODUCT
leftEmbeddedVector rightEmbeddedVector Output
[ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... [ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... 1.0

Example 5: Base case

Description: Euclidean distance between the Ada embeddings for the word 'palantir' and 'foundry'.

Argument values:

  • Left embedded vector: leftEmbeddedVector
  • Right embedded vector: rightEmbeddedVector
  • Similarity metric: EUCLIDEAN_DISTANCE
leftEmbeddedVector rightEmbeddedVector Output
[ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... [ -0.0046147984, -0.014344796, -0.022795992, -0.035806388, -0.028467191, 0.026243191, -0.028161392, ... 0.6611420486192364

Example 6: Base case

Description: Euclidean distance between the Ada embeddings for the word 'palantir'.

Argument values:

  • Left embedded vector: leftEmbeddedVector
  • Right embedded vector: rightEmbeddedVector
  • Similarity metric: EUCLIDEAN_DISTANCE
leftEmbeddedVector rightEmbeddedVector Output
[ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... [ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... 0.0

Example 7: Null case

Description: Null inputs should have a null output

Argument values:

  • Left embedded vector: leftEmbeddedVector
  • Right embedded vector: rightEmbeddedVector
  • Similarity metric: COSINE_SIMILARITY
leftEmbeddedVector rightEmbeddedVector Output
null null null

Example 8: Null case

Description: Null inputs should have a null output

Argument values:

  • Left embedded vector: leftEmbeddedVector
  • Right embedded vector: rightEmbeddedVector
  • Similarity metric: DOT_PRODUCT
leftEmbeddedVector rightEmbeddedVector Output
null null null

Example 9: Null case

Description: Null inputs should have a null output

Argument values:

  • Left embedded vector: leftEmbeddedVector
  • Right embedded vector: rightEmbeddedVector
  • Similarity metric: EUCLIDEAN_DISTANCE
leftEmbeddedVector rightEmbeddedVector Output
null null null

Example 10: Edge case

Description: Regular arrays become null when arrays have different length

Argument values:

  • Left embedded vector: leftArray
  • Right embedded vector: rightArray
  • Similarity metric: DOT_PRODUCT
leftArray rightArray Output
[ 1.0, 2.0 ] [ 1.0, 3.0 ] 7.0
[ 1.0, 2.0, 3.0 ] [ 1.0, 3.0 ] null
[ 1.0, 2.0 ] [ 1.0, 2.0, 3.0 ] null
[ 1.0, 2.0 ] null null
null [ 1.0, 2.0 ] null


中文翻译


相似度分数(Similarity score)

支持模式:批处理(Batch)

返回两个嵌入向量(embedding vectors)的相似度分数。

表达式类别: 距离测量(Distance measurement)、数值(Numeric)

声明的参数

  • 左嵌入向量(Left embedded vector): 左侧的嵌入向量。
    表达式\
  • 右嵌入向量(Right embedded vector): 右侧的嵌入向量。
    表达式\
  • 相似度度量(Similarity metric): 用于比较左右嵌入的相似度度量标准。
    枚举\<余弦距离(Cosine Distance)、余弦相似度(Cosine Similarity)、点积(Dot Product)、欧几里得距离(Euclidean Distance)>

类型变量约束: T 接受 Array\

输出类型: Double

示例

示例 1:基础情况

描述: 单词 'palantir' 和 'foundry' 的 Ada 嵌入之间的余弦相似度。

参数值:

  • 左嵌入向量: leftEmbeddedVector
  • 右嵌入向量: rightEmbeddedVector
  • 相似度度量: COSINE_SIMILARITY
leftEmbeddedVector rightEmbeddedVector 输出
[ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... [ -0.0046147984, -0.014344796, -0.022795992, -0.035806388, -0.028467191, 0.026243191, -0.028161392, ... 0.7814455755180517

示例 2:基础情况

描述: 单词 'palantir' 的 Ada 嵌入之间的余弦相似度。

参数值:

  • 左嵌入向量: leftEmbeddedVector
  • 右嵌入向量: rightEmbeddedVector
  • 相似度度量: COSINE_SIMILARITY
leftEmbeddedVector rightEmbeddedVector 输出
[ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... [ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... 1.0

示例 3:基础情况

描述: 单词 'palantir' 和 'foundry' 的 Ada 嵌入之间的点积。

参数值:

  • 左嵌入向量: leftEmbeddedVector
  • 右嵌入向量: rightEmbeddedVector
  • 相似度度量: DOT_PRODUCT
leftEmbeddedVector rightEmbeddedVector 输出
[ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... [ -0.0046147984, -0.014344796, -0.022795992, -0.035806388, -0.028467191, 0.026243191, -0.028161392, ... 0.7814455030932973

示例 4:基础情况

描述: 单词 'palantir' 的 Ada 嵌入之间的点积。

参数值:

  • 左嵌入向量: leftEmbeddedVector
  • 右嵌入向量: rightEmbeddedVector
  • 相似度度量: DOT_PRODUCT
leftEmbeddedVector rightEmbeddedVector 输出
[ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... [ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... 1.0

示例 5:基础情况

描述: 单词 'palantir' 和 'foundry' 的 Ada 嵌入之间的欧几里得距离。

参数值:

  • 左嵌入向量: leftEmbeddedVector
  • 右嵌入向量: rightEmbeddedVector
  • 相似度度量: EUCLIDEAN_DISTANCE
leftEmbeddedVector rightEmbeddedVector 输出
[ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... [ -0.0046147984, -0.014344796, -0.022795992, -0.035806388, -0.028467191, 0.026243191, -0.028161392, ... 0.6611420486192364

示例 6:基础情况

描述: 单词 'palantir' 的 Ada 嵌入之间的欧几里得距离。

参数值:

  • 左嵌入向量: leftEmbeddedVector
  • 右嵌入向量: rightEmbeddedVector
  • 相似度度量: EUCLIDEAN_DISTANCE
leftEmbeddedVector rightEmbeddedVector 输出
[ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... [ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... 0.0

示例 7:空值情况

描述: 空输入应返回空输出

参数值:

  • 左嵌入向量: leftEmbeddedVector
  • 右嵌入向量: rightEmbeddedVector
  • 相似度度量: COSINE_SIMILARITY
leftEmbeddedVector rightEmbeddedVector 输出
null null null

示例 8:空值情况

描述: 空输入应返回空输出

参数值:

  • 左嵌入向量: leftEmbeddedVector
  • 右嵌入向量: rightEmbeddedVector
  • 相似度度量: DOT_PRODUCT
leftEmbeddedVector rightEmbeddedVector 输出
null null null

示例 9:空值情况

描述: 空输入应返回空输出

参数值:

  • 左嵌入向量: leftEmbeddedVector
  • 右嵌入向量: rightEmbeddedVector
  • 相似度度量: EUCLIDEAN_DISTANCE
leftEmbeddedVector rightEmbeddedVector 输出
null null null

示例 10:边界情况

描述: 当数组长度不同时,常规数组将变为空值

参数值:

  • 左嵌入向量: leftArray
  • 右嵌入向量: rightArray
  • 相似度度量: DOT_PRODUCT
leftArray rightArray 输出
[ 1.0, 2.0 ] [ 1.0, 3.0 ] 7.0
[ 1.0, 2.0, 3.0 ] [ 1.0, 3.0 ] null
[ 1.0, 2.0 ] [ 1.0, 2.0, 3.0 ] null
[ 1.0, 2.0 ] null null
null [ 1.0, 2.0 ] null