Similarity score(相似度分数(Similarity score))¶
Supported in: Batch
Returns the similarity score of two embedding vectors.
Expression categories: Distance measurement, Numeric
Declared arguments¶
- Left embedded vector: The left embedded vector.
Expression\ - Right embedded vector: The right embedded vector.
Expression\ - Similarity metric: The similarity metric for comparing the left and right embeddings.
Enum\
Type variable bounds: T accepts Array\
Output type: Double
Examples¶
Example 1: Base case¶
Description: Cosine similarity of the Ada embeddings for the word 'palantir' and 'foundry'.
Argument values:
- Left embedded vector:
leftEmbeddedVector - Right embedded vector:
rightEmbeddedVector - Similarity metric:
COSINE_SIMILARITY
| leftEmbeddedVector | rightEmbeddedVector | Output |
|---|---|---|
| [ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... | [ -0.0046147984, -0.014344796, -0.022795992, -0.035806388, -0.028467191, 0.026243191, -0.028161392, ... | 0.7814455755180517 |
Example 2: Base case¶
Description: Cosine similarity between the Ada embeddings for the word 'palantir'.
Argument values:
- Left embedded vector:
leftEmbeddedVector - Right embedded vector:
rightEmbeddedVector - Similarity metric:
COSINE_SIMILARITY
| leftEmbeddedVector | rightEmbeddedVector | Output |
|---|---|---|
| [ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... | [ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... | 1.0 |
Example 3: Base case¶
Description: Dot product of the Ada embeddings for the word 'palantir' and 'foundry'.
Argument values:
- Left embedded vector:
leftEmbeddedVector - Right embedded vector:
rightEmbeddedVector - Similarity metric:
DOT_PRODUCT
| leftEmbeddedVector | rightEmbeddedVector | Output |
|---|---|---|
| [ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... | [ -0.0046147984, -0.014344796, -0.022795992, -0.035806388, -0.028467191, 0.026243191, -0.028161392, ... | 0.7814455030932973 |
Example 4: Base case¶
Description: Dot product of the Ada embeddings for the word 'palantir'.
Argument values:
- Left embedded vector:
leftEmbeddedVector - Right embedded vector:
rightEmbeddedVector - Similarity metric:
DOT_PRODUCT
| leftEmbeddedVector | rightEmbeddedVector | Output |
|---|---|---|
| [ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... | [ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... | 1.0 |
Example 5: Base case¶
Description: Euclidean distance between the Ada embeddings for the word 'palantir' and 'foundry'.
Argument values:
- Left embedded vector:
leftEmbeddedVector - Right embedded vector:
rightEmbeddedVector - Similarity metric:
EUCLIDEAN_DISTANCE
| leftEmbeddedVector | rightEmbeddedVector | Output |
|---|---|---|
| [ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... | [ -0.0046147984, -0.014344796, -0.022795992, -0.035806388, -0.028467191, 0.026243191, -0.028161392, ... | 0.6611420486192364 |
Example 6: Base case¶
Description: Euclidean distance between the Ada embeddings for the word 'palantir'.
Argument values:
- Left embedded vector:
leftEmbeddedVector - Right embedded vector:
rightEmbeddedVector - Similarity metric:
EUCLIDEAN_DISTANCE
| leftEmbeddedVector | rightEmbeddedVector | Output |
|---|---|---|
| [ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... | [ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... | 0.0 |
Example 7: Null case¶
Description: Null inputs should have a null output
Argument values:
- Left embedded vector:
leftEmbeddedVector - Right embedded vector:
rightEmbeddedVector - Similarity metric:
COSINE_SIMILARITY
| leftEmbeddedVector | rightEmbeddedVector | Output |
|---|---|---|
| null | null | null |
Example 8: Null case¶
Description: Null inputs should have a null output
Argument values:
- Left embedded vector:
leftEmbeddedVector - Right embedded vector:
rightEmbeddedVector - Similarity metric:
DOT_PRODUCT
| leftEmbeddedVector | rightEmbeddedVector | Output |
|---|---|---|
| null | null | null |
Example 9: Null case¶
Description: Null inputs should have a null output
Argument values:
- Left embedded vector:
leftEmbeddedVector - Right embedded vector:
rightEmbeddedVector - Similarity metric:
EUCLIDEAN_DISTANCE
| leftEmbeddedVector | rightEmbeddedVector | Output |
|---|---|---|
| null | null | null |
Example 10: Edge case¶
Description: Regular arrays become null when arrays have different length
Argument values:
- Left embedded vector:
leftArray - Right embedded vector:
rightArray - Similarity metric:
DOT_PRODUCT
| leftArray | rightArray | Output |
|---|---|---|
| [ 1.0, 2.0 ] | [ 1.0, 3.0 ] | 7.0 |
| [ 1.0, 2.0, 3.0 ] | [ 1.0, 3.0 ] | null |
| [ 1.0, 2.0 ] | [ 1.0, 2.0, 3.0 ] | null |
| [ 1.0, 2.0 ] | null | null |
| null | [ 1.0, 2.0 ] | null |
中文翻译¶
相似度分数(Similarity score)¶
支持模式:批处理(Batch)
返回两个嵌入向量(embedding vectors)的相似度分数。
表达式类别: 距离测量(Distance measurement)、数值(Numeric)
声明的参数¶
- 左嵌入向量(Left embedded vector): 左侧的嵌入向量。
表达式\ - 右嵌入向量(Right embedded vector): 右侧的嵌入向量。
表达式\ - 相似度度量(Similarity metric): 用于比较左右嵌入的相似度度量标准。
枚举\<余弦距离(Cosine Distance)、余弦相似度(Cosine Similarity)、点积(Dot Product)、欧几里得距离(Euclidean Distance)>
类型变量约束: T 接受 Array\
输出类型: Double
示例¶
示例 1:基础情况¶
描述: 单词 'palantir' 和 'foundry' 的 Ada 嵌入之间的余弦相似度。
参数值:
- 左嵌入向量:
leftEmbeddedVector - 右嵌入向量:
rightEmbeddedVector - 相似度度量:
COSINE_SIMILARITY
| leftEmbeddedVector | rightEmbeddedVector | 输出 |
|---|---|---|
| [ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... | [ -0.0046147984, -0.014344796, -0.022795992, -0.035806388, -0.028467191, 0.026243191, -0.028161392, ... | 0.7814455755180517 |
示例 2:基础情况¶
描述: 单词 'palantir' 的 Ada 嵌入之间的余弦相似度。
参数值:
- 左嵌入向量:
leftEmbeddedVector - 右嵌入向量:
rightEmbeddedVector - 相似度度量:
COSINE_SIMILARITY
| leftEmbeddedVector | rightEmbeddedVector | 输出 |
|---|---|---|
| [ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... | [ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... | 1.0 |
示例 3:基础情况¶
描述: 单词 'palantir' 和 'foundry' 的 Ada 嵌入之间的点积。
参数值:
- 左嵌入向量:
leftEmbeddedVector - 右嵌入向量:
rightEmbeddedVector - 相似度度量:
DOT_PRODUCT
| leftEmbeddedVector | rightEmbeddedVector | 输出 |
|---|---|---|
| [ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... | [ -0.0046147984, -0.014344796, -0.022795992, -0.035806388, -0.028467191, 0.026243191, -0.028161392, ... | 0.7814455030932973 |
示例 4:基础情况¶
描述: 单词 'palantir' 的 Ada 嵌入之间的点积。
参数值:
- 左嵌入向量:
leftEmbeddedVector - 右嵌入向量:
rightEmbeddedVector - 相似度度量:
DOT_PRODUCT
| leftEmbeddedVector | rightEmbeddedVector | 输出 |
|---|---|---|
| [ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... | [ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... | 1.0 |
示例 5:基础情况¶
描述: 单词 'palantir' 和 'foundry' 的 Ada 嵌入之间的欧几里得距离。
参数值:
- 左嵌入向量:
leftEmbeddedVector - 右嵌入向量:
rightEmbeddedVector - 相似度度量:
EUCLIDEAN_DISTANCE
| leftEmbeddedVector | rightEmbeddedVector | 输出 |
|---|---|---|
| [ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... | [ -0.0046147984, -0.014344796, -0.022795992, -0.035806388, -0.028467191, 0.026243191, -0.028161392, ... | 0.6611420486192364 |
示例 6:基础情况¶
描述: 单词 'palantir' 的 Ada 嵌入之间的欧几里得距离。
参数值:
- 左嵌入向量:
leftEmbeddedVector - 右嵌入向量:
rightEmbeddedVector - 相似度度量:
EUCLIDEAN_DISTANCE
| leftEmbeddedVector | rightEmbeddedVector | 输出 |
|---|---|---|
| [ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... | [ -0.019182289, -0.02127992, 0.009529043, -0.008066221, -0.0014429842, 0.019154688, -0.023556953, -0... | 0.0 |
示例 7:空值情况¶
描述: 空输入应返回空输出
参数值:
- 左嵌入向量:
leftEmbeddedVector - 右嵌入向量:
rightEmbeddedVector - 相似度度量:
COSINE_SIMILARITY
| leftEmbeddedVector | rightEmbeddedVector | 输出 |
|---|---|---|
| null | null | null |
示例 8:空值情况¶
描述: 空输入应返回空输出
参数值:
- 左嵌入向量:
leftEmbeddedVector - 右嵌入向量:
rightEmbeddedVector - 相似度度量:
DOT_PRODUCT
| leftEmbeddedVector | rightEmbeddedVector | 输出 |
|---|---|---|
| null | null | null |
示例 9:空值情况¶
描述: 空输入应返回空输出
参数值:
- 左嵌入向量:
leftEmbeddedVector - 右嵌入向量:
rightEmbeddedVector - 相似度度量:
EUCLIDEAN_DISTANCE
| leftEmbeddedVector | rightEmbeddedVector | 输出 |
|---|---|---|
| null | null | null |
示例 10:边界情况¶
描述: 当数组长度不同时,常规数组将变为空值
参数值:
- 左嵌入向量:
leftArray - 右嵌入向量:
rightArray - 相似度度量:
DOT_PRODUCT
| leftArray | rightArray | 输出 |
|---|---|---|
| [ 1.0, 2.0 ] | [ 1.0, 3.0 ] | 7.0 |
| [ 1.0, 2.0, 3.0 ] | [ 1.0, 3.0 ] | null |
| [ 1.0, 2.0 ] | [ 1.0, 2.0, 3.0 ] | null |
| [ 1.0, 2.0 ] | null | null |
| null | [ 1.0, 2.0 ] | null |