Compute usage with AIP(AIP 计算用量)¶
AIP compute usage involves large language models (LLMs). Fundamentally, LLMs take text as an input and respond with text as an output. The amount of text input and output is measured in tokens. Compute usage for LLMs is measured in compute-seconds per some number of tokens. Different models may have different rates for compute usage, as described below.
Tokens in AIP¶
Tokens are the basic units of text that LLMs use to process and understand input. A token can be as short as a single character or as long as a whole word depending on the language and the specific model.
Importantly, tokens do not map one-to-one with words. For example, common words might be a single token, but longer or less common words may be split into multiple tokens. Even punctuation marks and spaces can be considered tokens.
Different model providers have distinct definitions for what constitutes a token; for instance, OpenAI ↗ and Anthropic ↗. On average, tokens are around 4 characters long, with a character being a single letter or punctuation mark.
In AIP, tokens are consumed by applications that send prompts to and receive prompts from LLMs. Each of these prompts and responses consist of a measurable number of tokens. These tokens can be sent to multiple LLM providers; due to differences between providers, these tokens are converted into compute-seconds to match the price of the underlying model provider.
All applications that provide LLM-backed capabilities consume tokens when being used. See the following list for the set of applications that may use tokens when you interact with their LLM-backed capabilities.
- AIP Assist
- AIP Logic
- AIP Error Enhancer
- AIP Code Assist
- AIP Analyst
- AI FDE
- Workshop LLM-backed tools
- Quiver LLM-backed tools
- Pipeline Builder LLM-backed tools
- Direct calls to the Language Model Service (including both Python and TypeScript libraries)
AIP routes text directly to backing LLMs which run the tokenization themselves. The size of the text will dictate the amount of compute that is used by the backing model to serve the response.
Take the following example sentence that is sent to the GPT-4o model.
AIP incorporates all of Palantir's advanced security measures for the protection of sensitive data in compliance with industry regulations.
This sentence contains 140 characters and will tokenize in the following way, with a | character separating each token. Note that a token is not always equivalent to a word; some words are broken into multiple tokens, like AIP and Palantir in the example below.
A|IP| incorporates| all| of| Pal|ant|ir|'s| advanced| security| measures| for| the| protection| of| sensitive| data| in| compliance| with| industry| regulations|.
This sentence contains 24 tokens and will use the following number of compute-seconds:
compute-seconds = 24 tokens * 43 compute-seconds / 10,000 tokens
compute-seconds = 24 * 43 / 10,000
compute-seconds = 0.1032
The number of tokens and characters in the above sentence was verified with OpenAI's Tokenizer feature ↗.
Understanding drivers of compute usage with AIP¶
Usage of compute-seconds resulting from LLM tokens is attached directly to the individual application resource that requests the usage. For example, if you use AIP to automatically explain a pipeline in Pipeline Builder, the compute-seconds used by the LLM to generate that explanation will be attributed to that specific pipeline. This is true across the platform; keeping this in mind will help you track where you are using tokens.
In some cases, compute usage is not attributable to a single resource in the platform; examples include AIP Assist and Error Explainer, among others. When usage is not attributable to a single resource, the tokens will be attributed to the user folder initiating the use of tokens.
We recommend staying aware of the tokens that are sent to LLMs on your behalf. Generally, the more information that you include when using LLMs, the more compute-seconds will be used. For example, the following scenarios describe different ways of using compute-seconds.
- In Pipeline Builder, you can ask AIP to explain your transformation nodes; the number of selected nodes affects the number of tokens used by the LLM to generate a response, and thus compute-second usage. This is because as the number of nodes increases, so does the amount of text the LLM must process regarding the configuration of those nodes.
- In AIP Assist, asking the LLM to generate large blocks of code requires more output tokens. Shorter responses use fewer tokens and thus less compute.
- In AIP Logic, sending large amounts of text with your prompts requires more tokens and thus more compute-seconds.
Exporting AIP token usage data¶
To analyze your enrollment's LLM usage in detail, you can export the AIP Token Usage dataset from the Internal dataset export section in Control Panel. This dataset provides daily breakdowns of token consumption by model and resource, along with the corresponding usage in compute-seconds and currency. For more information, see Internal dataset export.
Measuring compute with AIP¶
:::callout{theme="neutral"} If you have an enterprise contract with Palantir, contact your Palantir representative before proceeding with compute usage calculations. :::
| Model | Foundry cloud provider | Foundry region | Compute seconds per 10k input tokens | Compute seconds per 10k output tokens |
|---|---|---|---|---|
| Grok-2 ↗ | AWS | North America | 36 | 182 |
| AWS | EU / UK | 31 | 154 | |
| AWS | South America / APAC / Middle East | 25 | 125 | |
| Grok-2-Vision ↗ | AWS | North America | 36 | 182 |
| AWS | EU / UK | 31 | 154 | |
| AWS | South America / APAC / Middle East | 25 | 125 | |
| Grok-3 ↗ | AWS | North America | 55 | 273 |
| AWS | EU / UK | 46 | 231 | |
| AWS | South America / APAC / Middle East | 38 | 188 | |
| Grok-3-Mini-Reasoning ↗ | AWS | North America | 5.5 | 9.1 |
| AWS | EU / UK | 4.6 | 7.7 | |
| AWS | South America / APAC / Middle East | 3.8 | 6.3 | |
| Grok-4 <= 128k tokens ↗ | AWS | North America | 54.5 | 272.7 |
| AWS | EU / UK | 46.2 | 230.8 | |
| AWS | South America / APAC / Middle East | 37.5 | 187.5 | |
| Grok-4 > 128k tokens ↗ | AWS | North America | 109.1 | 545.5 |
| AWS | EU / UK | 92.3 | 461.5 | |
| AWS | South America / APAC / Middle East | 75.0 | 375.0 | |
| Grok-4 Fast Reasoning <= 128k tokens ↗ | AWS | North America | 3.6 | 9.1 |
| AWS | EU / UK | 3.1 | 7.7 | |
| AWS | South America / APAC / Middle East | 2.5 | 6.3 | |
| Grok-4 Fast Reasoning > 128k tokens ↗ | AWS | North America | 7.3 | 18.2 |
| AWS | EU / UK | 6.2 | 15.4 | |
| AWS | South America / APAC / Middle East | 5.0 | 12.5 | |
| Grok-4 Fast Non-Reasoning <= 128k tokens ↗ | AWS | North America | 3.6 | 9.1 |
| AWS | EU / UK | 3.1 | 7.7 | |
| AWS | South America / APAC / Middle East | 2.5 | 6.3 | |
| Grok-4 Fast Non-Reasoning > 128k tokens ↗ | AWS | North America | 7.3 | 18.2 |
| AWS | EU / UK | 6.2 | 15.4 | |
| AWS | South America / APAC / Middle East | 5.0 | 12.5 | |
| Grok Code Fast 1 ↗ | AWS | North America | 3.6 | 27.3 |
| AWS | EU / UK | 3.1 | 23.1 | |
| AWS | South America / APAC / Middle East | 2.5 | 18.8 | |
| Grok-4.1 Fast Non-Reasoning ↗ | AWS | North America | 3.6 | 9.1 |
| AWS | EU / UK | 3.1 | 7.7 | |
| AWS | South America / APAC / Middle East | 2.5 | 6.3 | |
| Grok-4.1 Fast Reasoning ↗ | AWS | North America | 3.6 | 9.1 |
| AWS | EU / UK | 3.1 | 7.7 | |
| AWS | South America / APAC / Middle East | 2.5 | 6.3 | |
| Grok-4.20 Reasoning <= 200k tokens ↗ | AWS | North America | 36.4 | 109.1 |
| AWS | EU / UK | 30.8 | 92.3 | |
| AWS | South America / APAC / Middle East | 25.0 | 75.0 | |
| Grok-4.20 Reasoning > 200k tokens ↗ | AWS | North America | 72.7 | 218.2 |
| AWS | EU / UK | 61.5 | 184.6 | |
| AWS | South America / APAC / Middle East | 50.0 | 150.0 | |
| Grok-4.20 Non-Reasoning <= 200k tokens ↗ | AWS | North America | 36.4 | 109.1 |
| AWS | EU / UK | 30.8 | 92.3 | |
| AWS | South America / APAC / Middle East | 25.0 | 75.0 | |
| Grok-4.20 Non-Reasoning > 200k tokens ↗ | AWS | North America | 72.7 | 218.2 |
| AWS | EU / UK | 61.5 | 184.6 | |
| AWS | South America / APAC / Middle East | 50.0 | 150.0 | |
| GPT-4.5 ↗ | AWS | North America | 1159.1 | 2318.2 |
| AWS | EU / UK | 980.8 | 1961.5 | |
| AWS | South America / APAC / Middle East | 796.9 | 1593.8 | |
| GPT-4o ↗ | AWS | North America | 43 | 172 |
| AWS | EU / UK | 36 | 145 | |
| AWS | South America / APAC / Middle East | 30 | 118 | |
| GPT-4o mini ↗ | AWS | North America | 2.6 | 10.3 |
| AWS | EU / UK | 2.2 | 8.7 | |
| AWS | South America / APAC / Middle East | 1.8 | 7.1 | |
| GPT-4.1 ↗ | AWS | North America | 31 | 124 |
| AWS | EU / UK | 26 | 105 | |
| AWS | South America / APAC / Middle East | 21 | 85 | |
| GPT-4.1-mini ↗ | AWS | North America | 6.2 | 24.7 |
| AWS | EU / UK | 5.2 | 20.9 | |
| AWS | South America / APAC / Middle East | 4.3 | 17 | |
| GPT-4.1-nano ↗ | AWS | North America | 1.5 | 6.2 |
| AWS | EU / UK | 1.3 | 5.2 | |
| AWS | South America / APAC / Middle East | 1.1 | 4.3 | |
| GPT-5 ↗ | AWS | North America | 20.5 | 163.6 |
| AWS | EU / UK | 17.3 | 138.5 | |
| AWS | South America / APAC / Middle East | 14.1 | 112.5 | |
| GPT-5-mini ↗ | AWS | North America | 4.1 | 32.7 |
| AWS | EU / UK | 3.5 | 27.7 | |
| AWS | South America / APAC / Middle East | 2.8 | 22.5 | |
| GPT-5-nano ↗ | AWS | North America | 0.82 | 6.5 |
| AWS | EU / UK | 0.69 | 5.5 | |
| AWS | South America / APAC / Middle East | 0.56 | 4.5 | |
| GPT-5-pro ↗ | AWS | North America | 231.8 | 1854.5 |
| AWS | EU / UK | 196.2 | 1569.2 | |
| AWS | South America / APAC / Middle East | 159.4 | 1275.0 | |
| GPT-OSS-20B ↗ | AWS | North America | 1.1 | 4.9 |
| AWS | EU / UK | 1.0 | 4.2 | |
| AWS | South America / APAC / Middle East | 0.79 | 3.4 | |
| GPT-OSS-120B ↗ | AWS | North America | 2.5 | 9.8 |
| AWS | EU / UK | 2.1 | 8.3 | |
| AWS | South America / APAC / Middle East | 1.7 | 6.8 | |
| GPT-5 Codex ↗ | AWS | North America | 20.5 | 163.6 |
| AWS | EU / UK | 17.3 | 138.5 | |
| AWS | South America / APAC / Middle East | 14.1 | 112.5 | |
| GPT-5.1 Codex Mini ↗ | AWS | North America | 5.5 | 36.4 |
| AWS | EU / UK | 4.6 | 30.8 | |
| AWS | South America / APAC / Middle East | 3.8 | 25 | |
| GPT-5.1 Codex ↗ | AWS | North America | 23.6 | 181.8 |
| AWS | EU / UK | 20 | 153.8 | |
| AWS | South America / APAC / Middle East | 16.3 | 125 | |
| GPT-5.1 ↗ | AWS | North America | 23.6 | 181.8 |
| AWS | EU / UK | 20 | 153.8 | |
| AWS | South America / APAC / Middle East | 16.3 | 125 | |
| GPT-5.1 Codex Max ↗ | AWS | North America | 22.7 | 181.8 |
| AWS | EU / UK | 19.2 | 153.8 | |
| AWS | South America / APAC / Middle East | 15.6 | 125.0 | |
| GPT-5.2 ↗ | AWS | North America | 31.8 | 254.5 |
| AWS | EU / UK | 26.9 | 215.4 | |
| AWS | South America / APAC / Middle East | 21.9 | 175.0 | |
| GPT-5.2 Codex ↗ | AWS | North America | 32.7 | 254.5 |
| AWS | EU / UK | 27.7 | 215.4 | |
| AWS | South America / APAC / Middle East | 22.5 | 175 | |
| GPT-5.2 Pro ↗ | AWS | North America | 381.8 | 3054.5 |
| AWS | EU / UK | 323.1 | 2584.6 | |
| AWS | South America / APAC / Middle East | 262.5 | 2100.0 | |
| GPT-5.3 Codex ↗ | AWS | North America | 31.8 | 254.5 |
| AWS | EU / UK | 26.9 | 215.4 | |
| AWS | South America / APAC / Middle East | 21.9 | 175.0 | |
| GPT-5.4 <= 272k tokens ↗ | AWS | North America | 45.5 | 272.7 |
| AWS | EU / UK | 38.5 | 230.8 | |
| AWS | South America / APAC / Middle East | 31.3 | 187.5 | |
| GPT-5.4 > 272k tokens ↗ | AWS | North America | 90.9 | 409.1 |
| AWS | EU / UK | 76.9 | 346.2 | |
| AWS | South America / APAC / Middle East | 62.5 | 281.3 | |
| GPT-5.4 Pro <= 272k tokens ↗ | AWS | North America | 545.5 | 3272.7 |
| AWS | EU / UK | 461.5 | 2769.2 | |
| AWS | South America / APAC / Middle East | 375.0 | 2250.0 | |
| GPT-5.4 Pro > 272k tokens ↗ | AWS | North America | 1090.9 | 4909.1 |
| AWS | EU / UK | 923.1 | 4153.8 | |
| AWS | South America / APAC / Middle East | 750.0 | 3375.0 | |
| GPT-5.4-mini ↗ | AWS | North America | 13.6 | 81.8 |
| AWS | EU / UK | 11.5 | 69.2 | |
| AWS | South America / APAC / Middle East | 9.4 | 56.3 | |
| GPT-5.4-nano ↗ | AWS | North America | 3.6 | 22.7 |
| AWS | EU / UK | 3.1 | 19.2 | |
| AWS | South America / APAC / Middle East | 2.5 | 15.6 | |
| GPT-5.5 <= 272k tokens ↗ | AWS | North America | 81.8 | 490.9 |
| AWS | EU / UK | 69.2 | 415.4 | |
| AWS | South America / APAC / Middle East | 56.3 | 337.5 | |
| GPT-5.5 > 272k tokens ↗ | AWS | North America | 163.6 | 736.4 |
| AWS | EU / UK | 138.5 | 623.1 | |
| AWS | South America / APAC / Middle East | 112.5 | 506.3 | |
| GPT Realtime ↗ | AWS | North America | 72.7 | 290.9 |
| AWS | EU / UK | 61.5 | 246.2 | |
| AWS | South America / APAC / Middle East | 50 | 200 | |
| GPT Realtime 1.5 ↗ | AWS | North America | 72.7 | 290.9 |
| AWS | EU / UK | 61.5 | 246.2 | |
| AWS | South America / APAC / Middle East | 50.0 | 200.0 | |
| o1 ↗ | AWS | North America | 232 | 927 |
| AWS | EU / UK | 196 | 785 | |
| AWS | South America / APAC / Middle East | 159 | 638 | |
| o1-mini ↗ | AWS | North America | 17 | 68 |
| AWS | EU / UK | 14 | 58 | |
| AWS | South America / APAC / Middle East | 12 | 47 | |
| o3 ↗ | AWS | North America | 31 | 124 |
| AWS | EU / UK | 26 | 105 | |
| AWS | South America / APAC / Middle East | 21 | 85 | |
| o3-mini ↗ | AWS | North America | 17 | 68 |
| AWS | EU / UK | 14 | 58 | |
| AWS | South America / APAC / Middle East | 12 | 47 | |
| o3-pro ↗ | AWS | North America | 345.5 | 1381.8 |
| AWS | EU / UK | 292.3 | 1169.2 | |
| AWS | South America / APAC / Middle East | 237.5 | 950.0 | |
| o4-mini ↗ | AWS | North America | 17 | 68 |
| AWS | EU / UK | 14 | 58 | |
| AWS | South America / APAC / Middle East | 12 | 47 | |
ada embedding ↗ |
AWS | North America | 1.68 | N/A |
| AWS | EU / UK | 1.42 | N/A | |
| AWS | South America / APAC / Middle East | 1.16 | N/A | |
| text-embedding-3-large ↗ | AWS | North America | 2.24 | N/A |
| AWS | EU / UK | 1.89 | N/A | |
| AWS | South America / APAC / Middle East | 1.54 | N/A | |
| text-embedding-3-small ↗ | AWS | North America | 0.34 | N/A |
| AWS | EU / UK | 0.29 | N/A | |
| AWS | South America / APAC / Middle East | 0.24 | N/A | |
| OpenAI Text Embedding 3 Large ↗ | AWS | North America | 2.2 | N/A |
| AWS | EU / UK | 1.9 | N/A | |
| AWS | South America / APAC / Middle East | 1.5 | N/A | |
| OpenAI Text Embedding 3 Small ↗ | AWS | North America | 0.3 | N/A |
| AWS | EU / UK | 0.3 | N/A | |
| AWS | South America / APAC / Middle East | 0.2 | N/A | |
| OpenAI Text Embedding Ada 002 ↗ | AWS | North America | 1.7 | N/A |
| AWS | EU / UK | 1.4 | N/A | |
| AWS | South America / APAC / Middle East | 1.2 | N/A | |
| Anthropic Claude 3 ↗ | AWS | North America | 52 | 258 |
| AWS | EU / UK | 44 | 218 | |
| AWS | South America / APAC / Middle East | 35 | 177 | |
| Anthropic Claude 3 Haiku ↗ | AWS | North America | 4.3 | 21.5 |
| AWS | EU / UK | 3.6 | 18.2 | |
| AWS | South America / APAC / Middle East | 3.0 | 14.8 | |
| Anthropic Claude 3.5 Haiku ↗ | AWS | North America | 12 | 62 |
| AWS | EU / UK | 10 | 52 | |
| AWS | South America / APAC / Middle East | 9 | 43 | |
| Anthropic Claude 4.5 Haiku ↗ | AWS | North America | 17.3 | 86.4 |
| AWS | EU / UK | 14.6 | 73.1 | |
| AWS | South America / APAC / Middle East | 11.9 | 59.4 | |
| Anthropic Claude 3.5 Sonnet ↗ | AWS | North America | 52 | 258 |
| AWS | EU / UK | 44 | 218 | |
| AWS | South America / APAC / Middle East | 35 | 177 | |
| Anthropic Claude 3.5 Sonnet v2 ↗ | AWS | North America | 46 | 232 |
| AWS | EU / UK | 39 | 196 | |
| AWS | South America / APAC / Middle East | 32 | 159 | |
| Anthropic Claude 4 Sonnet ↗ | AWS | North America | 46.4 | 231.8 |
| AWS | EU / UK | 39.2 | 196.2 | |
| AWS | South America / APAC / Middle East | 31.9 | 159.4 | |
| Anthropic Claude 4.5 Sonnet ↗ | AWS | North America | 51.8 | 259.1 |
| AWS | EU / UK | 43.8 | 219.2 | |
| AWS | South America / APAC / Middle East | 35.6 | 178.1 | |
| Anthropic Claude 4.6 Sonnet ↗ | AWS | North America | 54.5 | 272.7 |
| AWS | EU / UK | 46.2 | 230.8 | |
| AWS | South America / APAC / Middle East | 37.5 | 187.5 | |
| Anthropic Claude 4 Opus ↗ | AWS | North America | 232 | 1159 |
| AWS | EU / UK | 196 | 981 | |
| AWS | South America / APAC / Middle East | 159 | 797 | |
| Anthropic Claude 4.1 Opus ↗ | AWS | North America | 259 | 1295 |
| AWS | EU / UK | 219 | 1096 | |
| AWS | South America / APAC / Middle East | 178 | 891 | |
| Anthropic Claude 4.5 Opus ↗ | AWS | North America | 90.9 | 454.5 |
| AWS | EU / UK | 76.9 | 384.6 | |
| AWS | South America / APAC / Middle East | 62.5 | 312.5 | |
| Anthropic Claude 4.6 Opus ↗ | AWS | North America | 90.9 | 454.5 |
| AWS | EU / UK | 76.9 | 384.6 | |
| AWS | South America / APAC / Middle East | 62.5 | 312.5 | |
| Anthropic Claude 4.7 Opus ↗ | AWS | North America | 90.9 | 454.5 |
| AWS | EU / UK | 76.9 | 384.6 | |
| AWS | South America / APAC / Middle East | 62.5 | 312.5 | |
| Anthropic Claude 4.8 Opus ↗ | AWS | North America | 90.9 | 454.5 |
| AWS | EU / UK | 76.9 | 384.6 | |
| AWS | South America / APAC / Middle East | 62.5 | 312.5 | |
| Mistral Small 24B ↗ | AWS | North America | 158 | 525 |
| AWS | EU / UK | 133 | 444 | |
| AWS | South America / APAC / Middle East | 108 | 361 | |
| Mistral Small 24B Instruct ↗ | AWS | North America | 157.5 | 525 |
| AWS | EU / UK | 133.3 | 444.2 | |
| AWS | South America / APAC / Middle East | 108.3 | 360.9 | |
| Llama 3.1_8B ↗ | AWS | North America | 158 | 525 |
| AWS | EU / UK | 133 | 444 | |
| AWS | South America / APAC / Middle East | 108 | 361 | |
| Llama 3.3_70B ↗ | AWS | North America | 158 | 525 |
| AWS | EU / UK | 133 | 444 | |
| AWS | South America / APAC / Middle East | 108 | 361 | |
| Llama 3.3 70B Instruct ↗ | AWS | North America | 157.5 | 525 |
| AWS | EU / UK | 133.3 | 444.2 | |
| AWS | South America / APAC / Middle East | 108.3 | 360.9 | |
| Llama 4 Scout_17B 16E Instruct ↗ | AWS | North America | 1.5 | 5.7 |
| AWS | EU / UK | 1.2 | 4.8 | |
| AWS | South America / APAC / Middle East | 1.0 | 3.9 | |
| Llama 4 Maverick_17B 128E Instruct ↗ | AWS | North America | 2.1 | 8.4 |
| AWS | EU / UK | 1.8 | 7.1 | |
| AWS | South America / APAC / Middle East | 1.4 | 5.8 | |
| Nemotron 3 Nano 30B ↗ | AWS | North America | 1.1 | 4.4 |
| AWS | EU / UK | 0.9 | 3.7 | |
| AWS | South America / APAC / Middle East | 0.8 | 3.0 | |
| Nemotron 3 Super 120B ↗ | AWS | North America | 2.7 | 11.8 |
| AWS | EU / UK | 2.3 | 10.0 | |
| AWS | South America / APAC / Middle East | 1.9 | 8.1 | |
| Snowflake Arctic Embed ↗ | AWS | North America | 38 | 38 |
| AWS | EU / UK | 32 | 32 | |
| AWS | South America / APAC / Middle East | 26 | 26 | |
| Snowflake Arctic Embed M ↗ | AWS | North America | 38.2 | 38.2 |
| AWS | EU / UK | 32.3 | 32.3 | |
| AWS | South America / APAC / Middle East | 26.2 | 26.2 | |
| Gemini 1.5 Flash ↗ | AWS | North America | 1.3 | 5.2 |
| AWS | EU / UK | 1.1 | 4.4 | |
| AWS | South America / APAC / Middle East | 0.9 | 3.5 | |
| Gemini 1.5 Pro ↗ | AWS | North America | 21 | 86 |
| AWS | EU / UK | 18 | 73 | |
| AWS | South America / APAC / Middle East | 15 | 59 | |
| Gemini 2.0 Flash ↗ | AWS | North America | 1.5 | 6.2 |
| AWS | EU / UK | 1.3 | 5.2 | |
| AWS | South America / APAC / Middle East | 1.1 | 4.3 | |
| Gemini 2.5 Flash Lite ↗ | AWS | North America | 1.7 | 6.9 |
| AWS | EU / UK | 1.5 | 5.8 | |
| AWS | South America / APAC / Middle East | 1.2 | 4.8 | |
| Gemini 2.5 Flash ↗ | AWS | North America | 5.2 | 43.2 |
| AWS | EU / UK | 4.4 | 36.5 | |
| AWS | South America / APAC / Middle East | 3.6 | 29.7 | |
| Gemini 2.5 Pro <= 200k tokens ↗ | AWS | North America | 21.6 | 172.7 |
| AWS | EU / UK | 18.3 | 146.2 | |
| AWS | South America / APAC / Middle East | 14.8 | 118.8 | |
| Gemini 2.5 Pro > 200k tokens ↗ | AWS | North America | 43.2 | 259.1 |
| AWS | EU / UK | 36.5 | 219.1 | |
| AWS | South America / APAC / Middle East | 29.7 | 178.1 | |
| Gemini 3 Flash ↗ | AWS | North America | 9.1 | 54.5 |
| AWS | EU / UK | 7.7 | 46.2 | |
| AWS | South America / APAC / Middle East | 6.3 | 37.5 | |
| Gemini 3 Pro <= 200k tokens ↗ | AWS | North America | 34.5 | 207.3 |
| AWS | EU / UK | 29.2 | 175.4 | |
| AWS | South America / APAC / Middle East | 23.8 | 142.5 | |
| Gemini 3 Pro > 200k tokens ↗ | AWS | North America | 69.1 | 310.9 |
| AWS | EU / UK | 58.5 | 263.1 | |
| AWS | South America / APAC / Middle East | 47.5 | 213.8 | |
| Gemini 3.1 Pro <= 200k tokens ↗ | AWS | North America | 34.5 | 207.3 |
| AWS | EU / UK | 29.2 | 175.4 | |
| AWS | South America / APAC / Middle East | 23.8 | 142.5 | |
| Gemini 3.1 Pro > 200k tokens ↗ | AWS | North America | 69.1 | 310.9 |
| AWS | EU / UK | 58.5 | 263.1 | |
| AWS | South America / APAC / Middle East | 47.5 | 213.8 | |
| Gemini 3.1 Flash Lite ↗ | AWS | North America | 4.5 | 27.3 |
| AWS | EU / UK | 3.8 | 23.1 | |
| AWS | South America / APAC / Middle East | 3.1 | 18.8 | |
| Gemini Embedding 2 Text ↗ | AWS | North America | 3.6 | N/A |
| AWS | EU / UK | 3.1 | N/A | |
| AWS | South America / APAC / Middle East | 2.5 | N/A | |
| Document Information Extraction | AWS | North America | 182 | N/A |
| AWS | EU / UK | 154 | N/A | |
| AWS | South America / APAC / Middle East | 125 | N/A |
中文翻译¶
AIP 计算用量¶
AIP 计算用量涉及大型语言模型(LLM,large language models)。从根本上说,LLM 接收文本作为输入,并以文本作为输出进行响应。输入和输出的文本量以 token 为单位进行衡量。LLM 的计算用量 以每一定数量 token 的计算秒数(compute-seconds)来衡量。不同模型的计算用量费率可能不同,详见下文。
AIP 中的 Token¶
Token 是 LLM 用于处理和理解输入文本的基本单位。根据语言和具体模型的不同,一个 token 可以短至单个字符,也可以长至整个单词。
重要的是,token 与单词并非一一对应。例如,常见单词可能是一个 token,但较长或不常见的单词可能会被拆分为多个 token。甚至标点符号和空格也可能被视为 token。
不同的模型提供商对 token 的定义各不相同;例如,OpenAI ↗ 和 Anthropic ↗。平均而言,一个 token 大约包含 4 个字符,一个字符可以是单个字母或标点符号。
在 AIP 中,向 LLM 发送提示(prompt)并从 LLM 接收提示的应用程序会消耗 token。每个提示和响应都由可衡量的 token 数量组成。这些 token 可以发送给多个 LLM 提供商;由于提供商之间的差异,这些 token 会被转换为计算秒数,以匹配底层模型提供商的价格。
所有提供 LLM 支持功能的应用程序在使用时都会消耗 token。请参阅以下列表,了解当您与这些应用程序的 LLM 支持功能交互时可能消耗 token 的应用程序。
- AIP Assist
- AIP Logic
- AIP Error Enhancer
- AIP Code Assist
- AIP Analyst
- AI FDE
- Workshop LLM 支持工具
- Quiver LLM 支持工具
- Pipeline Builder LLM 支持工具
- 直接调用语言模型服务(包括 Python 和 TypeScript 库)
AIP 将文本直接路由到后端 LLM,这些 LLM 自行执行分词(tokenization)。文本的大小将决定后端模型用于提供响应所消耗的计算量。
以下示例句子被发送到 GPT-4o 模型。
AIP incorporates all of Palantir's advanced security measures for the protection of sensitive data in compliance with industry regulations.
该句子包含 140 个字符,将按以下方式进行分词,每个 token 之间用 | 字符分隔。请注意,token 并不总是等同于一个单词;有些单词会被拆分为多个 token,如下例中的 AIP 和 Palantir。
A|IP| incorporates| all| of| Pal|ant|ir|'s| advanced| security| measures| for| the| protection| of| sensitive| data| in| compliance| with| industry| regulations|.
该句子包含 24 个 token,将消耗以下数量的计算秒数:
计算秒数 = 24 个 token * 43 计算秒数 / 10,000 个 token
计算秒数 = 24 * 43 / 10,000
计算秒数 = 0.1032
上述句子中的 token 数量和字符数已通过 OpenAI 的 Tokenizer 功能 ↗ 验证。
了解 AIP 计算用量的驱动因素¶
由 LLM token 产生的计算秒数用量直接归属于请求该用量的单个应用程序资源。例如,如果您使用 AIP 自动解释 Pipeline Builder 中的管道,那么 LLM 生成该解释所使用的计算秒数将归属于该特定管道。这在平台中都是如此;牢记这一点将帮助您跟踪 token 的使用位置。
在某些情况下,计算用量无法归属于平台中的单个资源;例如 AIP Assist 和 Error Explainer 等。当用量无法归属于单个资源时,token 将归属于发起 token 使用的用户文件夹。
我们建议您注意代表您发送给 LLM 的 token。通常,您在使用 LLM 时包含的信息越多,消耗的计算秒数就越多。例如,以下场景描述了使用计算秒数的不同方式。
- 在 Pipeline Builder 中,您可以要求 AIP 解释您的转换节点;所选节点的数量会影响 LLM 生成响应所使用的 token 数量,从而影响计算秒数的用量。这是因为随着节点数量的增加,LLM 必须处理的关于这些节点配置的文本量也会增加。
- 在 AIP Assist 中,要求 LLM 生成大块代码需要更多的输出 token。较短的响应使用较少的 token,因此计算量也更少。
- 在 AIP Logic 中,在提示中发送大量文本需要更多的 token,因此需要更多的计算秒数。
导出 AIP Token 用量数据¶
要详细分析您的注册(enrollment)的 LLM 用量,您可以从控制面板的内部数据集导出部分导出 AIP Token Usage 数据集。该数据集提供按模型和资源划分的每日 token 消耗明细,以及相应的计算秒数和货币用量。有关更多信息,请参阅内部数据集导出。
使用 AIP 衡量计算¶
:::callout{theme="neutral"} 如果您与 Palantir 签订了企业合同,请在进行计算用量计算之前联系您的 Palantir 代表。 :::
| 模型 | Foundry 云提供商 | Foundry 区域 | 每 10k 输入 token 的计算秒数 | 每 10k 输出 token 的计算秒数 |
|---|---|---|---|---|
| Grok-2 ↗ | AWS | 北美 | 36 | 182 |
| AWS | 欧盟 / 英国 | 31 | 154 | |
| AWS | 南美 / 亚太 / 中东 | 25 | 125 | |
| Grok-2-Vision ↗ | AWS | 北美 | 36 | 182 |
| AWS | 欧盟 / 英国 | 31 | 154 | |
| AWS | 南美 / 亚太 / 中东 | 25 | 125 | |
| Grok-3 ↗ | AWS | 北美 | 55 | 273 |
| AWS | 欧盟 / 英国 | 46 | 231 | |
| AWS | 南美 / 亚太 / 中东 | 38 | 188 | |
| Grok-3-Mini-Reasoning ↗ | AWS | 北美 | 5.5 | 9.1 |
| AWS | 欧盟 / 英国 | 4.6 | 7.7 | |
| AWS | 南美 / 亚太 / 中东 | 3.8 | 6.3 | |
| Grok-4 <= 128k tokens ↗ | AWS | 北美 | 54.5 | 272.7 |
| AWS | 欧盟 / 英国 | 46.2 | 230.8 | |
| AWS | 南美 / 亚太 / 中东 | 37.5 | 187.5 | |
| Grok-4 > 128k tokens ↗ | AWS | 北美 | 109.1 | 545.5 |
| AWS | 欧盟 / 英国 | 92.3 | 461.5 | |
| AWS | 南美 / 亚太 / 中东 | 75.0 | 375.0 | |
| Grok-4 Fast Reasoning <= 128k tokens ↗ | AWS | 北美 | 3.6 | 9.1 |
| AWS | 欧盟 / 英国 | 3.1 | 7.7 | |
| AWS | 南美 / 亚太 / 中东 | 2.5 | 6.3 | |
| Grok-4 Fast Reasoning > 128k tokens ↗ | AWS | 北美 | 7.3 | 18.2 |
| AWS | 欧盟 / 英国 | 6.2 | 15.4 | |
| AWS | 南美 / 亚太 / 中东 | 5.0 | 12.5 | |
| Grok-4 Fast Non-Reasoning <= 128k tokens ↗ | AWS | 北美 | 3.6 | 9.1 |
| AWS | 欧盟 / 英国 | 3.1 | 7.7 | |
| AWS | 南美 / 亚太 / 中东 | 2.5 | 6.3 | |
| Grok-4 Fast Non-Reasoning > 128k tokens ↗ | AWS | 北美 | 7.3 | 18.2 |
| AWS | 欧盟 / 英国 | 6.2 | 15.4 | |
| AWS | 南美 / 亚太 / 中东 | 5.0 | 12.5 | |
| Grok Code Fast 1 ↗ | AWS | 北美 | 3.6 | 27.3 |
| AWS | 欧盟 / 英国 | 3.1 | 23.1 | |
| AWS | 南美 / 亚太 / 中东 | 2.5 | 18.8 | |
| Grok-4.1 Fast Non-Reasoning ↗ | AWS | 北美 | 3.6 | 9.1 |
| AWS | 欧盟 / 英国 | 3.1 | 7.7 | |
| AWS | 南美 / 亚太 / 中东 | 2.5 | 6.3 | |
| Grok-4.1 Fast Reasoning ↗ | AWS | 北美 | 3.6 | 9.1 |
| AWS | 欧盟 / 英国 | 3.1 | 7.7 | |
| AWS | 南美 / 亚太 / 中东 | 2.5 | 6.3 | |
| Grok-4.20 Reasoning <= 200k tokens ↗ | AWS | 北美 | 36.4 | 109.1 |
| AWS | 欧盟 / 英国 | 30.8 | 92.3 | |
| AWS | 南美 / 亚太 / 中东 | 25.0 | 75.0 | |
| Grok-4.20 Reasoning > 200k tokens ↗ | AWS | 北美 | 72.7 | 218.2 |
| AWS | 欧盟 / 英国 | 61.5 | 184.6 | |
| AWS | 南美 / 亚太 / 中东 | 50.0 | 150.0 | |
| Grok-4.20 Non-Reasoning <= 200k tokens ↗ | AWS | 北美 | 36.4 | 109.1 |
| AWS | 欧盟 / 英国 | 30.8 | 92.3 | |
| AWS | 南美 / 亚太 / 中东 | 25.0 | 75.0 | |
| Grok-4.20 Non-Reasoning > 200k tokens ↗ | AWS | 北美 | 72.7 | 218.2 |
| AWS | 欧盟 / 英国 | 61.5 | 184.6 | |
| AWS | 南美 / 亚太 / 中东 | 50.0 | 150.0 | |
| GPT-4.5 ↗ | AWS | 北美 | 1159.1 | 2318.2 |
| AWS | 欧盟 / 英国 | 980.8 | 1961.5 | |
| AWS | 南美 / 亚太 / 中东 | 796.9 | 1593.8 | |
| GPT-4o ↗ | AWS | 北美 | 43 | 172 |
| AWS | 欧盟 / 英国 | 36 | 145 | |
| AWS | 南美 / 亚太 / 中东 | 30 | 118 | |
| GPT-4o mini ↗ | AWS | 北美 | 2.6 | 10.3 |
| AWS | 欧盟 / 英国 | 2.2 | 8.7 | |
| AWS | 南美 / 亚太 / 中东 | 1.8 | 7.1 | |
| GPT-4.1 ↗ | AWS | 北美 | 31 | 124 |
| AWS | 欧盟 / 英国 | 26 | 105 | |
| AWS | 南美 / 亚太 / 中东 | 21 | 85 | |
| GPT-4.1-mini ↗ | AWS | 北美 | 6.2 | 24.7 |
| AWS | 欧盟 / 英国 | 5.2 | 20.9 | |
| AWS | 南美 / 亚太 / 中东 | 4.3 | 17 | |
| GPT-4.1-nano ↗ | AWS | 北美 | 1.5 | 6.2 |
| AWS | 欧盟 / 英国 | 1.3 | 5.2 | |
| AWS | 南美 / 亚太 / 中东 | 1.1 | 4.3 | |
| GPT-5 ↗ | AWS | 北美 | 20.5 | 163.6 |
| AWS | 欧盟 / 英国 | 17.3 | 138.5 | |
| AWS | 南美 / 亚太 / 中东 | 14.1 | 112.5 | |
| GPT-5-mini ↗ | AWS | 北美 | 4.1 | 32.7 |
| AWS | 欧盟 / 英国 | 3.5 | 27.7 | |
| AWS | 南美 / 亚太 / 中东 | 2.8 | 22.5 | |
| GPT-5-nano ↗ | AWS | 北美 | 0.82 | 6.5 |
| AWS | 欧盟 / 英国 | 0.69 | 5.5 | |
| AWS | 南美 / 亚太 / 中东 | 0.56 | 4.5 | |
| GPT-5-pro ↗ | AWS | 北美 | 231.8 | 1854.5 |
| AWS | 欧盟 / 英国 | 196.2 | 1569.2 | |
| AWS | 南美 / 亚太 / 中东 | 159.4 | 1275.0 | |
| GPT-OSS-20B ↗ | AWS | 北美 | 1.1 | 4.9 |
| AWS | 欧盟 / 英国 | 1.0 | 4.2 | |
| AWS | 南美 / 亚太 / 中东 | 0.79 | 3.4 | |
| GPT-OSS-120B ↗ | AWS | 北美 | 2.5 | 9.8 |
| AWS | 欧盟 / 英国 | 2.1 | 8.3 | |
| AWS | 南美 / 亚太 / 中东 | 1.7 | 6.8 | |
| GPT-5 Codex ↗ | AWS | 北美 | 20.5 | 163.6 |
| AWS | 欧盟 / 英国 | 17.3 | 138.5 | |
| AWS | 南美 / 亚太 / 中东 | 14.1 | 112.5 | |
| GPT-5.1 Codex Mini ↗ | AWS | 北美 | 5.5 | 36.4 |
| AWS | 欧盟 / 英国 | 4.6 | 30.8 | |
| AWS | 南美 / 亚太 / 中东 | 3.8 | 25 | |
| GPT-5.1 Codex ↗ | AWS | 北美 | 23.6 | 181.8 |
| AWS | 欧盟 / 英国 | 20 | 153.8 | |
| AWS | 南美 / 亚太 / 中东 | 16.3 | 125 | |
| GPT-5.1 ↗ | AWS | 北美 | 23.6 | 181.8 |
| AWS | 欧盟 / 英国 | 20 | 153.8 | |
| AWS | 南美 / 亚太 / 中东 | 16.3 | 125 | |
| GPT-5.1 Codex Max ↗ | AWS | 北美 | 22.7 | 181.8 |
| AWS | 欧盟 / 英国 | 19.2 | 153.8 | |
| AWS | 南美 / 亚太 / 中东 | 15.6 | 125.0 | |
| GPT-5.2 ↗ | AWS | 北美 | 31.8 | 254.5 |
| AWS | 欧盟 / 英国 | 26.9 | 215.4 | |
| AWS | 南美 / 亚太 / 中东 | 21.9 | 175.0 | |
| GPT-5.2 Codex ↗ | AWS | 北美 | 32.7 | 254.5 |
| AWS | 欧盟 / 英国 | 27.7 | 215.4 | |
| AWS | 南美 / 亚太 / 中东 | 22.5 | 175 | |
| GPT-5.2 Pro ↗ | AWS | 北美 | 381.8 | 3054.5 |
| AWS | 欧盟 / 英国 | 323.1 | 2584.6 | |
| AWS | 南美 / 亚太 / 中东 | 262.5 | 2100.0 | |
| GPT-5.3 Codex ↗ | AWS | 北美 | 31.8 | 254.5 |
| AWS | 欧盟 / 英国 | 26.9 | 215.4 | |
| AWS | 南美 / 亚太 / 中东 | 21.9 | 175.0 | |
| GPT-5.4 <= 272k tokens ↗ | AWS | 北美 | 45.5 | 272.7 |
| AWS | 欧盟 / 英国 | 38.5 | 230.8 | |
| AWS | 南美 / 亚太 / 中东 | 31.3 | 187.5 | |
| GPT-5.4 > 272k tokens ↗ | AWS | 北美 | 90.9 | 409.1 |
| AWS | 欧盟 / 英国 | 76.9 | 346.2 | |
| AWS | 南美 / 亚太 / 中东 | 62.5 | 281.3 | |
| GPT-5.4 Pro <= 272k tokens ↗ | AWS | 北美 | 545.5 | 3272.7 |
| AWS | 欧盟 / 英国 | 461.5 | 2769.2 | |
| AWS | 南美 / 亚太 / 中东 | 375.0 | 2250.0 | |
| GPT-5.4 Pro > 272k tokens ↗ | AWS | 北美 | 1090.9 | 4909.1 |
| AWS | 欧盟 / 英国 | 923.1 | 4153.8 | |
| AWS | 南美 / 亚太 / 中东 | 750.0 | 3375.0 | |
| GPT-5.4-mini ↗ | AWS | 北美 | 13.6 | 81.8 |
| AWS | 欧盟 / 英国 | 11.5 | 69.2 | |
| AWS | 南美 / 亚太 / 中东 | 9.4 | 56.3 | |
| GPT-5.4-nano ↗ | AWS | 北美 | 3.6 | 22.7 |
| AWS | 欧盟 / 英国 | 3.1 | 19.2 | |
| AWS | 南美 / 亚太 / 中东 | 2.5 | 15.6 | |
| GPT-5.5 <= 272k tokens ↗ | AWS | 北美 | 81.8 | 490.9 |
| AWS | 欧盟 / 英国 | 69.2 | 415.4 | |
| AWS | 南美 / 亚太 / 中东 | 56.3 | 337.5 | |
| GPT-5.5 > 272k tokens ↗ | AWS | 北美 | 163.6 | 736.4 |
| AWS | 欧盟 / 英国 | 138.5 | 623.1 | |
| AWS | 南美 / 亚太 / 中东 | 112.5 | 506.3 | |
| GPT Realtime ↗ | AWS | 北美 | 72.7 | 290.9 |
| AWS | 欧盟 / 英国 | 61.5 | 246.2 | |
| AWS | 南美 / 亚太 / 中东 | 50 | 200 | |
| GPT Realtime 1.5 ↗ | AWS | 北美 | 72.7 | 290.9 |
| AWS | 欧盟 / 英国 | 61.5 | 246.2 | |
| AWS | 南美 / 亚太 / 中东 | 50.0 | 200.0 | |
| o1 ↗ | AWS | 北美 | 232 | 927 |
| AWS | 欧盟 / 英国 | 196 | 785 | |
| AWS | 南美 / 亚太 / 中东 | 159 | 638 | |
| o1-mini ↗ | AWS | 北美 | 17 | 68 |
| AWS | 欧盟 / 英国 | 14 | 58 | |
| AWS | 南美 / 亚太 / 中东 | 12 | 47 | |
| o3 ↗ | AWS | 北美 | 31 | 124 |
| AWS | 欧盟 / 英国 | 26 | 105 | |
| AWS | 南美 / 亚太 / 中东 | 21 | 85 | |
| o3-mini ↗ | AWS | 北美 | 17 | 68 |
| AWS | 欧盟 / 英国 | 14 | 58 | |
| AWS | 南美 / 亚太 / 中东 | 12 | 47 | |
| o3-pro ↗ | AWS | 北美 | 345.5 | 1381.8 |
| AWS | 欧盟 / 英国 | 292.3 | 1169.2 | |
| AWS | 南美 / 亚太 / 中东 | 237.5 | 950.0 | |
| o4-mini ↗ | AWS | 北美 | 17 | 68 |
| AWS | 欧盟 / 英国 | 14 | 58 | |
| AWS | 南美 / 亚太 / 中东 | 12 | 47 | |
ada embedding ↗ |
AWS | 北美 | 1.68 | 不适用 |
| AWS | 欧盟 / 英国 | 1.42 | 不适用 | |
| AWS | 南美 / 亚太 / 中东 | 1.16 | 不适用 | |
| text-embedding-3-large ↗ | AWS | 北美 | 2.24 | 不适用 |
| AWS | 欧盟 / 英国 | 1.89 | 不适用 | |
| AWS | 南美 / 亚太 / 中东 | 1.54 | 不适用 | |
| text-embedding-3-small ↗ | AWS | 北美 | 0.34 | 不适用 |
| AWS | 欧盟 / 英国 | 0.29 | 不适用 | |
| AWS | 南美 / 亚太 / 中东 | 0.24 | 不适用 | |
| OpenAI Text Embedding 3 Large ↗ | AWS | 北美 | 2.2 | 不适用 |
| AWS | 欧盟 / 英国 | 1.9 | 不适用 | |
| AWS | 南美 / 亚太 / 中东 | 1.5 | 不适用 | |
| OpenAI Text Embedding 3 Small ↗ | AWS | 北美 | 0.3 | 不适用 |
| AWS | 欧盟 / 英国 | 0.3 | 不适用 | |
| AWS | 南美 / 亚太 / 中东 | 0.2 | 不适用 | |
| OpenAI Text Embedding Ada 002 ↗ | AWS | 北美 | 1.7 | 不适用 |
| AWS | 欧盟 / 英国 | 1.4 | 不适用 | |
| AWS | 南美 / 亚太 / 中东 | 1.2 | 不适用 | |
| Anthropic Claude 3 ↗ | AWS | 北美 | 52 | 258 |
| AWS | 欧盟 / 英国 | 44 | 218 | |
| AWS | 南美 / 亚太 / 中东 | 35 | 177 | |
| Anthropic Claude 3 Haiku ↗ | AWS | 北美 | 4.3 | 21.5 |
| AWS | 欧盟 / 英国 | 3.6 | 18.2 | |
| AWS | 南美 / 亚太 / 中东 | 3.0 | 14.8 | |
| Anthropic Claude 3.5 Haiku ↗ | AWS | 北美 | 12 | 62 |
| AWS | 欧盟 / 英国 | 10 | 52 | |
| AWS | 南美 / 亚太 / 中东 | 9 | 43 | |
| Anthropic Claude 4.5 Haiku ↗ | AWS | 北美 | 17.3 | 86.4 |
| AWS | 欧盟 / 英国 | 14.6 | 73.1 | |
| AWS | 南美 / 亚太 / 中东 | 11.9 | 59.4 | |
| Anthropic Claude 3.5 Sonnet ↗ | AWS | 北美 | 52 | 258 |
| AWS | 欧盟 / 英国 | 44 | 218 | |
| AWS | 南美 / 亚太 / 中东 | 35 | 177 | |
| Anthropic Claude 3.5 Sonnet v2 ↗ | AWS | 北美 | 46 | 232 |
| AWS | 欧盟 / 英国 | 39 | 196 | |
| AWS | 南美 / 亚太 / 中东 | 32 | 159 | |
| Anthropic Claude 4 Sonnet ↗ | AWS | 北美 | 46.4 | 231.8 |
| AWS | 欧盟 / 英国 | 39.2 | 196.2 | |
| AWS | 南美 / 亚太 / 中东 | 31.9 | 159.4 | |
| Anthropic Claude 4.5 Sonnet ↗ | AWS | 北美 | 51.8 | 259.1 |
| AWS | 欧盟 / 英国 | 43.8 | 219.2 | |
| AWS | 南美 / 亚太 / 中东 | 35.6 | 178.1 | |
| Anthropic Claude 4.6 Sonnet ↗ | AWS | 北美 | 54.5 | 272.7 |
| AWS | 欧盟 / 英国 | 46.2 | 230.8 | |
| AWS | 南美 / 亚太 / 中东 | 37.5 | 187.5 | |
| Anthropic Claude 4 Opus ↗ | AWS | 北美 | 232 | 1159 |
| AWS | 欧盟 / 英国 | 196 | 981 | |
| AWS | 南美 / 亚太 / 中东 | 159 | 797 | |
| Anthropic Claude 4.1 Opus ↗ | AWS | 北美 | 259 | 1295 |
| AWS | 欧盟 / 英国 | 219 | 1096 | |
| AWS | 南美 / 亚太 / 中东 | 178 | 891 | |
| Anthropic Claude 4.5 Opus ↗ | AWS | 北美 | 90.9 | 454.5 |
| AWS | 欧盟 / 英国 | 76.9 | 384.6 | |
| AWS | 南美 / 亚太 / 中东 | 62.5 | 312.5 | |
| Anthropic Claude 4.6 Opus ↗ | AWS | 北美 | 90.9 | 454.5 |
| AWS | 欧盟 / 英国 | 76.9 | 384.6 | |
| AWS | 南美 / 亚太 / 中东 | 62.5 | 312.5 | |
| Anthropic Claude 4.7 Opus ↗ | AWS | 北美 | 90.9 | 454.5 |
| AWS | 欧盟 / 英国 | 76.9 | 384.6 | |
| AWS | 南美 / 亚太 / 中东 | 62.5 | 312.5 | |
| Anthropic Claude 4.8 Opus ↗ | AWS | 北美 | 90.9 | 454.5 |
| AWS | 欧盟 / 英国 | 76.9 | 384.6 | |
| AWS | 南美 / 亚太 / 中东 | 62.5 | 312.5 | |
| Mistral Small 24B ↗ | AWS | 北美 | 158 | 525 |
| AWS | 欧盟 / 英国 | 133 | 444 | |
| AWS | 南美 / 亚太 / 中东 | 108 | 361 | |
| Mistral Small 24B Instruct ↗ | AWS | 北美 | 157.5 | 525 |
| AWS | 欧盟 / 英国 | 133.3 | 444.2 | |
| AWS | 南美 / 亚太 / 中东 | 108.3 | 360.9 | |
| Llama 3.1_8B ↗ | AWS | 北美 | 158 | 525 |
| AWS | 欧盟 / 英国 | 133 | 444 | |
| AWS | 南美 / 亚太 / 中东 | 108 | 361 | |
| Llama 3.3_70B ↗ | AWS | 北美 | 158 | 525 |
| AWS | 欧盟 / 英国 | 133 | 444 | |
| AWS | 南美 / 亚太 / 中东 | 108 | 361 | |
| Llama 3.3 70B Instruct ↗ | AWS | 北美 | 157.5 | 525 |
| AWS | 欧盟 / 英国 | 133.3 | 444.2 | |
| AWS | 南美 / 亚太 / 中东 | 108.3 |