LLM Price Compare

LANGUAGE
  • English
  • 简体中文

LLM Pricing

LLM Price Compare is your one-stop destination for comparing and calculating the latest prices for large language model (LLM) APIs from leading providers such as OpenAI GPT-4, Anthropic Claude, Google Gemini, Mate Llama 3, and more. Our streamlined LLM Price Check tool allows you to easily compare prices, helping you to optimize your AI budget efficiently. With LLM Price Compare, you can make informed decisions and get the best value for your investment in AI technology. Join us today and start saving money on your LLM API usage!

  • Foreign
  • Domestic
Model
Provider
Quality
Context
Input $/1M
Output $/1M
Knowledge
Free trial
gpt-4-turbo-2024-04-09OpenAI100128K10302023-12
gpt-4OpenAI908K30602021-09
gpt-4-32kOpenAI32K601202021-09
gpt-3.5-turbo-0125OpenAI6716K0.51.52021-09
gpt-3.5-turbo-instructOpenAI604K1.522021-09
claude-3-opusAnthropic100200K15752023-08
claude-3-sonnetAnthropic85200K3152023-08
claude-3-haikuAnthropic78200K0.251.252023-08
claude-2.1Anthropic66200K824
claude-2.0Anthropic72100K824
claude-instant-1.2Anthropic65100K0.82.4
llama-3-70b-instructDeepinfra888K0.590.792023-12
llama-3-8b-instructDeepinfra588K0.10.12023-12
gemini-proGoogle6632K0.130.382023-04
gemini-1.5-proGoogle881M7212023-04
gemma-7b-itDeepinfra598K0.10.12024-02
mistral-largeMistral8432K824
mistral-mediumMistral7632K2.78.1
mistral-smallMistral7332K26
mixtral-8x7bMistral6832K0.70.72023-12
mistral-7bMistral4032K0.250.252023-12
command-r-plusCohere80128K3152024-03
command-rCohere674K0.51.52024-03
commandCohere4K0.30.6
pplx-70b-onlinePerplexity454K11
pplx-7b-onlinePerplexity354K0.20.2
openchat-7bOpenChat568K0.130.13
llama-3-70bGroq888K0.590.79
llama-3-8bGroq588K0.050.1
llama-2-70bGroq524K0.640.8
llama-2-7bGroq272K0.10.1
mixtral-8x7bGroq6832K0.270.27
gemma-7bGroq598K0.10.1
llama-2-7b-chat-fp16Cloudflare3K0.566.66
llama-2-7b-chat-int8Cloudflare2 K0.160.24
mistral-7b-instructCloudflare32K0.110.19
llama-3-soliloquy-8bLynn24K0.10.1
meta-llama-3-70b-instructReplicate8K0.652.75
meta-llama-3-8b-instructReplicate8K0.050.25
llama-2-13bReplicate4K0.10.5
llama-2-13bReplicate4K0.10.5
llama-2-7bReplicate4K0.050.25
llama-2-70bReplicate4K0.652.75
mistral-7b-v0.1Replicate32K0.050.25
mistral-7b-instruct-v0.2Replicate32K0.050.25
mixtral-8x7b-instruct-v0.1Replicate32K0.31
jurassic-2-ultraAWS32K18.818.8
jurassic-2-midAWS32K12.512.5
titan-text-liteAWS32K0.30.4
titan-text-expressAWS32K0.81.6
claude-instantAWS6532K0.82.4
claude-3-sonnetAWS8532K315
claude-3-haikuAWS7832K0.251.25
commandAWS32K1.52
command-lightAWS32K0.30.6
llama-2-chat-13bAWS3732K0.751
llama-2-chat-70bAWS5232K1.952.56
mistral-7bAWS4032K0.150.2
mistral-8x7bAWS32K0.450.7
gpt-4-0125-previewOpenAI100128K10302023-12
gpt-4-1106-previewOpenAI128K10302023-04
gpt-4-vision-previewOpenAI100128K10302023-04
gpt-3.5-turbo-1106OpenAI4K122021-09
gpt-3.5-turbo-0613OpenAI4K1.522021-09
gpt-3.5-turbo-16k-0613OpenAI4K342021-09
gpt-3.5-turbo-0301OpenAI4K1.522021-09

Sources

Pricing

Context size information

Quality

Chat

top