Groq 2024-07-23

llama-3.1-8b-instant

开源极速档 · 海量批量处理

Universaltextultra low costExtremely fast reasoning

数据政策 / Data Policy:未知 / Unknown

未经法务确认上游 ToS,不做承诺

查看 Groq 服务条款

提示:此 provider 数据政策待法务审定;medical / legal 客户请联系商务确认 DPA。

context window

128K

tokens

maximum output

8.2K

tokens

knowledge cutoff

2023-12

Overall rating

7.14

/ 10

capability radar

code6.5

Mathematics6.5

reasoning6.5

creativity6.5

multilingual7.0

long context7.0

speed10.0

Pricing

input price$0.060000/ 1M tokens

output price$0.096000/ 1M tokens

Support features

Tool callStreaming output

Recommended scenarios

开源极速档

海量批量处理

Call example

Called via Nexevo.ai gateway — fully compatible with OpenAI SDK, just replace base URL

curl https://api.nexevo.ai/v1/chat/completions \
  -H "Authorization: Bearer $NEXEVO_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "llama-3.1-8b-instant",
    "messages": [
      { "role": "user", "content": "Hello!" }
    ]
  }'

Other models of Groq

mixtral-8x7b-32768

context: 32K · Comprehensive score: 8.0

$0.288000

llama-3.3-70b-groq

context: 128K · Comprehensive score: 7.7

$0.708000

gemma2-9b-it

context: 8K · Comprehensive score: 7.1

$0.240000