Perplexity Sonar Models

ModelPrice per 1000 requestsPrice per 1M tokens
llama-3.1-sonar-small-128k-online$5$0.2
llama-3.1-sonar-large-128k-online$5$1
llama-3.1-sonar-huge-128k-online$5$5

The pricing for sonar models is a combination of a fixed price per request and a small variable price based on number of input and output tokens in a request.

Perplexity Chat Models

ModelPrice per 1M tokens
llama-3.1-sonar-small-128k-chat$0.2
llama-3.1-sonar-large-128k-chat$1

Open-Source Models

ModelPrice per 1M tokens
llama-3.1-8b-instruct$0.2
llama-3.1-70b-instruct$1