Pricing

This page shows pricing information to help you understand API costs.For billing setup, payment methods, and usage monitoring, visit the Admin section. For rate limits, see the Rate Limits & Usage Tiers page.

Estimate your cost

Agent API Pricing

The Agent API provides access to third-party models from providers including OpenAI, Anthropic, Google, xAI, Z.AI, Moonshot AI, and NVIDIA with transparent, token-based pricing at direct provider rates with no markup.

Model Pricing

Agent API pricing varies by provider and model, with each provider offering multiple models at different price points.

View Complete Third-Party Model Pricing

See the full pricing breakdown for all available models from OpenAI, Anthropic, Google, xAI, Z.AI, Moonshot AI, and NVIDIA, including cache rates and provider documentation links on the Agent API Models page.

Tool Pricing

When using tools with the Agent API:

Tool	Price	Description
`web_search`	$0.0025 per invocation	Performs web searches to retrieve current information
`fetch_url`	$0.00025 per invocation	Fetches and extracts content from specific URLs
`people_search`	$0.005 per invocation	Looks up professionals, employees, and people. $5 per 1,000 tool invocations
`finance_search`	$0.005 per invocation	Retrieves financial data and market information. $5 per 1,000 tool invocations
`sandbox`	$0.03 per session	Isolated container for executing code during an Agent API request. A session covers up to 20 minutes of active use for billing purposes — this is the billing window, not a runtime cap. SDK search queries made from inside the sandbox are billed at $0.0025 per request (same as `web_search`).

Most tool costs are per invocation. sandbox is billed per container session — a 20-minute billing window per container, not a runtime cap — plus per SDK search query made from inside it. Tool costs are separate from model token costs.

Search API Pricing

API	Price per 1K requests	Description
Search API	$5.00	Raw web search results with advanced filtering

Billing unit: Search API charges for each successful POST /search request, not for each query in the request. A successful request containing an array of up to five queries is one billing unit. Invalid requests, rate-limited requests, and upstream failures are not billed. A successful response is billed even when it returns no results. There are no additional token-based charges.

Sonar API Pricing

Total cost per query = Token costs + Request fee (varies by search context size, applies to Sonar, Sonar Pro, and Sonar Reasoning Pro models only)

Token Pricing
Request Pricing
Pro Search Pricing

Token Pricing

Token pricing is based on the number of tokens in your request and response.

Model	Input Tokens ($/1M)	Output Tokens ($/1M)	Citation Tokens ($/1M)	Search Queries ($/1K)	Reasoning Tokens ($/1M)
Sonar	$1	$1	-	-	-
Sonar Pro	$3	$15	-	-	-
Sonar Reasoning Pro	$2	$8	-	-	-
Sonar Deep Research	$2	$8	$2	$5	$3

Request Pricing by Search Context Size

Search context determines how much web information is retrieved. Higher context = more comprehensive results. The following table shows the request fee for each model for every 1000 requests.

Model	Low Context Size	Medium Context Size	High Context Size
Sonar	$5	$8	$12
Sonar Pro	$6	$10	$14
Sonar Reasoning Pro	$6	$10	$14

Low: (default) fastest, cheapest
Medium: Balanced cost/quality
High: Maximum search depth, best for research

Learn more about search context →

Pro Search Pricing (Pro Search for Sonar Pro)

Pro Search enhances Sonar Pro with automated tool usage and multi-step reasoning. When enabled, the model can perform multiple web searches and fetch URL content to answer complex queries. Learn more about Pro Search here.

Pro Search requires stream: true and is enabled via the search_type parameter in web_search_options.

Search Type Options

Search Type	Description	Request Fee (per 1K)
`fast`	(default) Standard Sonar Pro behavior	$6 / $10 / $14
`pro`	Multi-step tool usage for complex queries	$14 / $18 / $22
`auto`	Automatic classification based on query complexity	Varies by classification

Request fees vary by search context size (Low / Medium / High). Token pricing remains the same as standard Sonar Pro ($3 per 1M input, $15 per 1M output).

Embeddings API Pricing

Generate high-quality text embeddings for semantic search, retrieval-augmented generation (RAG), and other machine learning applications.

Standard Embeddings

Model	Dimensions	Price ($/1M tokens)
`pplx-embed-v1-0.6b`	1024	$0.004
`pplx-embed-v1-4b`	2560	$0.03

Contextualized Embeddings

Model	Dimensions	Price ($/1M tokens)
`pplx-embed-context-v1-0.6b`	1024	$0.008
`pplx-embed-context-v1-4b`	2560	$0.05

View Embeddings API Documentation

Learn how to use the Embeddings API for semantic search, RAG, and more.

Token and Cost Glossary

Input Tokens

The number of tokens in your prompt or message to the API. This includes:

Your question or instruction
Any context or examples you provide
System messages and formatting

Example: “What is the weather in New York?” = ~8 input tokens

Output Tokens

The number of tokens in the API’s response. This includes:

The generated answer or content
Any explanations or additional context
Search results and references

Example: “The weather in New York is currently sunny with a temperature of 72°F.” = ~15 output tokens

Citation Tokens

Tokens used specifically for generating search results and references in responses. Only applies to Sonar Deep Research model.Example: Including source links, reference numbers, and bibliographic information

Search Context Size vs Context Window

Search context size is not the same as the context window.

Search context size: How much web information is retrieved during search (affects request pricing)
Context window: Maximum tokens the model can process in one request (affects token limits)

Search Queries

The number of individual searches conducted by Sonar Deep Research during query processing. This is separate from your initial user query.

The model automatically determines how many searches are needed
You cannot control the exact number of search queries
The reasoning_effort parameter influences the number of searches performed
Only applies to Sonar Deep Research model

Reasoning Tokens

Tokens used for step-by-step logical reasoning and problem-solving. Only applies to Sonar Deep Research model.Example: Breaking down a complex math problem into sequential steps with explanations

Token Calculation: 1 token ≈ 4 characters in English text. The exact count may vary based on language and content complexity.

Cost Examples

Agent API Web Search

openai/gpt-5.2 • 500 input + 200 output tokens • 1 web search

Component	Cost
Input tokens	$0.000875
Output tokens	$0.0028
`web_search`	$0.0025
Total	$0.006175

Agent API Research Preset

low preset representative run • 2,000 input + 1,000 output tokens • 1 web search + 1 fetch

Component	Cost
Model input tokens	$0.001
Model output tokens	$0.003
`web_search`	$0.0025
`fetch_url`	$0.00025
Total	$0.00675

Actual preset costs vary with the selected model, token usage, and tool invocations. When present on a completed response, usage.cost.total_cost reports the calculated request cost.

Purchase Options

Perplexity API Platform on AWS Marketplace

Purchase API credits through AWS Marketplace with consolidated billing and enterprise procurement.

Contact Sales Team

Fill out our enterprise inquiry form to discuss custom pricing, dedicated support, and enterprise features for teams and organizations.

Getting Started

Agent API

Search API

Embeddings API

Perplexity SDK

Admin & Management

Resources

Legacy API

Estimate your cost

Agent API Pricing

Model Pricing

View Complete Third-Party Model Pricing

Tool Pricing

Search API Pricing

Sonar API Pricing

Token Pricing

Request Pricing by Search Context Size

Pro Search Pricing (Pro Search for Sonar Pro)

Search Type Options

Embeddings API Pricing

Standard Embeddings

Contextualized Embeddings

View Embeddings API Documentation

Input Tokens

Output Tokens

Citation Tokens

Search Context Size vs Context Window

Search Queries

Reasoning Tokens

Cost Examples

Agent API Web Search

Agent API Research Preset

Purchase Options

Perplexity API Platform on AWS Marketplace

Contact Sales Team

​Estimate your cost

​Agent API Pricing

​Model Pricing

View Complete Third-Party Model Pricing

​Tool Pricing

​Search API Pricing

​Sonar API Pricing

​Token Pricing

​Request Pricing by Search Context Size

​Pro Search Pricing (Pro Search for Sonar Pro)

​Search Type Options

​Embeddings API Pricing

​Standard Embeddings

​Contextualized Embeddings

View Embeddings API Documentation

​Input Tokens

​Output Tokens

​Citation Tokens

​Search Context Size vs Context Window

​Search Queries

​Reasoning Tokens

​Cost Examples

Agent API Web Search

Agent API Research Preset

​Purchase Options

Perplexity API Platform on AWS Marketplace

Contact Sales Team

Estimate your cost

Agent API Pricing

Model Pricing

Tool Pricing

Search API Pricing

Sonar API Pricing

Token Pricing

Request Pricing by Search Context Size

Pro Search Pricing (Pro Search for Sonar Pro)

Search Type Options

Embeddings API Pricing

Standard Embeddings

Contextualized Embeddings

Input Tokens

Output Tokens

Citation Tokens

Search Context Size vs Context Window

Search Queries

Reasoning Tokens

Cost Examples

Purchase Options