Skip to content

How tokens work

We charge by tokens because that’s what the upstream LLM providers charge us. A “request” can use 1k or 100k tokens — billing per request either burns our margin or forces us to limit you artificially.

What is a token?

Roughly four characters of English text, or one short Russian word. The phrase def parse_token(t): is about 5 tokens. A 200-line file is typically 1,000–2,000 tokens.

What counts on your quota

DirectionCounted?
Your prompt (input)Yes
Files / snippets sent for contextYes
Model output (the reply, including tool calls)Yes
Tool results returned to the modelYes — these are part of the next prompt
The IDE talking to the backendNo

In practice, an average agent turn (small refactor) uses 3,000–8,000 tokens. A long debugging session can use 50,000+.

Plans at a glance

PlanMonthly tokensCrypto price
Trial250,000 (one-time)$0
Plus1,500,000 / month~$15
Pro5,000,000 / month~$35
Ultra20,000,000 / month + Max-Quality always on + priority~$99
Top-up+500,000 (valid 90 days)~$5

Yearly plans get −12%. See sirius-ide.com/pricing for live numbers.

When does Ultra make sense?

Ultra is the “almost-unlimited” tier (think: Cursor Ultra, but designed for CIS prices). Pick Ultra if any of these describe you:

  • You run the agent on big projects daily and constantly hit the Pro cap toward month-end.
  • You want Max Quality mode enabled by default (premium reasoning models: Claude Opus 4.7, GPT-5.2-codex, o3) instead of toggling it per-task.
  • You want top-priority queue placement so the agent doesn’t wait behind other users at peak hours.

Rollover policy

Monthly tokens do not roll over. Top-up packs are valid for 90 days. If you consistently leave tokens on the table, switch to a smaller plan + top-ups.