OpenRouter and BYOK - ONA.UNO Docs

ONA.UNO uses a “Bring Your Own Key” (BYOK) model for cloud AI features. This means you create your own account with an AI provider and use your own API key — ONA.UNO doesn’t bundle AI services into its price or act as a middleman.

This document explains how it works and why we chose this approach.

a close up of a key on a table

What is OpenRouter?

OpenRouter is a unified API service that provides access to AI models from multiple providers — Google (Gemini), Alibaba (Qwen), xAI (Grok), and many others — through a single account and API key.

Instead of creating separate accounts with Google, Alibaba, and xAI, you create one OpenRouter account and access all their models through one interface with one balance.

Why ONA.UNO Uses OpenRouter

Access to multiple providers. ONA.UNO uses different models for different tasks: Qwen for embeddings, Gemini for pipeline processing, and your choice of Gemini or Grok for chat. OpenRouter lets you access all of these with a single API key.

Near-cost pricing. OpenRouter charges close to what the underlying providers charge, with minimal markup. You’re paying for actual AI compute, not a reseller premium.

No vendor lock-in. If a better model becomes available from a new provider, OpenRouter typically adds support quickly. ONA.UNO can offer new models without requiring you to create new accounts.

Proven reliability. OpenRouter handles millions of API requests and provides consistent uptime. It’s a mature service used by many applications.

What is BYOK?

BYOK stands for “Bring Your Own Key.” It sounds technical, but the concept is simple: instead of ONA.UNO charging you for AI usage, you create your own account with an AI service and pay them directly.

Think of it like this: ONA.UNO is the kitchen, but you bring your own ingredients. The AI provider (accessed through OpenRouter) supplies the ingredients — ONA.UNO just does the cooking.

How It Works in Practice

You create a free account at OpenRouter (takes 2 minutes)
You add some credit — as little as $5 to start
You copy your API key into ONA.UNO’s settings
Done. ONA.UNO can now use cloud AI features.

That’s the entire setup. No subscription forms, no recurring billing to manage, no cancellation hassles.

What You’re Actually Paying For

When ONA.UNO processes your content with cloud AI, it sends requests to AI providers like Google (Gemini) or Alibaba (Qwen). These providers charge tiny amounts per request — fractions of a cent for most operations.

With BYOK, you pay:

The actual AI cost — what Google, Alibaba, or xAI charges for processing
A small routing fee — OpenRouter adds a minimal surcharge for providing unified access

That’s it. No markup from ONA.UNO, no “AI credits” with inflated pricing, no mystery fees.

Complete Cost Visibility

You always know exactly what you’re spending:

In ONA.UNO: The status bar shows your current monthly spending and the cost of your last request — updated in real time. No surprises at the end of the month.

In OpenRouter: Your dashboard shows precise, up-to-the-second spending for each API key. You can see exactly which requests cost what.

No Risk of Overspending

Worried about runaway costs? You’re fully protected:

Set a spending limit on your API key. Once reached, the key stops working — period. You decide the maximum, and it’s a hard cap.
Credits don’t auto-renew by default. When your balance runs out, requests simply fail. You add more credits when you’re ready.
No surprise charges. Unless you explicitly enable auto-recharge (which is off by default), nothing happens automatically.

You’re always in control.

Is This Complicated?

No. Creating an OpenRouter account takes a few minutes. The API key is just a string of characters you copy once and paste into ONA.UNO’s settings.

This is for your benefit

BYOK might sound unfamiliar, but it’s actually the most user-friendly approach to AI costs. You pay only for what you use, at near-wholesale prices, with complete visibility into spending. No subscriptions, no commitments. Start with $5 to process a few hundred to a few thousand items — if it’s not for you, you’re out the cost of a coffee.

Why BYOK?

Given that BYOK requires a small setup step, why did we choose this approach?

Transparency

You see exactly what AI processing costs. There’s no markup hidden in ONA.UNO’s price, no “AI credits” with unclear value. When you spend $1 on OpenRouter, you get $1 worth of AI processing at published rates.

No Recurring Fees

Many apps charge monthly subscriptions that assume heavy AI usage. If you use the app lightly one month, you still pay the full subscription. With BYOK, you pay only for what you use. Take a month off? Pay nothing.

Your Relationship, Your Data

Your API requests go directly from your Mac to OpenRouter to the AI provider. ONA.UNO doesn’t proxy your requests through our servers. We don’t see your content, your usage patterns, or your costs. Your relationship is directly with OpenRouter and the AI providers.

Flexibility

You can use the same OpenRouter account with other applications. Your credits aren’t locked to ONA.UNO — they work anywhere OpenRouter is supported.

Getting Started with OpenRouter

1. Create an Account

Visit openrouter.ai and create an account. You can sign up with email or use Google/GitHub authentication.

2. Add Credits

Navigate to your account settings and add credits. The minimum is $5 USD. Credits are added to your balance immediately and never expire.

Payment methods include credit cards and cryptocurrency. OpenRouter doesn’t store your payment details — they use Stripe for processing.

3. Generate an API Key

In your OpenRouter dashboard, go to Keys and create a new API key. Give it a descriptive name like “ONA.UNO” so you can identify it later.

Important: You can set a spending limit when creating the key. This is a hard cap — once reached, the key stops working until you raise the limit. This prevents any possibility of runaway costs.

4. Add the Key to ONA.UNO

In ONA.UNO, go to Settings → AI → Remote Models and paste your API key. ONA.UNO will validate it and show your current balance.

That’s it. ONA.UNO can now use cloud AI features.

Cost Structure

Pay-As-You-Go

OpenRouter is purely pay-as-you-go. There’s no subscription, no monthly minimum, no recurring charges. Add credits when you need them, use them over time.

Credits Never Expire

Unlike some services that expire unused credits, OpenRouter credits remain in your account indefinitely. Add $20 and use it over six months — no problem.

Per-Token Pricing

AI costs are measured in tokens (roughly 4 characters or 0.75 words). Each model has published input and output token prices. Embeddings are charged per token processed. Chat responses are charged for both the context (input) and the response (output).

You don’t need to understand token math — just know that longer content costs more to process, and the prices are published and predictable.

Real-World Costs

Here’s what typical ONA.UNO usage actually costs:

Initial Library Processing

When you first add sources to ONA.UNO, every item needs embeddings (for search) plus summaries, titles, and tags. This is the largest one-time cost.

Library Size	Estimated Cost
10,000 items	$3–4
25,000 items	$8–10

The actual cost depends on content length — a library of short notes costs less than one full of long documents.

Ongoing Daily Use

After initial processing, costs come from:

New items: $0.04–0.06 per 100 items
Chat conversations: $0.01–0.15 per conversation, depending on length
Full summaries: $0.003–0.006 per summary

Rough estimate: Active daily use (adding 30–50 items, regular chat and summaries) runs $2–5 per month.

Inactive Periods

If you don’t use ONA.UNO’s cloud features, you pay nothing. Your credits sit in your account until you need them.

For detailed cost breakdowns, see LLM Cost (OpenRouter).

Spending Limits

OpenRouter provides robust cost control through API key limits.

Setting a Limit

When you create an API key (or edit an existing one), you can set a credit limit. This is the maximum amount that key can spend — ever, across all time.

For example, set a $10 limit. Once total spending reaches $10, the key stops working. You’ll need to either add more credits and raise the limit, or create a new key.

Why This Matters

Spending limits give you complete control:

No surprise bills. If you set a $20 limit, you cannot spend more than $20.
Budget enforcement. Set a limit that matches your comfort level.
Peace of mind. Even if something goes wrong (a bug, unexpected usage), spending stops at your limit.

Recommended Approach

Start with a conservative limit ($10–20) while you learn your usage patterns. After a month, you’ll have a sense of what ONA.UNO costs for your workflow. Adjust the limit accordingly.

Model Selection in ONA.UNO

ONA.UNO uses specific models for different tasks:

Pipeline Processing (Automatic)

Task	Model	Notes
Embeddings	Qwen3 Embedding 8B	Creates semantic vectors for search
Summaries	Gemini 2.5 Flash Lite	Generates bullet-point summaries
Titles	Gemini 2.5 Flash Lite	Creates descriptive titles
Tags	Gemini 2.5 Flash Lite	Suggests relevant tags

These models are selected for their balance of quality, speed, and cost. You don’t choose them — ONA.UNO uses them automatically.

Chat and Full Summaries (Your Choice)

For interactive features, you choose between model families:

Google Gemini 2.5 Flash

Excellent reasoning and fast responses
Large context window
Available in Balanced (faster) and High (deeper reasoning) variants

Grok 4.1 Fast

Massive context capacity (up to 2 million tokens)
Good for conversations spanning many documents
Available in Balanced and High variants

Switch between models anytime in the chat interface. Different models suit different tasks — experiment to find what works for your questions.

For a deeper dive into how ONA.UNO’s AI pipeline works — including why specific models were chosen and how they work together — see Leveraging AI.

Mixed Mode and API Keys

If you use Mixed mode, the pipeline stages (embeddings, summaries, titles, tags) run locally on your Mac — no API key needed for those operations.

However, chat and full summaries require an API key even in Mixed mode. These features use cloud models because local models can’t deliver acceptable quality for interactive conversations and comprehensive summaries.

Also, AI → Consolidate AI Tags… requires an OpenRouter API key in both Mixed and Online modes, because clustering uses a remote model.

You can use Mixed mode without an API key to explore ONA.UNO’s timeline and search. But to unlock chat and summaries, you’ll need an OpenRouter account.

Troubleshooting

When something goes wrong, ONA.UNO shows an error with an HTTP status code from OpenRouter. Here’s what the common codes mean:

Status 401 — Authentication Failed

Your API key isn’t working.

Check that you copied the entire key (they’re long strings)
Ensure there are no extra spaces before or after the key
Verify the key is active in your OpenRouter dashboard

Status 402 — Payment Required

Your OpenRouter balance is empty or your key has hit its spending limit.

Add more credits at openrouter.ai
Or raise the spending limit on your API key

Status 429 — Rate Limited

OpenRouter or the underlying provider is temporarily limiting requests. This usually resolves within minutes. If it persists, check OpenRouter’s status page.

Status 500/502/503 — Server Error

OpenRouter or the AI provider is having issues. Wait a few minutes and try again. Check OpenRouter’s status page for outage information.

”OpenRouter API key missing”

ONA.UNO’s status bar shows this when no API key is configured. Go to Settings → AI → Remote Models and add your key.

One Feature Works But Another Doesn’t

Different features use different models. If chat works but pipeline processing doesn’t (or vice versa):

A specific model may be temporarily unavailable — check OpenRouter’s status page
Try switching chat models to see if the issue is model-specific

Privacy Considerations

When you use cloud AI features:

Your content travels from your Mac → OpenRouter → AI Provider (Google, Alibaba, or xAI)
OpenRouter routes the request but doesn’t store your content beyond processing
The AI provider processes your content according to their privacy policy
ONA.UNO never sees your content — the connection is direct from your Mac

For detailed information about data handling, see Privacy FAQ.

Summary

Aspect	Details
What is BYOK	You create your own OpenRouter account and API key
Why BYOK	Transparency, no subscriptions, direct relationship with provider
Minimum to start	$5 USD (credits never expire)
Typical costs	~$3 to process 10,000 items; $2–5/month ongoing
Cost control	Set hard spending limits on API keys
Required for	Online mode (all features), Mixed mode (chat, full summaries, and AI tag consolidation)

Questions not covered here? Check the other FAQ topics or contact us.