BYOK (Bring Your Own Key)
EvidenceLens runs at $0 server-side LLM cost. To get conversational answers and summaries, you supply your own API key for the provider of your choice. We proxy the request, stream the response back, and never store the key.
Supported providers
- Anthropic — Claude models, prompt caching enabled by default.
- OpenAI-compatible — OpenAI, Groq, OpenRouter, Together, DeepInfra, and any other endpoint that speaks the OpenAI Chat Completions API. You configure the
base_urlalongside the key. - Ollama — your own local model. The "key" field carries your
OLLAMA_BASE_URLinstead of a credential. - WebLLM — fully in-browser inference, no key required. Slowest, but private by construction.
How it works
- You paste your key in the settings panel. It is stored only in your browser's
localStorage. - Each request sends the key over TLS to
POST /api/agenton the agent service, which forwards to the upstream provider and streams Server-Sent Events back. - The server validates the key on first use (cached for 10 minutes by
SHA-256(key + provider)) and writes only token counts, latency, and error codes to telemetry. The key itself is never logged or persisted.
Why BYOK
This is a free public service run on a small budget. Server-side LLM costs would scale with traffic and force us to either rate-limit aggressively or shut down. BYOK keeps the search engine itself permanently free while letting power users opt into whatever model and budget they prefer.
Privacy
Keys live in your browser and the in-memory request lifecycle of the proxy. They are not written to disk, not logged, not shared with third parties. If you'd rather not trust the proxy at all, switch to WebLLM and the entire interaction stays on your machine.