Here are some FAQs.
Settings > Models & API Keys
.
Make sure you add the v1, for example: http://127.0.0.1:11434/v1
You don’t need to put an API key, unless you’ve enabled it yourself.
Note: The chat model needs to follow the OpenAI scheme.You can also override the Autocomplete, Embeddings, Voice, Thinking, Web Search, and Code Apply models. By doing this, you completely bypass our server.
Settings > Tools
and disabling the tools you don’t want.
You can then manually click the “Build and Fix Errors” button in the chat view whenever you like.
Note: If you don’t want to deal with rate limits, use Alex Sidebar’s Pro or Unlimited plans. We handle all the API management for you.
Settings > Privacy
.
We collect Crash Logs & Analytics (via Sentry and PostHog) which you cannot disable — unless you are on a team plan (see #9.)
Settings > Tools
and uncheck the Simulator actions, as well as the “Run App” and “Compile” actions.
TLDR; The context bar is only useful for knowing when to start a new chat. It’s not used for our billing.Long answer: The Context Bar (tokens) is entirely different than the message system. It shows how much of the context limit you’ve used up in the chat. AI systems work based on “Context”. Every time we send a message, we have to construct the whole chat into one large request to send to the AI. Naturally, this becomes very expensive. e.g. if you used Claude Sonnet 4 with 200k tokens (or approximately 1 million characters of text), you would need to spend $0.60 every time you send a message. This includes any time the agent takes an action. This is why we limit the amount of context is sent to the chat model. And when we limit it, that means only a certain length of conversation can be passed in. What determines how much context is used? The total text inside the chat. This includes: