Improvement: Increasing Token Usage Efficiency (In Progress) #678

kc0tlh · 2024-10-16T23:47:46Z

Background
Large language models (LLMs) decode text through tokens—frequent character sequences within text/code. Under the hood Bolt.new is powered mostly by Anthropic's Sonnet 3.5 AI model, so using Bolt consumes tokens that we must purchase from Anthropic.

Our goal is for Bolt to use as few tokens as possible to accomplish each task, and here's why: 1) AI model tokens are one of our largest expenses and if less are used, we save money, 2) so that users can get more done with Bolt and become fans/advocates, and 3) ultimately so we can attract more users and continue investing in improving the platform!

When users interact with Bolt, tokens are consumed in 3 primary ways: chat messages between the user and the LLM, the LLM writing code, and the LLM reading the existing code to capture any changes made by the user.

There are numerous product changes that we are working on to increase token usage efficiency, and in the meantime there are many tips and tricks you can implement in your workflow to be more token efficient:

Upcoming Improvements
Optimizing token usage is a high priority for our team, and we are actively exploring several R&D initiatives aimed at improving token usage efficiency automatically behind the scenes. In the meantime, we will be shipping multiple features in the next 2 weeks that give the user more control over the AI including controlling when it can write code to the filesystem, and which files it is able to modify. These additional controls, paired with the tips below should help you manage your tokens more efficiently. Subscribe to this issue to be notified when those new features land.

While we work on these improvements, here are some strategies you can use to maximize token usage efficiency today:

Avoid Repeated Automated Error "Fix" Attempts
Continuously clicking the automatic "fix" button can lead to unnecessary token consumption. After each attempt, review the result and refine your next request if needed. There are programming challenges that the AI cannot solve automatically, so it is a good idea to do some research and intervene manually if it fails to fix automatically.
Leverage the Rollback Functionality
Use the rollback feature to revert your project to a previous state without consuming tokens. This is essentially and undo button that can take you back to any prior state of your project, This can save time and tokens if something goes wrong with your project. Keep in mind that there is no "redo" function though, so be sure you want to revert before using this feature because it is final: all changes made after the rollback point will be permanently removed.
Crawl, Walk, Run
Make sure the basics of your app are scaffolded before describing the details of more advanced functionality for your site.
Use Specific and Focused Prompts
When prompting the AI, be clear and specific. Direct the model to focus on certain files or functions rather than the entire codebase, which can improve token usage efficiency. This approach is not a magic fix, but anecdotally we've seen evidence that it helps. Some specific prompting strategies that other users have reported as helpful are below, and a ton more can be found in the comment thread below:

If you have specific technologies you want to use (IE Astro, Tailwind, ShadCN), say that in your initial prompt

Mention Specific Code Segments or Classes: When possible, refer to specific divs, classes, or functions to guide Bolt to the exact place where you want the changes made. You can do this manually in chat or by highlighting the relevant code in your project and using the "Ask Bolt" functionality.

Use the Prompt Enhancer function: The better the prompt, the higher quality of the output — bolt․new can help you improve your prompts automatically with the prompt enhancement feature!

Be Specific About What Should Remain Unchanged: Mention explicitly that no modifications should occur to other parts of the site.

Batch multiple simple to explain instructions into one message. For example you can ask Bolt to change the color scheme, add mobile responsiveness, and restart the dev server safely all in one message.

Understand Project Size Impact
As your project grows, more tokens are required to keep the AI in sync with your code. Larger projects (and longer chat conversations) demand more resources for the AI to stay aware of the context, so it's important to be mindful of how project size impacts token usage.
Advanced Strategy: Reset the AI Context Window
If the AI seems stuck or unresponsive to commands, consider refreshing the Bolt.new chat page in your browser. This resets the LLM’s context window, clears out prior chat messages, and reloads your code in a fresh chat session. This will clear the chat, so you will need to remind the AI of any context not already captured in the code, but it can help the AI regain focus when it is overwhelmed due to the context window being full.

We appreciate your patience during this beta period and look forward to updating this thread as we ship new functionality and improvements to increase token usage efficiency!

kc0tlh · 2024-10-22T19:58:29Z

Checkout our latest X thread outlining some of the upcomming dev priorities related to error fixing and token efficiency, specifically:

breaking error loops
controlling what the AI can & cannot edit
a "require approval before AI changes code" mode
& reducing avg token usage per message

All landing over this week & next, ETAs for each included in thread!

kc0tlh · 2024-10-30T17:25:47Z

📣 Update: We are hosting live office hours on Youtube tomorrow where we will be announcing new features that greatly enhance token efficiency, Bolt.new's overall intelligence, and your ability to intervene and control how the AI modifies your code.

I will be posting updates here as well after the live stream, so stay tuned!

yasinkocak · 2024-11-02T08:26:21Z

why is toke reload expensiver then monthly price with the same amount tokens?

kc0tlh · 2024-11-02T23:37:52Z

@yasinkocak the TLDR is that by doing this, it allows us to bring down the price of tokens for everyone on subscription plans. If you're interested, our CEO gave the more detailed explanation on the bolt.new weekly office hours 2 weeks ago.

chefbc2k · 2024-11-02T23:43:43Z

Thank you. I really appreciate the response. But what I'm concerned with now is that I've spent a little bit over $120 on the same project due to a little bit of misunderstanding of how the app works, and then also not being a clear breakdown of how many tokens are used for what or in relation to that, or more tokens used for fixes that there are, job completions. And since it's not clear, like it is inclined, how much token usage I'm using per event, what I would like to know is if I can get a refund or a partial refund for some of the money that I've already spent. I understand that you're going through testing and basically beta right now. But like you're learning, I'm learning as well, and I think we should both give grace for that.

…

On Nov 2, 2024, at 6:38 PM, Alexander Berger ***@***.***> wrote: @yasinkocak <https://github.com/yasinkocak> the TLDR is that by doing this, it allows us to bring down the price of tokens for everyone on subscription plans. If you're interested, our CEO gave the more detailed explanation on the bolt.new weekly office hours <https://www.youtube.com/live/W1mH1aSh6Vo?si=hLaSIO8YtY5g010P&t=578> 2 weeks ago. — Reply to this email directly, view it on GitHub <#678 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AR3HI2F5YXQ3H5L6VUMZTF3Z6VO6LAVCNFSM6AAAAABQCRFMTOVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDINJTGIZDKNRWGA>. You are receiving this because you are subscribed to this thread.

kc0tlh · 2024-11-02T23:52:23Z

@chefbc2k all refund related issues will be handled via [email protected] so please reach out there with a link to your comment, thanks!

kc0tlh · 2024-11-03T00:04:56Z

📣 Major Update: in our latest Office Hours session announced a complete overhaul of how the AI will write code to the filesystem in the future. You can watch the clip from our Office Hours session on the new diff-based approach here.

You can try using Bolt with the new Diffs approach at this preview URL: https://1f1a1008.bolt-cnr.pages.dev/ Please note: this is experimental, has bugs, and may completely destroy your project, so DO NOT edit projects that are important to you via this URL.

Finally, you can checkout last weeks full bolt.new Office Hours session here for more details on what is coming next!

tamlerner · 2024-11-05T17:13:47Z

Hi , same here!

I've used a "repair" prompt many times and lost TONS of tokens.

Plus many times this was created due to the issue of the code not being entirely finished. What I mean: Basically when you ask bolt to rewrite a file with an error that was targeted, sometimes it rewrites parts of it, in the middle it leaves a: "... same code as before...." text, instead of bringing the full code. So you have to retriger an action to ask it to ask the full file instead of leaving a comment in the middle. Again, waste of a lot of tokens plus sometimes since it rewrites many files at the same time you have to look one by one to see if it actually has gone through this mistake.

This is something the Claude model does a lot and have experimented even with the latest update, so even though I am very specific about the prompt now, there definitely needs to be a fix to improve token efficiency and avoid unnecessary bugs from "uncompleted" scripts.

katetrahan · 2024-11-06T18:19:59Z

One our our community members created a helpful guide for prompting best practices that may be helpful with token usage efficiency. You can view the full post in our discord channel:

🚀 Ultimate Guide to bolt.new Prompting
https://bolt.new

📚 Resources & Tools
• Bolt Prompter (ChatGPT): https://chatgpt.com/g/g-tozliiBeO-bolt-prompter
• Prompt Inspiration: https://cursor.directory/
• Issues & Updates: https://github.com/stackblitz/bolt.new/issues

🏗️ Project Structure Best Practices

Start Broad, Then Drill Down
- Begin with overall architecture
- Use small, focused prompts for specific components
- Keep token count under 200,000 per prompt

Development Sequence

1. UI Design & Components
2. Authentication (add after UI is stable)
3. Database Setup
4. API Routes
5. Additional Features

User Story Prompting
Example format:

"I need an app where:
- As a user, I want to sign in with email
- As a user, I want to view my dashboard
- As a signed-in user, I want to update my profile"

🛠️ Specific Implementation Tips

UI Development

"Create a modern [type] app with:
1. Clean, minimal design
2. Responsive layout
3. Following components: [list them]
4. Using [Tailwind/shadcn/etc] for styling"

Authentication

Firebase works well for quick implementation
Supabase requires more setup but offers good features
Add auth AFTER basic UI is working

Database Integration
Works well:

Firebase: Easiest integration, good for beginners
NoCoDB: Reported successful implementations
MongoDB: Works with proper abstraction layer

Challenging/WIP:

Supabase: Some connection issues reported
AWS: Limited success reports
External SQL databases: Mixed results

⚡ Pro Tips

Controlled Changes

"bolt: don't change any code whatsoever. 
Theoretically, if I want to add [feature], 
what code changes would be needed?"

Error Prevention

Fork project for fresh start if stuck
Save working versions before major changes
Implement database abstraction layer for flexibility

Database Setup

Start with test mode
Add authentication layer
Switch to production after testing

Image Handling

Use external CDNs (e.g., Webflow) for images
Firebase Storage for file uploads
Avoid direct image hosting in bolt

🎯 Example Full Project Prompt

I need a [type] application with these specifications:

Tech Stack:
- Frontend: React with TypeScript
- Styling: Tailwind CSS
- Auth: Firebase
- Database: [choice]

Core Features:
1. User authentication
2. [Main feature]
3. [Secondary features]

Start with creating the main page that includes:
[Detailed page requirements]

⚠️ Common Pitfalls to Avoid

Don't try to build everything in one prompt
Avoid adding auth before UI is stable
Don't mix database implementations
Test each feature before moving to next
Keep backups of working code

🔄 If Things Go Wrong

Fork the project for a fresh start
Break down the problem into smaller prompts
Check GitHub issues for known problems
Move to discussions channel for help

📝 Note: bolt.new is actively developing (< 4 weeks in production). These practices are community-sourced and evolving. For latest updates, check the GitHub repository.

Guide created from community discussions in the bolt.new Discord. Last updated: Nov 5, 2024.

kc0tlh added the enhancement New feature or request label Oct 17, 2024

lcgarza10 mentioned this issue Nov 1, 2024

Deployment Error: An error occured while streaming this message. #1022

Open

This was referenced Nov 1, 2024

Chat Error: prompt is too long: 200000 tokens > 199999 maximum #1322

Open

Using up credits to fix problems but doesnt fix the problem #1305

Closed

Bolt.new repeatedly fails when adding new content #939

Closed

kc0tlh mentioned this issue Nov 2, 2024

application gets stuck #907

Closed

kc0tlh mentioned this issue Nov 5, 2024

your bot is a liar. #1620

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improvement: Increasing Token Usage Efficiency (In Progress) #678

Improvement: Increasing Token Usage Efficiency (In Progress) #678

kc0tlh commented Oct 16, 2024 •

edited

Loading

kc0tlh commented Oct 22, 2024

kc0tlh commented Oct 30, 2024

yasinkocak commented Nov 2, 2024

kc0tlh commented Nov 2, 2024

chefbc2k commented Nov 2, 2024 via email

kc0tlh commented Nov 2, 2024

kc0tlh commented Nov 3, 2024

tamlerner commented Nov 5, 2024

katetrahan commented Nov 6, 2024 •

edited

Loading

Improvement: Increasing Token Usage Efficiency (In Progress) #678

Improvement: Increasing Token Usage Efficiency (In Progress) #678

Comments

kc0tlh commented Oct 16, 2024 • edited Loading

kc0tlh commented Oct 22, 2024

kc0tlh commented Oct 30, 2024

yasinkocak commented Nov 2, 2024

kc0tlh commented Nov 2, 2024

chefbc2k commented Nov 2, 2024 via email

kc0tlh commented Nov 2, 2024

kc0tlh commented Nov 3, 2024

tamlerner commented Nov 5, 2024

katetrahan commented Nov 6, 2024 • edited Loading

kc0tlh commented Oct 16, 2024 •

edited

Loading

katetrahan commented Nov 6, 2024 •

edited

Loading