






















1. Use context compression proxies/tools. These can compress anywhere from 5% to over 95% depending on the workload. Search github for these.
2. Use bigger models for creating detailed plan, and then use smaller cheap models, like deepseek-v4-flash, or gemini-3-flash for actual implementation. This works really well.
3. Do not just keep chatting in the same session. Try to start a new session for evrry new chat message, or at most like after 2-3 chat messages. if needed, you can ask it summarize the details and use that as context for new chat session.
4. Implement the features in small sets, not one go.. and reset session after every set is done.
5. Keep AGENTS.md small, just the basic info about your project, and the file paths and what that file contains and do and then, general guideleines (10 max).
此内容由惯性聚合(RSS阅读器)自动聚合整理,仅供阅读参考。 原文来自 — 版权归原作者所有。