byte
#U
all bytes
Reminder to self. LLMs are stateless. You pay for the whole story every time. As conversations with agents get longer, each prompt gets compoundingly expensive even with engineering solutions to optimize this with cache, compression, decomposition, parallel agents etc.
xxx