Unacceptable latency with 4.1
Opened this issue · 3 comments
Describe the bug
After calling a tool, frequently 4.1 will take up to 30 seconds to give a response, even with a small number of tokens. Here are two examples
GPT 4.1-mini has no delay but frequently gets things wrong so we can't use it.
This started about a month ago. Prior to that, everything worked fine.
This is completely unacceptable for a paid product. I have submitted a support ticket as well and gotten no response.
I don't know how you expect businesses to adopt the agents SDK with such abysmal performance. We are getting complaints about it constantly and since we are raising money, VCs are calling it out and saying our product is shit. This needs to be fixed.
For reference, the 20 second movieglu response had 9,000 tokens in it.