Rate limit issues with new Azure AI Agent Service deployment

I’m having trouble with my new Azure AI project. I set up a gpt-4o-mini deployment with default settings. The metrics show I’m nowhere near the rate limits. But when I try to use the AI Agent Service preview, I keep getting rate limit errors.

Here’s what I did:

  1. Made a new project in Azure AI portal
  2. Set up a gpt-4o-mini deployment (Global Standard)
  3. Created an Agent using Assistants API
  4. Started a playground session
  5. Entered some input

Every time I do this, I get an error saying I hit a rate limit. It’s weird because I can use the model directly without any issues.

Is this a bug? Am I doing something wrong? I’m stuck and can’t figure out how to use the AI Agent Service. Any ideas what might be causing this?

// Example code snippet
function verifyLimit(currentUsage, maxLimit) {
  if (currentUsage < maxLimit) {
    console.log('Under the limit');
    return true;
  } else {
    console.log('Limit reached');
    return false;
  }
}

let currentUsage = 100;
let maxLimit = 8000;

let outcome = verifyLimit(currentUsage, maxLimit);
console.log(outcome);

Has anyone else run into this problem? I’d really appreciate some help!

I ran into this exact problem last week. Turns out, the AI Agent Service has separate rate limits from the model itself. What worked for me was going into the Azure portal and adjusting the quota specifically for the AI Agent Service. It’s hidden in a different menu than the regular model quotas.

Another thing to check is your network connection. I noticed that sometimes spotty internet can trigger false rate limit errors. Try running your requests from a different network if possible.

If none of that helps, you might want to try creating a new deployment from scratch. I’ve seen cases where a fresh setup resolved weird issues like this. Just remember to delete the old one to avoid unnecessary charges.

Lastly, keep an eye on the Azure status page. There were some backend issues recently that caused similar symptoms. Good luck sorting it out!

hey stella, i’ve faced similar issues. try resetting ur API key or waiting a bit before retrying. sometimes the system takes time to sync up. also, double-check ur quota settings in the azure portal. they might be different for the AI Agent Service. if nothing works, hit up azure support. they’re usually helpful with weird glitches like this.

I encountered a similar issue when setting up my Azure AI project. The rate limit errors with the AI Agent Service preview can be frustrating. Have you tried adjusting the concurrent requests setting in your deployment configuration? Sometimes, lowering this value can help mitigate rate limit problems. Additionally, ensure your Azure subscription has the necessary quota for AI Agent Service usage. It’s separate from the model usage quota. If the problem persists, I’d recommend reaching out to Azure support for a more in-depth investigation. They might need to check the backend configuration for your specific deployment.

yo stella, i had the same headache. try checkin ur network - sometimes bad connection messes with the service. also, make sure ur using the latest SDK version. if nothin helps, consider makin a new deployment. it fixed similar issues for me before. good luck!