Share via

403 Forbidden: Resource temporarily blocked on gpt-5.5 - How to configure for legitimate bursty traffic

Kunal Singal 0 Reputation points
2026-05-25T19:11:13.67+00:00

Environment

model: gpt-5.5

region: East US 2

I recently deployed gpt-5.5 and was trying to use it with codex
Since then model seems to be blocked

HTTP/2 403 content-length: 123 content-type: application/json apim-request-id: f6501a03-6b98-48a1-8cda-288f920af250 strict-transport-security: max-age=31536000; includeSubDomains; preload x-content-type-options: nosniff x-ms-region: East US 2 date: Mon, 25 May 2026 18:57:37 GMT {"error":{"code":"Forbidden","message":"Your resource has been temporarily blocked because we detected unusual behavior."}}

Codex usage (seems expected to me for a coding agent)
User's image

Expected usage context:

Expected usage is bursty by design - not anomalous:

  • Production agents (end-user facing): unpredictable bursty usages by nature (not uniform over days / within single day)
  • At times coding agents like codex (idle for hours / days then there can be usages like above)

Key questions

  1. Please help in unblocking the above resource? (apim-request-id above)
  2. What specifically triggered the block: token volume, idle→burst pattern, or content filter? The 403 gives no diagnostic signal.
  3. What limits need to be increased to accommodate bursty workloads given the expected usage? Ideally i would want to eliminate / relax these limits to accommodate our expected bursty workloads.
  4. Is there a documented warmup recommendation for new or idle deployments?
Foundry Models
Foundry Models

A catalog of AI models in Microsoft Foundry that you can discover, compare, and deploy using Azure’s built‑in tools for evaluation, fine‑tuning, and inference


1 answer

Sort by: Most helpful
  1. Sina Salam 29,846 Reputation points Volunteer Moderator
    2026-05-26T13:37:00.16+00:00

    Hello Kunal Singal,

    Welcome to the Microsoft Q&A and thank you for posting your questions here.

    I understand that you are having 403 Forbidden: Resource temporarily blocked on gpt-5.5 - and need How to configure for legitimate bursty traffic.

    I can say that this error is rampart recently for now with many services.

    This issue is not a normal quota or rate-limit problem. A standard Azure OpenAI quota/rate-limit issue normally appears as HTTP 429, while the customer’s error is HTTP 403 with the message “Your resource has been temporarily blocked because we detected unusual behavior.” This points to a service-side protection / abuse-monitoring block, not something that can be fixed by simply increasing TPM/RPM or changing max_tokens. Azure abuse monitoring evaluates both content signals and usage behavior patterns, including recurrence, severity, and potential misuse indicators. - https://learn.microsoft.com/en-us/azure/foundry/openai/concepts/abuse-monitoring, https://learn.microsoft.com/en-us/azure/foundry/openai/how-to/quota

    Best thing to do is to stop all traffic to the affected resource, then open an Azure technical support request asking Microsoft to perform a temporary block / unusual-behavior review using the APIM request ID, timestamps, resource name, region, deployment name, and the full 403 response. Azure support is the only reliable path to both unblock the resource and confirm the backend trigger category. - https://learn.microsoft.com/en-us/azure/azure-portal/supportability/how-to-create-azure-support-request

    After Microsoft clears the block, the workload should be redesigned for bursty Codex / coding-agent traffic. If the workload remains on Standard or Global Standard deployment, enforce client-side rate shaping, backoff, jitter, request smoothing, and gradual ramp-up. If the workload is production-critical and legitimately bursty, move it to Provisioned Throughput (PTU) because PTU is designed for predictable throughput and latency with allocated model-processing capacity. - https://learn.microsoft.com/en-us/azure/foundry/openai/concepts/provisioned-throughput, and https://learn.microsoft.com/en-us/azure/foundry/openai/how-to/latency

    I hope this is helpful! Do not hesitate to let me know if you have any other questions, steps or clarifications.


    Please don't forget to close up the thread here by upvoting and accept it as an answer if it is helpful.

    Was this answer helpful?

    0 comments No comments

Your answer

Answers can be marked as 'Accepted' by the question author and 'Recommended' by moderators, which helps users know the answer solved the author's problem.