Microsoft Fabric REST API Performance remediate

Structured diagnostic workflows and automation scripts for identifying and resolving performance bottlenecks in Microsoft Fabric REST API integrations.

When to Use This Skill

API calls to api.fabric.microsoft.com are slow or timing out
Receiving HTTP 429 (Too Many Requests) responses with Retry-After headers
Long running operations (LRO) polling is inefficient or stalling
Paginated API responses are taking too long to enumerate
Bulk workspace or item operations exceed capacity throttle limits
Entra ID token acquisition is adding unexpected latency
Spark job submissions return HTTP 430 (TooManyRequestsForCapacity)
Need to benchmark Fabric REST API throughput for a given capacity SKU

Prerequisites

PowerShell 7+ with Invoke-RestMethod support
Microsoft Entra ID app registration with appropriate Fabric scopes
A valid Bearer token or MSAL-based authentication flow
Access to at least one Fabric workspace

Diagnostic Decision Tree

Determine the root cause category before applying a fix:

API Call Slow or Failing?
├── HTTP 429 returned?
│   ├── YES → Throttling. See §1 Throttling Diagnosis
│   └── NO  → Continue
├── HTTP 430 returned?
│   ├── YES → Capacity exhausted. See §2 Capacity Limits
│   └── NO  → Continue
├── HTTP 202 + LRO stalling?
│   ├── YES → Polling issue. See §3 LRO Optimization
│   └── NO  → Continue
├── Large result sets slow?
│   ├── YES → Pagination. See §4 Pagination Tuning
│   └── NO  → Continue
├── Token acquisition slow?
│   ├── YES → Auth latency. See §5 Token Performance
│   └── NO  → General latency. See §6 Baseline Benchmarking

§1 Throttling Diagnosis (HTTP 429)

Fabric throttles per-user, per-API within a time window. When exceeded, the API returns HTTP 429 with a Retry-After header (in seconds).

Diagnosis Steps:

Run the throttle diagnostic script to measure your current request rate against throttle limits
Capture Retry-After header values to understand cooldown periods
Review call patterns for burst behavior vs. steady-state

Resolution Patterns:

Pattern	Description
Exponential backoff	Respect `Retry-After`, then add jitter to avoid thundering herd
Request batching	Group related calls to reduce total API invocations
Caller isolation	Use separate service principals for independent workloads
Rate limiter	Implement a client-side token bucket before sending requests

Key Facts:

Every Fabric admin and core public API call is throttled
Throttle window and limits are per-user, per-API (not published explicitly)
The Retry-After value is in seconds (commonly 30-60s)

See throttling-deep-dive.md for implementation patterns.

§2 Capacity Rate Limits (HTTP 430)

Spark jobs and compute-bound operations have a separate throttle tied to the Fabric capacity SKU. When the max queue limit is reached, new jobs return HTTP 430.

Capacity Queue Limits:

SKU	Queue Limit
F2 / F4	4
F8	8
F16	16
F32	32
F64 (P1)	64
F128 (P2)	128
F256 (P3)	256
F512 (P4)	512
F1024	1024
F2048	2048
Trial	Not supported

Resolution:

Cancel active Spark jobs via the Monitoring Hub
Upgrade to a larger capacity SKU
Enable optimistic job admission for higher concurrency
Implement client-side queue management before submitting jobs

§3 Long Running Operation (LRO) Optimization

Many Fabric APIs return HTTP 202 Accepted with three critical headers:

Location — polling URL (Get Operation State endpoint)
x-ms-operation-id — operation GUID for constructing polling URLs
Retry-After — seconds to wait before first poll

Common Performance Issues:

Issue	Symptom	Fix
Aggressive polling	Hundreds of GET calls, wastes quota	Honor `Retry-After`, use exponential backoff
Ignoring Location header	Building URLs manually, missing result endpoint	Use Location header directly; it transitions from State to Result when complete
Not checking for result	Polling succeeds but result never fetched	After `Succeeded` status, call Get Operation Result
Missing failure handling	Stuck in infinite poll loop	Check for `Failed` and `Skipped` statuses

LRO Status Values: Succeeded, Failed, Skipped, Completed

Run the LRO polling benchmark script to profile your polling efficiency.

See lro-patterns.md for complete polling implementation patterns.

§4 Pagination Tuning

Fabric paginated APIs return continuationToken and continuationUri in response bodies. Performance degrades when consuming large result sets sequentially.

Optimization Strategies:

Use continuationUri directly rather than rebuilding URLs with continuationToken
Process pages concurrently when downstream logic allows
Implement early termination when the target item is found
Cache intermediate results for retry resilience

Template: Use the pagination walker template for efficient enumeration.

§5 Token Acquisition Performance

Slow Entra ID token acquisition adds latency to every API call chain.

Diagnosis:

Measure token acquisition time separately from API call time
Check if tokens are being acquired per-request instead of cached
Verify token lifetime and refresh logic

Optimization:

Technique	Impact
Token caching	Eliminate redundant auth round-trips
MSAL token cache serialization	Persist tokens across process restarts
Certificate-based auth	Faster than client secret for service principals
Reduce scope requests	Request only needed scopes per call

§6 Baseline Benchmarking

Before remediate, establish a performance baseline.

Run the baseline benchmark script to capture:

Token acquisition latency (ms)
Simple GET endpoint response time (ms)
Paginated enumeration throughput (items/sec)
LRO polling round-trip time (ms)

Compare results against expected ranges in the baseline reference.

Quick Reference: HTTP Status Codes

Code	Meaning	Action
200	Success	Process response
201	Created (LRO complete)	Fetch result
202	Accepted (LRO started)	Begin polling via Location header
400	Bad request	Validate request body/parameters
401	Unauthorized	Refresh token, check scopes
403	Forbidden	Verify workspace/item permissions
404	Not found	Confirm workspace/item IDs
429	Throttled	Wait `Retry-After` seconds, then retry
430	Capacity exhausted	Reduce concurrent jobs or scale SKU

remediate

Problem	Likely Cause	Resolution
All calls slow (>2s)	Token not cached	Implement MSAL token caching
Intermittent 429s	Burst pattern	Add rate limiter with token bucket
LRO never completes	Operation failed silently	Check for `Failed` status in poll response
Pagination returns duplicates	Stale continuationToken	Always use fresh `continuationUri` from latest response
430 on Spark submit	Capacity queue full	Check Monitoring Hub, scale SKU, or wait
Token acquisition >3s	Network/DNS issue	Test connectivity to `login.microsoftonline.com`

References

Throttling Deep Dive — retry patterns, jitter, token bucket implementation
LRO Patterns — polling state machines, cancellation, parallel LRO management
Baseline Expectations — expected laten

fabric-rest-api-perf-remediate

Cómo agregar

Pega en el README de tu repo

Skills relacionadas

MoneyPrinterTurbo

weather-svg-creator

telegram-bot-builder

segment-automation

Recibe nuevas skills de Automação todos los lunes