yangmao.ai · Python setup money page

Cloudflare Workers AI Python API Setup

Use this page when you need a working Python starting point for Cloudflare Workers AI, then validate quota and model names in the official console before production.

Quick verdict

  • Free API: 每天 10000 神经元(永久有效)
  • Rate limits: 10000 requests/day
  • Best model starting point: @cf/meta/llama-3.1-8b-instruct
  • Mainland China access: direct or relatively friendly

Provider fit matrix

Best fit Image generation prototypes, creative automation, and media workflows
Watch out Image/video credits can burn faster than chat tokens; test with small batches first
Production fallback Keep at least one compatible backup provider before shipping

Cloudflare Workers AI buyer intent notes

Who should care

Best for edge AI experiments, serverless inference, Cloudflare-native apps, and developers who want low-ops deployment near users.

Decision trigger

Use Workers AI when your app already lives on Cloudflare and latency/ops simplicity beat maximum model choice.

Watch out: Check model list, account limits, binding configuration, and regional latency before routing all inference to the edge.

Production readiness checklist

Quota gate Start inside 每天 10000 神经元(永久有效); log usage before adding retries or batch jobs.
No-card check Try the free path first, then confirm whether billing is required for API keys, higher RPM, or production endpoints.
Regional smoke test Still run one request from your deployment region and from mainland China if users are there.
Source freshness Snapshot date: 2026-06-16; official quota and pricing can change without notice.

Python setup snapshot

Start with the smallest possible embeddings request, then move the key to your server-side secret manager before production.

# 方式一:通过 AI Gateway(OpenAI 兼容)
from openai import OpenAI

client = OpenAI(
    api_key="your-cloudflare-api-token",
    base_url="https://gateway.ai.cloudflare.com/v1/{account_id}/{gateway_id}"
)

response = client.chat.completions.create(
    model="@cf/meta/llama-3.1-8b-instruct",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)

# 方式二:REST API
import requests

response = requests.post(
    f"https://api.cloudflare.com/client/v4/accounts/{account_id}/ai/run/@cf/meta/llama-3.1-8b-instruct",
    headers={"Authorization": "Bearer your-api-token"},
    json={"messages": [{"role": "user", "content": "Hello!"}]}
)
print(response.json())

Free API and pricing notes

每天 10000 神经元(永久有效)

Cloudflare Workers $5/mo plan includes Workers AI with 10,000 free neurons/day. 50+ open-source models available including LLM, image generation (SD XL), embeddings, speech-to-text. OpenAI-compatible API via AI Gateway, or direct Workers AI binding. Inference runs on Cloudflare's global edge network with ultra-low latency. Direct China access.

Access and production risk

Mainland China friendly / direct path likely

Direct access from China via Cloudflare edge network, low latency. Workers AI accelerated by global CDN.

How to set it up

1

Create or locate your provider API key in the official dashboard.

2

Install the OpenAI-compatible Python SDK or the provider-supported SDK.

3

Set the API key in an environment variable instead of hard-coding secrets.

4

Run a small Cloudflare Workers AI embeddings request with @cf/meta/llama-3.1-8b-instruct.

5

Watch free credits, RPM/TPM limits, response shape, and error messages before scaling.

Cloudflare Workers AI production validation table

Use this table before sending real users, scheduled agents, or paid traffic to Cloudflare Workers AI. The goal is to validate source freshness, quota behavior, regional access, and fallback needs instead of trusting a stale free-credit claim.

Check Pass condition If it fails
Signup and billing state Key creation works and the account can spend the recorded 每天 10000 神经元(永久有效). Compare Cloudflare Workers AI alternatives or route through a gateway before inviting users.
First request from target region A minimal request succeeds from your deployment region and mainland-China test point if relevant. Do not ship cron jobs or public demos until latency, DNS, TLS, and auth are repeatable.
Quota, retry, and error shape Rate-limit behavior matches the current 10000 requests/day note or official dashboard values. Cap retries, add request logging, and keep a second route for 429/5xx bursts.
Cost per accepted task Real prompts stay within your target token, query, image-credit, or compute budget. Use cheaper primary routes, caching, shorter prompts, or fallback only after validation failure.

Credit-change alerts

Want to know when free credits, pricing, or availability changes? Subscribe first, then compare official providers, API gateways, and alternatives.

Subscribe → Get an OpenLLMAPI key → Compare API gateways →

Related internal links

Source snapshot

Data source: yangmao.ai provider YAML tracker plus provider docs reviewed by the daily crawler. Official dashboards can change quota and pricing without notice; verify before production.

yangmao.ai provider id
cloudflare-workers-ai
Official source
https://ai.cloudflare.com
Last updated
2026-06-16
Free tier
10,000 free requests/day
API credits
每天 10000 神经元(永久有效)
Rate limit
10000 requests/day
Access note
Direct access from China via Cloudflare edge network, low latency. Workers AI accelerated by global CDN.

FAQ

Does Cloudflare Workers AI have a free API?

Yes. Current yangmao.ai record: 每天 10000 神经元(永久有效). Rate limit note: 10000 requests/day.

Is Cloudflare Workers AI OpenAI-compatible?

The recorded setup uses an OpenAI-compatible pattern or SDK-style call. Validate the latest base URL and model names in Cloudflare Workers AI docs.

Can I use Cloudflare Workers AI from mainland China?

Cloudflare Workers AI is marked as relatively direct or Mainland-China-friendly in the current tracker.

What should I do when Cloudflare Workers AI credits run out?

Compare the alternatives below, check /en/free-ai-api/, and shortlist official providers or API gateway options before production.

Is Workers AI a replacement for OpenAI?

It is better viewed as an edge/serverless inference option with a curated model list, not a complete replacement for every frontier API.

🎁 Free Resource Pack

Get the Free AI Startup Toolkit

Free API credits list, AI business case studies, payment stack, risk checklist, and a monetization roadmap.

Get it free →
🐑 AI Assistant