Google Gemini 2.5 Fl
Google Gemini 2.5 Flash 模型提供免费 API 调用额度,每分钟最多1500次请求,适合开发者和中小应用集成,中国大陆可通过代理或 Google Cloud 端点访问。
AI DEAL COLLECTION
OpenAI free API searches, API key access, alternatives, and low-cost compatible routes.
OpenAI free API searches, API key access, alternatives, and low-cost compatible routes. It is useful for developers, indie hackers, and AI tool users who want to compare free credits, limits, and alternative routes quickly.
yangmao.ai refreshes free tiers, expiration dates, claim requirements, and accessibility signals through automated pipelines plus manual checks. Always verify the final claim page before use.
Check the same page for alternative providers, OpenAI-compatible APIs, China-friendly access, or evergreen free tiers instead of relying on one vendor.
Google Gemini 2.5 Flash 模型提供免费 API 调用额度,每分钟最多1500次请求,适合开发者和中小应用集成,中国大陆可通过代理或 Google Cloud 端点访问。
Groq 提供基于 LPU 推理引擎的免费 API,支持 Mixtral 8x7B 等模型,每日1440次请求限制,响应速度极快,中国大陆可通过代理访问。
Mistral AI 的 Le Chat 聊天机器人提供完全免费的无限对话额度,支持多语言和代码生成,无需绑定信用卡,中国大陆可直接访问网页版。
OpenAI announces significant price reduction for GPT-4.1 API, with input price dropping to $2 per million tokens and output to $8 per million tokens, approximately 26% cheaper than GPT-4o, offering developers more cost-effective AI capabilities.
OpenAI announces significant price reduction for GPT-4.1 API, with input price dropping to $2 per million tokens and output to $8 per million tokens, representing a 26%-50% decrease compared to GPT-4o.
OpenAI announces GPT-4.1 API price drop, with input price reduced to $2 per million tokens and output price reduced to $8 per million tokens, offering better value than GPT-4o.
OpenAI released the GPT-4.1 series, including GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano, supporting up to 1 million token context windows with significantly reduced API pricing, offering developers more powerful and cost-effective AI capabilities.
OpenAI released the GPT-4.1 series, including GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano, supporting up to 1 million token context windows with significantly reduced API pricing, offering developers more powerful and cost-effective AI capabilities.
OpenAI officially released the GPT-4.1 series, including GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano, supporting up to 1 million token context windows, with API prices lower than GPT-4o, offering developers more powerful and cost-effective AI capabilities.
Anthropic has released Claude Security in public beta, an AI-powered security tool that automatically scans codebases, validates its own findings, and proposes fixes. It is free for all users during the beta period with no additional cost. The tool aims to help development teams identify and fix security vulnerabilities early in the development lifecycle.
百度千帆平台为注册用户提供每月 100 万 Token 的免费 API 额度,支持 ERNIE 系列模型,中国大陆直接访问,适合个人开发者和学生。
百度千帆平台为新用户提供 ERNIE-Bot、ERNIE-3.5 等模型免费调用额度,每月基础免费额度充足,中国大陆直接使用,支持 SDK 和 REST API。
百度千帆平台近期调整免费政策,ERNIE-Bot、ERNIE-Bot-Turbo 等模型每日免费调用次数提升至 1000 次,注册即享,无需绑定银行卡,中国大陆开发者友好。
Cerebras uses proprietary WSE chips for the world's fastest inference (2000+ tokens/s, 20x faster than GPU). Free tier: 1M tokens/day, 30 RPM, no credit card. Models: Llama 3.3 70B, Llama 3.1 8B, Qwen 3.5, and more. OpenAI-compatible API. Best for latency-sensitive use cases: real-time chat, streaming, Agent tool calls. Competes with Groq on speed, but with a larger daily token budget.
ChatGPT (OpenAI) has a recorded free tier: Limited requests/day. Good for testing before upgrading.
ChatGPT Go introduces localized pricing in five Southeast Asian markets (Thailand, Indonesia, Vietnam, Philippines, Malaysia), offering lower prices than USD pricing, but Singapore is notably excluded. Claude's pricing in Asian markets stands out as an outlier. This update reflects OpenAI's pricing strategy adjustment in Asia to improve regional competitiveness.
ChatGPT (OpenAI) is recorded as supporting OpenAI-compatible API access. Free/trial info: $0. Useful for low-cost testing by swapping SDK base_url.
Malta's AI for All program treats ChatGPT Plus as a national AI literacy benefit: complete the course first, then receive one year of Plus. This is country-limited, not a general loophole, but an important AI public-benefit signal.
A developer built an "AI World" prototype using Claude paid version two months ago, and now Emergence AI has launched a nearly identical product. This tool allows users to create and explore AI-driven virtual worlds for free, without needing a Claude subscription. It's a great free alternative for users who want to experience AI world building without paying.
A developer has created a free file that aims to fix how Claude behaves in chat, and is currently recruiting testers on Reddit. The file may optimize Claude's response quality by adjusting prompts or configurations. Users can obtain and try the file for free, but are expected to provide feedback to help improve it.
A developer shared four free tips for using Claude Code when building iOS/macOS apps. These tips cover code generation, debugging optimization, project structure suggestions, and more, helping users leverage Claude Code more efficiently for Apple platform development. All tips require no additional payment and are suitable for Claude users to learn from.
A community developer has released a free toolkit for Claude Code, significantly expanding its capabilities. The toolkit includes 50 predefined skills, 7 specialized agents, 11 slash commands, and auto-formatting hooks covering full-stack engineering scenarios including frontend, backend, database, and DevOps. Users can download and use it for free, greatly enhancing development productivity.
A developer built a free local MCP server that significantly optimizes Claude Code's PR review process. The tool reduces token consumption per PR review from 63K to 8.7K, drastically lowering usage costs. Users need to set up the local server and integrate it into their Claude Code workflow. This solution is ideal for developers who frequently use Claude Code for code reviews.
On May 6, 2026, Claude released a status update fixing connection failures for users whose organizations restrict GitHub access by IP address. This issue affected enterprise or organizational users with IP whitelist restrictions on GitHub. Claude has deployed a fix, and all affected users can now access the service normally. This update ensures users can continue using Claude without changing their network configuration.
A Reddit user shared a free photo-culling tool built with Claude that automates the process of culling, deduplicating, and ranking photos. The tool efficiently reduces 8,000 trip photos down to the best 50, saving significant manual sorting time. It is currently completely free to use, requiring only Claude API access.
Reddit community users are compiling a hidden tips guide for Claude free tier users, focusing on advanced usage of Artifacts and Projects. These tips help users get a better experience within the free quota, including prompt optimization and using project features to manage conversation history. The guide is community-driven and continuously updated.
A developer shares the complete process of building 62 free tools in one month using Claude's free tier, leveraging the Ralph Wiggum Loop and a shell script. The tutorial details automated prompt engineering and tool generation methods, significantly boosting the efficiency of Claude's free tier usage. Ideal for users looking to explore AI tool development at low cost.
According to Reddit community discussions, Claude AI currently offers a free trial option, allowing new users to experience basic chat functionality after registration. Specific free quotas and usage limits may vary by region and time; users are advised to check the official page for the latest information. This trial is suitable for users who want to get an initial understanding of Claude's capabilities.
This Reddit trend points to a free online session around OpenSpec and Claude Code on "Spec-Driven Prototyping." The session is expected to show how to combine OpenSpec specifications with Claude Code to rapidly build prototypes. Because the source is a community-event signal, it is recorded as a Claude Code ecosystem learning resource and does not change Anthropic's official free tier or pricing.
Claudex is a free open-source CLI tool built by a community developer, designed to emulate Claude Code-style workflows. Users can try it without any subscription fee, requiring only a Claude API key to run. It is suitable for developers exploring Claude's coding assistance capabilities and supports customizable workflows.
Cloudflare Workers $5/mo plan includes Workers AI with 10,000 free AI calls per day (measured in neurons), permanently valid. 50+ open-source models: - LLM: Llama 3.1 8B, Llama 3.3 70B, Gemma, Mistral 7B, Phi-2 - Image generation: Stable Diffusion XL (completely free!) - Embeddings: BGE Base/Large (for RAG and semantic search) - Speech-to-text: Whisper Highlights: - Permanently valid, never expires - Inference on 300+ global edge nodes, ultra-low latency - Direct China access, no proxy needed - OpenAI-compatible via AI Gateway - Pay-as-you-go after free quota, no hard cutoff - If you already use Cloudflare Workers, this is essentially free Ideal for lightweight AI: blog writing, content tagging, summarization, embeddings, product image generation.
Cohere offers a free Trial API Key with 1,000 calls/month across all models: - Command R+: top RAG and chat model - Rerank: document reranking for RAG pipelines - Embed: multilingual text embeddings No credit card required, resets monthly. Great for prototyping RAG projects. Note: Trial Key is not permitted for production use.
DAAF (Data Analyst Augmentation Framework) version 2.1.0 has been released, fully free and open source. The framework aims to provide the safest and easiest way to use Claude Code for data analysis and processing. The new version brings significant improvements in usability, safety, and analytical rigor, suitable for data scientists, analysts, and developers.
Domestic open-source large language model downloads have exceeded 10 billion, marking the vigorous growth of China's open-source AI ecosystem. These models include open-source versions released by multiple well-known vendors and institutions, covering different parameter scales from lightweight to large. Users can download model weights for free for academic research, commercial applications, or further development. This milestone reflects the widespread recognition and adoption of domestic AI technology by the open-source community.
Fireworks AI 提供每日 100 万 token 免费额度,支持 Llama 3、Mixtral、Gemma 等主流开源模型。API 兼容 OpenAI 格式,中国大陆可直连,适合原型开发和轻量应用。
提供高速推理 API,支持 Llama、Qwen 等开源模型。新用户有每日免费的 token 额度,适用于开发和测试。
FreeModel is the lower-friction option in this GPT-5.5 free trial batch: no card, quick signup, and useful for light testing. The key unknowns are whether the model is truly native GPT-5.5, whether the weekly quota resets reliably, and long-term service stability.
The Gemini API free tier is suitable for developers, small projects, and prototypes. Actual free rate limits vary by model, project, and billing tier, so users should confirm current limits in AI Studio.
GitHub Copilot Free is the official free tier for AI coding tools: 2,000 completions and 50 agent/chat requests per month, with no credit card required according to GitHub’s pricing page.
Google 最新 Gemini 2.5 Pro 模型提供免费 API 层,每分钟最多2次请求,无需付费即可体验长上下文推理能力,适合开发测试和小型应用。
Gemini 1.5 Flash 和 Gemini 1.5 Pro 模型免费层,每分钟 60 次请求,无需付费即可使用,中国大陆开发者可通过代理访问。
Google Gemini API 提供永久免费套餐,支持 Gemini 1.5 Flash 和 Gemini 1.5 Pro 模型,每分钟最多 60 次请求,无每日 token 上限,适合个人开发者和学习使用。中国大陆需科学上网。
Google Gemini API 提供免费层,支持 Gemini 1.5 Pro 和 Flash 模型,每分钟最多 60 次请求,无需付费即可使用多模态能力,中国大陆需代理访问。
Google Gemini API 提供免费层级,每分钟最多60次请求,支持 Gemini 1.5 Flash 和 Gemini 1.5 Pro 模型,中国大陆开发者可通过代理或直接访问(部分地区可用)。无需绑定信用卡即可开始使用。
Google has announced the shutdown of its free search index, meaning AI applications and developers relying on web search can no longer access real-time search results for free. Traffic defense services like Cloudflare are also intensifying blocking of AI crawlers, further complicating web search. Users need to seek alternatives such as Bing API, DuckDuckGo, or self-built crawlers, though costs and technical barriers may increase.
GPT free users have recently noticed that different users are receiving varying free benefits. Some users get higher daily message limits, while others gain priority access to new models or features. This change appears to be rolling out gradually, possibly based on user activity, account history, or geographic location. OpenAI has not yet officially announced the specific rules, but community discussions are active.
Groq 提供基于 LPU 推理引擎的免费 API,支持 Llama 3、Mixtral 等模型,每日 1440 次请求限制,速度极快。需海外邮箱注册,中国大陆可访问但需翻墙。
Groq 提供每日100万Token免费API调用额度,基于其自研LPU芯片实现极速推理(支持Llama 3、Mixtral等模型)。注册需海外邮箱,但API中国大陆可直连,适合低延迟场景。
Groq 提供基于 LPU 推理引擎的免费 API,支持 Llama 3、Mixtral 等模型,每天最多 1440 次请求,中国大陆可直连,适合低延迟推理测试。
Groq 提供完全免费的 API 访问,支持 Llama 3、Mixtral 等开源模型,速率限制为 30 次/分钟,无总量上限。中国大陆用户需自行解决网络访问问题,注册无需信用卡。
Groq is one of today's most useful free inference deals: the free tier lets developers test Llama, Mixtral, Gemma and other models through an OpenAI-compatible API. It is best for AI agents, RAG summarization, and low-latency chat prototypes. China access may require additional verification or a relay.
Groq 提供免费 API 额度,支持 Llama 3、Mixtral 等开源模型,推理速度极快,每日有限免费调用次数,注册即用,中国大陆需科学上网。
Groq uses custom LPU (Language Processing Unit) chips for the fastest AI inference in the industry. Free models: - Llama 3.3 70B Versatile — 6000 TPM / 30 RPM - Llama 4 Scout 17B — 6000 TPM / 30 RPM - Llama 4 Maverick 17B — 6000 TPM / 30 RPM - Mixtral 8x7B — 5000 TPM / 30 RPM - Gemma 2 9B — 15000 TPM / 30 RPM - DeepSeek R1 Distill Llama 70B — 6000 TPM / 30 RPM Highlights: - 10x+ faster than GPU solutions, Llama 3.3 70B reaches 300+ tokens/sec - API keys start with gsk_, OpenAI-compatible - No total cap, rate-limited only - Requires proxy from China (use openllmapi.com)
Groq 将免费套餐的每日 API 请求上限从 500 次提升至 1000 次,支持 Llama 3、Mixtral 等开源模型,中国大陆开发者可直接通过 API 调用,无需绑定信用卡。
Groq uses proprietary LPU (Language Processing Unit) chips for the world's fastest AI inference. Free tier requires no credit card. Free tier details: - Llama 3.3 70B: 30 RPM, 6000 tokens/min, 14400 requests/day - Llama 3.1 8B: 30 RPM, 20000 tokens/min - Gemma 2 9B: 30 RPM, 15000 tokens/min - Mixtral 8x7B: 30 RPM, 5000 tokens/min - Llama 4 Scout/Maverick (newly added) Why Groq is so fast: - Custom LPU chip designed specifically for LLM inference - Deterministic execution, no GPU memory bandwidth bottleneck - Llama 3.3 70B output at 300+ tokens/s (GPU typically 30-50 tokens/s) - Ultra-low time-to-first-token, ideal for real-time chat and streaming Best for: - Real-time AI chat (speed is the core experience) - Agent tool calls (low latency = faster multi-step reasoning) - Streaming output (buttery smooth typewriter effect) - Rapid prototyping China accessible. OpenAI-compatible API, base_url is https://api.groq.com/openai/v1.
Groq 于2026年4月底上线Mixtral 8x7B免费推理服务,每日500次请求,无需信用卡,API兼容OpenAI格式,中国大陆开发者可直接调用。
Groq 提供 Mixtral 8x7B 等模型的免费 API 访问,速率限制为每分钟30次请求,适合快速原型开发。中国大陆需通过代理访问。
Groq 提供基于 LPU 的高速推理服务,Mixtral 8x7B 模型每日免费额度高达100万token,注册即用,中国大陆可直接访问 API。
Hugging Face 提供 Inference API 免费套餐,每月 3 万次调用,支持数千个开源模型(文本、图像、音频等),中国大陆可访问但速度较慢,适合学习和实验。
Hugging Face 提供免费推理 API,可调用数千个社区模型(包括文本、图像、音频等),中国大陆可直接访问,无需付费。
Mistral AI 于2026年4月更新免费政策,Le Chat 平台每月提供100万token免费额度,支持Mistral Large 2模型,中国大陆可直连。
Mistral AI 的 Le Chat 聊天应用提供免费无限对话,支持 Mistral Large 等模型,中国大陆可直接访问网页版,无需注册即可使用基础功能。
Mistral AI’s official free API entry point is the Experiment plan: free for evaluation and prototyping, with limited rate limits; production or higher usage requires the Scale plan.
Mistral AI 提供免费开发者计划,每月 50 万 token 的 API 调用额度,支持 Mistral Large、Mistral Small 等模型,中国大陆需科学上网。
Mistral AI’s official free API entry point is the Experiment plan: free for evaluation and prototyping, with limited rate limits; production or higher usage requires the Scale plan.
Mistral AI 的 Le Chat 平台提供免费层,支持无限次对话、文件上传(图像、PDF、Word、Excel)和网络搜索,无需付费。中国大陆可直接访问网页版。
Mistral AI 推出的 Le Chat 聊天助手提供每日100次免费对话额度,使用自家 Mistral Large 模型,支持中文。可通过网页或 API 使用,注册即享,无需付费。中国大陆可正常访问。
This open source tool is designed for AI agents to perform budget checks before API calls, preventing high bills from infinite loops or misconfigurations. It gained 560 downloads within 3 days of release, indicating strong developer demand for such protection. The tool is completely free and open source, suitable for any team using AI agents.
OpenAI released a new version of Agents SDK with MCP integration and web search tool, completely free and open source for developers.
OpenAI released Codex CLI, an open-source command-line coding tool that enables AI-assisted coding directly in the terminal, completely free to use.
The OpenAI Codex Enterprise Promo is an official limited-time application entry for enterprises adding net-new Codex users. The official page confirms that new Codex users on eligible enterprise accounts can request two months of free Codex usage; eligibility, routing, and approval remain subject to OpenAI's review.
OpenAI Codex for Open Source is an official application program for OSS maintainers. The key confirmed benefits are six months of ChatGPT Pro with Codex, API credits, and conditional Codex Security access; all benefits remain subject to OpenAI review and the Program Terms.
OpenAI Codex for Students is an official OpenAI Developers student offer: verified U.S. and Canadian university students can claim $100 in ChatGPT credits (shown as about 2,500 credits) for Codex, expiring 12 months after the grant date. These are not API credits and the offer is not global student access.
OpenAI announces DeployCo, a new enterprise service designed to help organizations deeply embed AI capabilities into their business operations. DeployCo provides end-to-end deployment support including model customization, safety compliance, performance optimization, and continuous monitoring, enabling businesses to leverage OpenAI's advanced models more effectively. The service targets enterprises requiring large-scale AI deployment, especially in high-compliance industries such as finance, healthcare, and legal. The DeployCo team works closely with clients from proof-of-concept to production deployment, offering expert guidance throughout the process. DeployCo is now accepting enterprise inquiries, with pricing customized based on client needs. This move marks OpenAI's strategic shift from providing API services to offering complete enterprise AI solutions.
OpenAI announces the launch of DeployCo, a new company focused on helping enterprises build deployment solutions around artificial intelligence. DeployCo aims to provide end-to-end support from model selection to production deployment, including customized model fine-tuning, security and compliance consulting, and continuous optimization services. This service is primarily targeted at large enterprise customers, helping them deeply integrate AI capabilities into their business systems.
OpenAI has officially launched DeployCo, a new service designed for enterprises to help them build and deploy applications around AI intelligence. DeployCo provides end-to-end deployment solutions, including model integration, performance optimization, and operational support. The service may include a free trial tier or initial usage credits to lower the barrier for enterprise adoption. Specific pricing and free tier details have not been fully disclosed, but this marks a significant expansion of OpenAI's enterprise offerings.
OpenAI announces DeployCo, a new service designed to help enterprises deeply integrate AI capabilities into their operations. DeployCo offers customized deployment solutions, ongoing optimization support, and industry-specific solutions, enabling businesses to build core operations around intelligence. The service targets enterprise customers needing large-scale, secure, and efficient AI deployment.
OpenAI API has a recorded free trial: $5; rate limit: 3 RPM (free tier).
OpenAI has a recorded free tier: ChatGPT free tier unlimited. Good for testing before upgrading.
OpenAI launches new GPT-4.1 API features including controlled generation, improved structured outputs, enhanced image understanding, and code execution support, providing developers with more powerful model capabilities.
OpenAI announced a significant price reduction for GPT-4.1 API, with input price dropping to $2 per million tokens and output price to $8 per million tokens, approximately 26% cheaper than GPT-4o, greatly reducing developer costs.
OpenAI launches GPT-4.1 series API, approximately 26% cheaper than GPT-4o, with input at $2/M tokens and output at $8/M tokens. GPT-4.1 mini and nano are even more affordable for various use cases.
OpenAI announces significant price reduction for GPT-4.1 API, with input price dropping to $2 per million tokens and output to $8 per million tokens, approximately 50% lower than GPT-4o, greatly reducing developer costs.
OpenAI announced a significant price reduction for GPT-4.1 API, with input price dropping to $2 per million tokens and output price to $8 per million tokens, 26% cheaper than GPT-4o, offering developers more cost-effective AI capabilities.
OpenAI announces GPT-4.1 API price reduction, with input prices 26% lower and output prices 50% lower than GPT-4o; GPT-4.1 mini and nano are even cheaper.
OpenAI announces significant price reduction for GPT-4.1 API, with input price dropping to $2/M tokens and output to $8/M tokens, 26% cheaper than GPT-4o, greatly reducing developer costs.
OpenAI announces significant price reduction for GPT-4.1 API, with input price dropping to $2 per million tokens and output to $8 per million tokens, offering better value than GPT-4o for large-scale inference and generation tasks.
OpenAI announces price reduction for GPT-4.1 API series, with input price dropping to $2 per million tokens and output to $8 per million tokens, offering better value than GPT-4o.
OpenAI announces a significant price cut for GPT-4.1 API, with input price reduced to $2/M tokens and output to $8/M tokens, offering better value than GPT-4o for large-scale API usage.
OpenAI announced a significant price reduction for the GPT-4.1 API, with input prices dropping to $2 per million tokens and output prices to $8 per million tokens, about 50% cheaper than GPT-4o, greatly reducing developer costs.
OpenAI announces significant price reduction for GPT-4.1 API, with input at $2/M tokens and output at $8/M tokens, 26%-50% cheaper than GPT-4o, offering developers more cost-effective AI capabilities.
OpenAI launches GPT-4.1 API series with significant price reduction compared to GPT-4o. GPT-4.1 nano input is only $0.1/1M tokens, output $0.4/1M tokens, ideal for cost-effective AI applications.
GPT-4.1 input $2/M tokens, output $8/M tokens, ~26% cheaper than GPT-4o.
OpenAI announced a significant price drop for GPT-4.1 API, with input price reduced to $2/1M tokens and output to $8/1M tokens, offering better value than GPT-4o.
OpenAI announces significant price reduction for GPT-4.1 API, with input price dropping to $2 per million tokens and output to $8 per million tokens, representing a 26%-50% decrease compared to GPT-4o, offering developers more cost-effective AI capabilities.
OpenAI announces significant price reduction for GPT-4.1 API, with input price dropping to $2/M tokens and output to $8/M tokens, approximately 50% cheaper than GPT-4o, offering developers more cost-effective AI capabilities.
OpenAI announces significant price reduction for GPT-4.1 API, with input price dropping to $2 per million tokens and output to $8 per million tokens, 26% cheaper than GPT-4o, greatly reducing developer costs.
OpenAI announces significant price reduction for GPT-4.1 API, with input price dropping to $2 per million tokens and output to $8 per million tokens, approximately 26% cheaper than GPT-4o, offering developers more cost-effective AI capabilities.
OpenAI announces a significant price reduction for the GPT-4.1 API, with input dropping to $2 per million tokens and output to $8 per million tokens, offering a substantial cost saving compared to GPT-4o for AI application development.
OpenAI announces a significant price reduction for GPT-4.1 API, with input price reduced to $2/1M tokens and output to $8/1M tokens, about 50% cheaper than GPT-4o, greatly reducing developer costs.
OpenAI announces significant price reduction for GPT-4.1 API, with input price dropping to $2 per million tokens and output to $8 per million tokens, 26%-50% lower than GPT-4o, greatly reducing developer costs.
OpenAI announces a significant price reduction for GPT-4.1 API, with input price dropping to $2 per million tokens and output to $8 per million tokens, 26% cheaper than GPT-4o, greatly reducing developer costs.
GPT-4.1 adds code completion capability for seamless IDE integration.
GPT-4.1 supports invoking a code execution sandbox via API, enhancing coding and data analysis.
GPT-4.1 series supports built-in code execution, allowing users to run code directly in conversations for programming, data processing, and analysis, boosting development efficiency.
GPT-4.1 series adds code execution and image generation, available for free users.
OpenAI releases GPT-4.1 series, focusing on improved code generation and image understanding, with structured outputs and function calling for developers and advanced users.
OpenAI announced that the GPT-4.1 series models now support calling the code interpreter via API, allowing developers to leverage code execution for programming assistance, data processing, and analysis directly within their applications, significantly enhancing the model's utility in coding and data analysis scenarios.
ChatGPT adds image generation powered by GPT-4.1, supporting iterative editing and text rendering, available to free users.
OpenAI launched GPT-4.1 mini and nano models, with input pricing at $0.4/M tokens and $0.1/M tokens respectively, both supporting 1M token context window.
OpenAI launched GPT-4.1 series, including GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano, with 1M token context and reduced API pricing.
OpenAI officially released the GPT-4.1 series, including GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano. The new series supports up to 1 million token context windows, with significantly reduced API pricing compared to previous generations, offering developers more powerful and cost-effective AI capabilities.
OpenAI released the GPT-4.1 series, including GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano, with 1M token context and reduced API pricing.
OpenAI officially released the GPT-4.1 series, including GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano, all supporting up to 1M token context windows with significantly reduced API pricing compared to previous generations, offering developers more powerful and cost-effective AI capabilities.
OpenAI released the GPT-4.1 series, including GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano, with 1M token context, significant performance improvements, and reduced API pricing starting at $2 per million input tokens.
OpenAI officially releases the GPT-4.1 series, including standard, mini, and nano versions, with significant performance improvements across benchmarks and substantially reduced inference costs, offering developers more efficient and cost-effective AI capabilities.
OpenAI officially releases the GPT-4.1 series, including GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano. The new series offers significant performance improvements at lower prices, suitable for various AI applications.
OpenAI officially released the GPT-4.1 series, including GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano. All models support a 1M token context window, with significant performance improvements in code generation, instruction following, and long-context understanding. API pricing is substantially reduced compared to GPT-4o series, with input prices starting at $2/M tokens and output at $8/M tokens, offering developers better cost-effectiveness.
OpenAI launched GPT-4.1 series, including GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano, with 1M token context and reduced API pricing.
OpenAI released the GPT-4.1 series, including GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano, supporting up to 1M token context window with improved performance and reduced pricing.
OpenAI released GPT-4.1 series models, including GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano, with 1M token context and reduced API pricing.
OpenAI released GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano with 1M token context window and lower API pricing compared to GPT-4o, ideal for long-context and high-throughput applications.
OpenAI 于2026年4月将GPT-4o免费层从每日10次提升至50次,无需绑定支付方式即可使用,支持文本和图像输入。
OpenAI released an updated GPT-4o mini model with improved performance and lower cost.
ChatGPT free users can now access GPT-4o mini with limits, experiencing more powerful AI conversation capabilities.
OpenAI 为 GPT-4o-mini 模型提供免费层,注册后每日可免费调用约100次,适合轻量级应用和测试。中国大陆需通过代理访问。
OpenAI announces a significant price reduction for GPT-4o mini API, with input price dropping to $0.15/M tokens and output to $0.60/M tokens, offering developers a more cost-effective AI service.
OpenAI launches GPT-4o mini, a cost-efficient small model for affordable applications.
新注册用户可获 $5 API 额度,用于体验 o3-mini 模型,有效期30天,支持中国大陆信用卡注册。
OpenAI is recorded as supporting OpenAI-compatible API access. Free/trial info: $5. Useful for low-cost testing by swapping SDK base_url.
新注册用户可获得 $50 免费 API 额度,可用于 Realtime API 及 GPT-4o 等模型,有效期 90 天。
OpenAI has enhanced Structured Outputs for the GPT-4.1 series, improving JSON mode reliability and performance, enabling developers to obtain structured outputs more consistently.
OpenAI free benefits are expanding from individual trials to students, teachers, military cohorts, and country programs. This tracker consolidates eligibility, regions, duration, official paths, and alternatives.
A developer has integrated OpenAI TTS into their AI platform, offering completely free and unlimited voice generation with no paywalls. Users can generate any number of voice outputs without paying. The feature aims to test the actual market demand for free TTS services.
Perplexity Pro 提供1个月免费试用,包含无限次搜索、高级模型(GPT-4、Claude 3等)和文件上传功能。需绑定支付方式,试用结束后自动续费(可取消)。中国大陆可访问,但需科学上网。
Replicate 提供每月 50 次免费推理额度,支持大量开源模型(如 Stable Diffusion、Llama、Whisper),中国大陆需代理访问,适合模型测试和小型项目。
Replit has launched a Free Day of Coding event, offering users one day of free access to its AI-assisted development platform. The platform integrates code generation, auto-completion, and intelligent debugging to help developers build projects faster. This event aims to let more people experience the productivity boost of AI-driven coding.
Runtime, a YC P26-backed project, introduces sandboxed coding agents for teams. The tool allows team members to safely run AI coding agents in isolated sandbox environments, supporting collaboration and code review. A free trial is currently available, making it ideal for development teams exploring AI-assisted coding.
SambaNova Cloud offers the world's only free LLaMA 3.1 405B API access. Core advantages: - LLaMA 3.1 405B (405 billion parameters) completely free — the largest free open-source model - The only platform globally offering free 405B access, bar none - Custom RDU (Reconfigurable Dataflow Unit) chip acceleration, ultra-fast inference - 30 RPM rate limit, no total cap — thousands of calls per day - API keys start with sn-, OpenAI-compatible format Supported models: - LLaMA 3.1 405B (flagship, best for complex reasoning) - Llama 3.3 70B (best value) - DeepSeek R1/V3 (671B MoE) - Qwen 2.5 72B - More models added regularly 405B vs 70B difference: - Significantly better complex reasoning (math, logic, multi-step) - Stronger long-text understanding (128K context) - Higher code generation quality - More precise instruction following Requires proxy from China (use openllmapi.com). Ideal for developers needing large model capabilities on a budget.
SiliconFlow 提供长期免费API额度,每月200万Token调用量,另赠送15元体验金可用于更高性能模型。支持多种开源模型(如Qwen、Llama、ChatGLM等),中国大陆直连,注册即用。
SiliconFlow 提供每日200次免费API调用额度,支持Llama、Qwen、DeepSeek等主流开源模型,中国大陆用户可直接注册使用,无需海外信用卡。
Superset is an integrated development environment (IDE) designed for the agent era, incubated by YC P26. It provides a complete toolchain to help developers build, debug, and deploy AI agent applications. The project is fully free and open source, with anyone able to access the GitHub repository for source code and contributions. As a newly launched product on its first day, Superset aims to lower the barrier to agent development, enabling more developers to get started quickly.
腾讯混元大模型为开发者提供每月 100 万 token 的免费 API 调用额度,支持文本生成、对话等能力,中国大陆开发者可直接使用微信/QQ 登录,无需绑定信用卡。
useknockout is an open-source project offering a free SOTA background removal and super-resolution API as an alternative to remove.bg and Topaz. It is MIT licensed and runs on the Modal platform, allowing users to utilize it within Modal's free tier. Suitable for developers and businesses needing image background removal or super-resolution processing.
Voker, a YC S24-backed startup, launches an analytics platform specifically designed for AI agents. New users can start with a free trial without requiring a credit card. The platform offers real-time monitoring of agent performance, costs, error rates, and latency, helping developers optimize their AI agent deployments. It supports multiple agent frameworks and provides customizable dashboards and alerts.
字节跳动火山引擎提供的豆包大模型 API,新用户通常有一定量的免费 tokens 额度,中国大陆可直接使用且稳定。
Warpdrv is a newly released open-source Llama.cpp launcher designed for daily-driving Qwen 35b and 27b models on Strix Halo and RTX Pro hardware. The project is completely free, and users can obtain the code directly from Reddit or GitHub. It simplifies the local LLM deployment process, suitable for users with compatible hardware for local inference.
Zhipu GLM is a strong free API option for China-based developers today: registration is local-friendly, access is stable, and the API can be used in an OpenAI-compatible style. It is useful for Chinese customer support, knowledge-base QA, content generation, and multimodal experiments.
智谱 AI 为新用户提供 100 万 token 免费额度,可用于 GLM-4 系列模型(含 API 和 Web 端),中国大陆直接注册使用,无需海外支付方式,适合中文场景开发。
智谱 AI 为开发者提供 GLM-4、GLM-3-Turbo 等模型的免费 API 调用额度,每月 100 万 Token,注册即享,支持中国大陆网络直接使用,适合个人开发者和中小企业测试集成。
ChatGPT free users can now use GPT-4o mini, replacing GPT-3.5 for a more powerful free experience.
OpenAI launched GPT-4o mini, smaller and cheaper, with multimodal support
OpenAI released GPT-4o mini with pricing at $0.15/M input tokens and $0.60/M output tokens, 97% cheaper than GPT-4o, significantly reducing API usage costs.
🎁 Free Resource Pack
Free API credits list, AI business case studies, payment stack, risk checklist, and a monetization roadmap.