GPT 5.3 Codex
OpenAI, United States
182/191 correct
Last updated Mar 13, 2026
GPT 5.3 Codex by OpenAI achieved an overall score of 94.4% on Appwrite Arena, ranking #2 of 10 benchmarked models. The benchmark tests how well AI models understand Appwrite - the open-source backend platform for authentication, databases, storage, functions, and more. This model answered 182 of 191 questions correctly across categories including Auth, Databases, Functions, Storage, and CLI. During the benchmark it consumed 840,349 tokens costing $3.2731, averaging 49.2 tokens per second over 49m 51s. Compare GPT 5.3 Codex with other LLMs on the Appwrite Arena leaderboard.
Total tokens: 840,349 (Input: 693,211, Output: 147,138)
Total duration: 49m 51s
Avg speed: 49.2 tok/s
Total cost: $3.2731

| Cost/1M | Overall | Foundation | Auth | Databases | Functions | Storage | Sites | Messaging | Realtime | CLI |
|---|---|---|---|---|---|---|---|---|---|---|
| Input: $1.75/1M tokens, Output: $14.00/1M tokens | 94.4% | 94.2% | 99.2% | 83.9% | 93.2% | 94.3% | 99% | 99.7% | 96% | 92.1% |
Deterministic: 17/17 correct
AI-Judged: 62% average score
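
The totals reported above can be cross-checked from the token counts and the per-1M prices. The sketch below is a sanity check, not the benchmark's own accounting: it assumes that cost is linear in the listed input and output prices and that the average speed is output tokens divided by total duration, neither of which the page states. The function names are hypothetical.

```python
# Figures copied from the page; the formulas are assumptions.
INPUT_TOKENS = 693_211
OUTPUT_TOKENS = 147_138
INPUT_PRICE_PER_M = 1.75    # USD per 1M input tokens
OUTPUT_PRICE_PER_M = 14.00  # USD per 1M output tokens
DURATION_SECONDS = 49 * 60 + 51  # 49m 51s


def estimated_cost(input_tokens: int, output_tokens: int,
                   input_price: float, output_price: float) -> float:
    """Cost in USD, assuming simple per-token pricing with no discounts."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def output_speed(output_tokens: int, seconds: int) -> float:
    """Average output tokens per second over the whole run (assumed definition)."""
    return output_tokens / seconds


total_tokens = INPUT_TOKENS + OUTPUT_TOKENS
cost = estimated_cost(INPUT_TOKENS, OUTPUT_TOKENS,
                      INPUT_PRICE_PER_M, OUTPUT_PRICE_PER_M)
speed = output_speed(OUTPUT_TOKENS, DURATION_SECONDS)

print(f"total tokens: {total_tokens:,}")
print(f"estimated cost: ${cost:.4f}")
print(f"output speed: {speed:.1f} tok/s")
```

Under these assumptions the computed values agree with the reported 840,349 tokens, $3.2731, and 49.2 tok/s, which suggests the speed figure counts output tokens only.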