CrofAI OpenAI-compatible

Aggregator that resells open-source models behind one cheap key.

CrofAI is an inference aggregator: one API key, many open-source models (DeepSeek, GLM, Kimi, Qwen, MiniMax, Gemma). Two of its models are completely free, and the paid ones are positioned as the cheapest places to run their respective open weights.

Strengths

  • One key, many open-source models
  • Two fully free models (Qwen3.5-9B, GLM-4.7-Flash)
  • Standard OpenAI endpoint — works with any compatible client

When to use it

  • Trying many open-source models under one key
  • Exploration before committing to a single provider
  • Quick MVPs and prototypes

Subscription plans

PlanPriceQuotaAvailable
Free / PAYG$0/moPay-per-token, no recurring chargeyes
Hobby$5/mo500 daily requests · access to all modelsyes
Pro$10/mo1,000 daily requests · priority supportyes
Intermediate$20/mo2,500 daily requestsyes
Scale$50/mo7,500 daily requestsyes
Max$100/mo15,000 daily requestsyes
Notes: Subscription tiers add daily request quotas on top of the PAYG endpoint; all tiers see the same model catalogue. See crof.ai/pricing for current rates.

Models tested on CrofAI

Speed numbers below are specific to CrofAI's routing and hardware. The same model may appear on other providers' pages with different throughput.

Model Best tok/sAvg tok/s RunsSuccess Longest output (chars)
kimi-k2.5-lightning614.4557.94100%5,484
qwen3.5-9b174.7148.64100%2,592
qwen3.6-27b167.3137.24100%2,895
qwen3.5-9b-chat158.0122.94100%4,734
kimi-k2.5115.198.04100%5,467
glm-4.7-flash113.281.8450%3,245
glm-5.1108.074.04100%4,600
kimi-k2.6105.873.74100%5,201
glm-5105.184.2450%2,722
gemma-4-31b-it104.963.94100%3,469
greg103.8100.1450%4,380
minimax-m2.5102.873.64100%3,352
glm-4.799.562.54100%3,521
deepseek-v3.293.073.04100%3,275
kimi-k2.6-precision80.764.74100%3,901
glm-5.1-precision77.767.64100%4,217
deepseek-v4-pro70.061.14100%4,699
qwen3.5-397b-a17b52.752.7450%3,018