The cheapest LLM is Ling-2.6-flash at $0.010 per million input tokens. Granite 4.0 Micro ($0.017) and Llama 3.1 8B Instruct ($0.020) round out the top three.
AI models ranked by input token price. The most affordable large language model APIs, from budget open-weight models to discounted frontier models.
The cheapest LLM is Ling-2.6-flash at $0.010 per million input tokens. Granite 4.0 Micro ($0.017) and Llama 3.1 8B Instruct ($0.020) round out the top three.
Granite 4.0 Micro ($0.017) is the closest alternative on this metric, followed by Llama 3.1 8B Instruct ($0.020). See the full ranking above for the tradeoffs.