Thursday, 4 June 2026

The AI ROI Problem Nobody Is Talking About

Why Using One LLM for Everything Is Costing Your Business Money.

A few months ago, I was speaking with the head of digital transformation at a manufacturing company. "We've integrated AI into almost everything," he said proudly. Customer support. Procurement. Contract analysis. Email drafting. BOM reviews. Supplier negotiations.

"That's great," I replied. "Which models are you using?" His answer came instantly. "
The same one for everything."  And that's when I realized something. 

The biggest AI problem in 2026 isn't that organizations aren't adopting AI. 
It's that many organizations are adopting AI without understanding that different models are optimized for different kinds of work. 
They're buying a Formula 1 car and using it to drive to the grocery store. 
Can it do the job? Absolutely. Is it the most effective way to do it? Probably not.


The Great LLM Misunderstanding 

When ChatGPT first entered the mainstream, most people viewed AI as a single technology. 
You ask a question. The AI gives an answer. Problem solved. 
But today's AI landscape is very different. · Some models excel at deep reasoning. 
  • Some are optimized for speed and cost efficiency. 
  • Some are exceptional at coding. 
  • Others are designed to work with images, PDFs, audio, and videos. 
  • Some can process extremely large documents and maintain context across hundreds of pages.

Yet many teams continue to treat all LLMs as interchangeable. That's a bit like hiring a world-class surgeon to answer the office phone. 
The surgeon can do it. But that's not where their value lies. 

A Real Procurement Example
Imagine a procurement team receives the following tasks every day: 
  • Categorize supplier emails 
  • Generate RFQs 
  • Compare supplier quotations 
  • Analyze contracts 
  • Extract pricing from invoices 
  • Recommend negotiation strategies 
Many organizations send every one of these tasks to the same premium model. Perhaps GPT-5.5, Or Claude (Since it has gained a huge attention) Gemini 2.5 Pro. 
The outputs are often excellent. But that doesn't automatically make it the most effective architecture. 
A common misconception is that the most powerful model always produces the best business outcome
In reality, the best outcome is usually a combination of: 
  • Accuracy 
  • Speed 
  • Cost 
  • Reliability 
  • Governance 
  • User experience 
The optimal model is not necessarily the smartest model. It's the model that best fits the task.

Think of LLMs Like Employees Imagine your company has four specialists. Each is exceptionally good at something different.

The Strategist 
  • This person thrives on complexity. 
  • They connect dots others miss.
  • They evaluate trade-offs. 
  • They make sense of ambiguity. 
This is where premium reasoning models such as GPT-5.5, Claude Opus, and Gemini 2.5 Pro often shine. 
Scenario: "Should we consolidate suppliers globally or maintain regional sourcing?" That's not a search problem. It's a reasoning problem. 

The Operations Executive 
  • Fast. 
  • Reliable. 
  • Efficient. 
Capable of handling thousands of repetitive tasks consistently. 

This is where models such as GPT-5.5 Mini, Gemini Flash, Claude Sonnet, and smaller Llama variants are often highly effective. Scenario: "Classify 50,000 customer support tickets into categories." For many operational workflows, speed, consistency, and cost efficiency matter more than advanced reasoning. 

The Software Engineer 
This person understands code, data structures, APIs, debugging, and system design. 

Many modern LLMs are highly capable coding assistants, particularly when helping with development workflows. 

Scenario: "Why is my Power BI measure producing incorrect variance calculations?" 
This requires technical understanding, structured reasoning, and precision. A very different skill set from drafting marketing copy. 

The Analyst 
This person can read hundreds of pages without losing concentration. 
  • They identify patterns. 
  • Spot risks. 
  • Summarize information. 
  • Compare alternatives. 
Models with strong long-context capabilities are particularly useful here. Scenario: 
"Review this 150-page supplier contract and identify pricing-related risks." This is where long-context understanding becomes valuable. 

The Hidden Cost of Using the Wrong Model Most discussions about AI costs focus on the price per token. But there's another question worth asking: Are we spending tokens on work that actually requires a premium model? 
Imagine paying a senior consultant to: · Rename files · Sort spreadsheets · Copy data between systems 

The consultant isn't the problem. The assignment is. 

The same principle applies to AI. It's common to find premium models being used for tasks where a smaller, lower-cost model could deliver comparable business value. The issue isn't model quality. It's model selection. 

What Mature AI Teams Are Doing Differently 
Many advanced AI teams are moving toward multi-model strategies. 
Instead of asking one model to do everything, they route different tasks to different models. 

For example: · 
  • Step 1 A lightweight model categorizes incoming supplier emails. 
  • Step 2 A cost-efficient model drafts RFQs. 
  • Step 3 A premium reasoning model develops negotiation recommendations. 
  • Step 4 A long-context model reviews supplier contracts. 
  • Step 5 A multimodal model extracts information from invoices and PDFs. 

When designed carefully, these workflows can reduce costs, improve throughput, and maintain the quality required for the business process. The goal isn't to use more models. The goal is to use the right model at the right stage. 

The Question Nobody Should Be Asking 

Whenever someone asks: "Which LLM is the best?" They're usually asking the wrong question. Nobody asks: "Which vehicle is the best?" · 
  • A truck is better than a sports car for moving furniture. 
  • A motorcycle is better than a truck for navigating city traffic. 
  • An airplane is better than both for crossing continents. 

The answer depends on the job. AI is no different. 
The better question is: "Which LLM is best suited for this specific task?" That single shift in thinking changes everything. 
It improves efficiency. It reduces unnecessary costs. It creates better user experiences. And it helps organizations extract more value from their AI investments. 

Synopsis 
The Future Belongs to AI Architects 
Over the next few years, every organization will have access to powerful AI models. 
Access will no longer be the differentiator. 
Judgment will. 
The most successful organizations won't be the ones using the biggest models everywhere. They'll be the ones that understand:

  • When to use a premium reasoning model 
  • When to use a lightweight operational model 

  • When to use a coding-focused model 
  • When to use a multimodal model 
  • And when not to use an LLM at all 

Because AI success isn't about finding one model that does everything. It's about designing systems that use the right model for the right job. 
The future of AI belongs not just to model builders, but to AI architects, the people who know how to orchestrate intelligence effectively. And in that future, success won't be measured by how many tokens you consume. It will be measured by how much business value you create from every token you spend.

No comments:

Post a Comment

The AI ROI Problem Nobody Is Talking About

Why Using One LLM for Everything Is Costing Your Business Money . A few months ago, I was speaking with the head of digital transformation a...