Close Menu

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    Jack Kochanowicz To Undergo Tommy John Surgery

    June 9, 2026

    Resumen semanal de la ley de deportes y entretenimiento de K&C – junio de 2026 #2

    June 9, 2026

    Liberals open to shorter metadata rules but splitting bill ‘not an option’ – National

    June 9, 2026
    Facebook X (Twitter) Instagram
    Select Language
    Facebook X (Twitter) Instagram
    NEWS ON CLICK
    Subscribe
    Tuesday, June 9
    • Home
      • United States
      • Canada
      • Spain
      • Mexico
    • Top Countries
      • Canada
      • Mexico
      • Spain
      • United States
    • Politics
    • Business
    • Entertainment
    • Fashion
    • Health
    • Science
    • Sports
    • Travel
    NEWS ON CLICK
    Home»Science & Technology»US Science & Tech»Can tech companies learn to love cheaper AI models? 
    US Science & Tech

    Can tech companies learn to love cheaper AI models? 

    News DeskBy News DeskJune 9, 2026No Comments3 Mins Read
    Share Facebook Twitter Pinterest Copy Link LinkedIn Tumblr Email VKontakte Telegram
    Can tech companies learn to love cheaper AI models? 
    Share
    Facebook Twitter Pinterest Email Copy Link

    The AI boom has been built on a basic assumption: Bigger models are more powerful, and the most powerful models win. Now, the industry is about to learn what happens if that assumption starts to break.  

    Mounting costs have already pressured users to give smaller and cheaper models a second look. This cost-conscious model-shopping is new and it’s unclear how it will affect the industry, but the impact is likely to be significant. 

    One prediction, laid out best by Coinbase co-founder Brian Armstrong, is that it will result in the vast majority of tasks shifting to cheaper models. 

    “[D]emand for intelligence is near infinite, but 80% of workloads will be running on 99% cheaper models within 12-18 months,” Armstrong wrote on X. “20% of workloads will still run on latest gen models where IQ maxing is important.” 

    It’s hard to overstate what a significant shift it will be for the AI industry if Armstrong’s prediction comes true.  

    Before now, most AI companies have competed on quality, which has meant defaulting to the most advanced available model. If those same jobs can be handled by cheaper models without affecting quality, it would mean a massive shift in the economics of AI. And critically, much of the savings would be coming out of the pockets of the big labs, dealing a financial blow to OpenAI and Anthropic just as they’re heading for their IPOs. 

    It’s a potentially seismic change in the industry, resting on one basic question: Are companies ready to switch to smaller models? 

    Initial tests suggest that, when the system is arranged right, cheaper models could sub in without any sacrifice in quality. In a recent test by the legal AI tool Harvey, the company was able to reduce inference costs by 3x without reducing quality. The test, performed in partnership with the inference platform Fireworks AI, combined Claude Opus and Fireworks’ GLM 5.1, and shifted to Opus for the most intensive tasks. The result was a significantly lower load in terms of server time and overall cost. 

    “Quality comes first, and in legal it always will,” Harvey co-founder Gabe Pereyra told TechCrunch, referring to the AI legal services his startup provides. “However, the definition of quality is evolving from simply using the most powerful model for everything, to using the best model that gets the right answer most efficiently.”

    This trend is often framed in terms of major labs versus Chinese models or open-weight ones, but that misses the bigger point. The real divide isn’t between proprietary and open models; it’s between large models and small ones. You can save money by switching from GPT-5.5 to DeepSeek’s V4 Flash, but switching to GPT-5.4-mini works just as well.  

    There’s an active price war going on between in-house inference from the big labs and independently served open-weight models. For the bigger question of small versus large, it doesn’t really matter which kind of small model wins out.  

    All of this might seem obvious — of course you shouldn’t use more compute than necessary — but it runs counter to the scaling-first approach that has dominated the industry until now. Inspired by the bitter lesson, labs have leaned hard into training the most compute-intensive models possible, pushing the frontier of what AI models can do. With prices heavily subsidized by investors, clients had no reason to choose anything but the most advanced option.

    With token prices rising and subsidies slowing down, users are facing cost pressure for the first time. We don’t know whether the new cost pressure will actually drive enterprise users to smaller models. They could just as easily economize by making fewer calls, using less context, or simply giving up on the least promising deployments. 

    But if it turns out that most deployments can be run just as well on a smaller model, it could put a serious damper on the growing demand for inference — and raise new questions about how to justify the cost of training a frontier model. 

    When you purchase through links in our articles, we may earn a small commission. This doesn’t affect our editorial independence.

    ai models Anthropic harvey OpenAI
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Telegram Copy Link
    News Desk
    • Website

    News Desk is the dedicated editorial force behind News On Click. Comprised of experienced journalists, writers, and editors, our team is united by a shared passion for delivering high-quality, credible news to a global audience.

    Related Posts

    US Science & Tech

    GM’s EVs Will Soon Support More Kinds Of Public Chargers

    June 9, 2026
    US Science & Tech

    GM joins race to build batteries for AI data centers and the grid

    June 9, 2026
    US Science & Tech

    WWDC 2026: Everything announced on Siri AI, iOS 27, Apple Intelligence and more

    June 9, 2026
    US Science & Tech

    Opera’s Latest Android Update Includes A Soccer Hub And A Refreshed Start Page

    June 9, 2026
    US Science & Tech

    Kingdom Hearts IV Gets A Surprise Nintendo Direct Trailer Drop

    June 9, 2026
    US Science & Tech

    Apple’s foldable iPhone could be just around the corner

    June 9, 2026
    Add A Comment
    Leave A Reply Cancel Reply

    Don't Miss

    Jack Kochanowicz To Undergo Tommy John Surgery

    News DeskJune 9, 20260

    Angels right-hander Jack Kochanowicz told members of the media, including Jeff Fletcher of the Orange…

    Resumen semanal de la ley de deportes y entretenimiento de K&C – junio de 2026 #2

    June 9, 2026

    Liberals open to shorter metadata rules but splitting bill ‘not an option’ – National

    June 9, 2026

    Benfica confirm Mourinho to leave for Real Madrid – with ex-Fulham boss Silva hired as replacement

    June 9, 2026
    Tech news by Newsonclick.com
    Top Posts

    Resumen semanal de la ley de deportes y entretenimiento de K&C – junio de 2026 #2

    June 9, 2026

    ‘The Great Divide’ de Noah Kahan pasa su segunda semana en la cima del Billboard 200 – Celebrity Land

    May 10, 2026

    Real Madrid manager Arbeloa on Mbappe’s El Clasico absence

    May 10, 2026

    Meghan Markle Upset Over One ‘SNL’ Joke, Claims Insider

    May 10, 2026
    Stay In Touch
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    • YouTube
    • Vimeo

    Subscribe to Updates

    Get the latest creative news from SmartMag about art & design.

    Editors Picks

    Jack Kochanowicz To Undergo Tommy John Surgery

    June 9, 2026

    Resumen semanal de la ley de deportes y entretenimiento de K&C – junio de 2026 #2

    June 9, 2026

    Liberals open to shorter metadata rules but splitting bill ‘not an option’ – National

    June 9, 2026

    Benfica confirm Mourinho to leave for Real Madrid – with ex-Fulham boss Silva hired as replacement

    June 9, 2026
    About Us

    NewsOnClick.com is your reliable source for timely and accurate news. We are committed to delivering unbiased reporting across politics, sports, entertainment, technology, and more. Our mission is to keep you informed with credible, fact-checked content you can trust.

    We're social. Connect with us:

    Facebook X (Twitter) Instagram Pinterest YouTube
    Latest Posts

    Jack Kochanowicz To Undergo Tommy John Surgery

    June 9, 2026

    Resumen semanal de la ley de deportes y entretenimiento de K&C – junio de 2026 #2

    June 9, 2026

    Liberals open to shorter metadata rules but splitting bill ‘not an option’ – National

    June 9, 2026

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    Facebook X (Twitter) Instagram Pinterest
    • About Us
    • Editorial Policy
    • Privacy Policy
    • Terms and Conditions
    • Disclaimer
    • Advertise
    • Contact Us
    © 2026 Newsonclick.com || Designed & Powered by ❤️ Trustmomentum.com.

    Type above and press Enter to search. Press Esc to cancel.