An OpenAI rival says its new AI model is not only cheaper to run than GPT-4, but it's also more useful

Tom Carter

May 9, 2024 at 8:01 AM·5 min read

Cohere unveils fine-tuned AI model it says outperforms GPT-4 on some tasks.
The model is also cheaper to run, costing up to 15 times less than larger AI systems.
Cohere is betting on cheaper, business-focused AI as it tries to compete with OpenAI and Anthropic.

OpenAI rival Cohere has unveiled an updated AI model it says is more useful and cheaper to run than GPT-4.

The AI startup says it is rolling out the ability to fine-tune its Command R AI model, allowing it to outperform larger models like GPT-4 in some use cases while costing up to fifteen times less to operate.

It raises hopes that smaller, cheaper models might be able to match the larger, more expensive AI systems built by tech giants as concerns grow over the spiraling costs of the AI boom.

"We have found that fine-tuning on data sets with a small model gets really great results," Cohere cofounder Nick Frosst told Business Insider.

"Fine tuning on Command R, when we benchmarked it against the competition, outperforms some models in completely different weight classes and can then do better than them at a tiny fraction of the price," he added.

Cohere said when tested on tasks such as summarizing meetings and analyzing financial and scientific information, the fine-tuned version of Command R was more accurate than GPT-4, GPT-4 Turbo, and Claude Opus, the most advanced model built by Amazon-backed Anthropic.

Cohere performed these tests itself, and found that its fine-tuned Command R model scored 80.2% on accuracy when summarizing meetings, compared to 78.8% for GPT-4 and 77.9% for Claude Opus. Similarly, when analyzing financial data Command R was 6.2% more accurate than GPT-4 and 5.3% more accurate than Claude.

The running cost of the fine-tuned model, known as the inference cost, is also far below GPT-4 and Claude Opus, costing between $2 to $4 per million tokens compared to $30 to $60 for GPT-4.

Cohere said that as Command R, which initially launched in March, is significantly smaller than the likes of GPT-4, it costs much less to run.

Fine-tuning, which sees users tailor the model with specialist data, also reduces the amount of computation required to run the model by making it better at more relevant tasks.

Fine-tuning on the Command R model is available on Cohere's platform from Thursday, with availability on other platforms coming in the near future.

Cohere bets on enterprise

The massive amount of computer power needed to train large AI models like GPT-4 and Meta's Llama has forced many AI companies into a multi-billion-dollar arms race, even as the path to making AI profitable remains elusive.

Mark Zuckerberg told investors that Meta will continue spending "aggressively" on AI, and OpenAI boss Sam Altman said last month that he "doesn't care" if building Artificial General Intelligence — AI with above human-level intelligence — costs $5 billion, $50 billion or $500 billion.

"As long as we can figure out a way to pay the bills, we're making AGI. It's going to be expensive," Altman said to a group of students at Stanford University.

Cohere, which is based in Toronto, has taken a different approach. The company is targeting businesses and enterprise customers, offering smaller AI models specifically tailored to business uses at a fraction of the cost of larger models.

"I think there's a very interesting scientific debate to be had about whether or not large language models alone will scale to AGI — I don't think they will. So I don't think just throwing more money into compute will result in something like AGI," said Frosst.

"Large language models are an incredible technology. I think they can deliver so much more value than they're delivering currently. But only if they're actually put into real business use cases, if they're made at a reasonable price point," he added.

Cohere was valued at over $2.1 billion last year, but the road hasn't been completely smooth. The Information reported in March that despite its lofty valuation, Cohere was generating only $13 million in annualized revenue by the end of last year.

Business Insider understands that annualized revenue had risen to around $35 million by the end of Q1. Frosst said that Cohere's revenue had increased due to the company releasing a steady stream of new models and updates this year.

"It's been a good start to the year for us. I think that is a direct result of us focusing on actually business-ready and real-world solutions rather than lofty science projects," he said.

However, the company still faces a challenge in competing with Big Tech-backed heavyweights like OpenAI and Anthropic.

The picture for AI startups looks less sunny than a year ago, with buzzy firms like Stability AI and Inflection encountering problems in recent months.

Stability conducted layoffs last month as part of an effort to "focus" its operations after CEO Emad Mostaque resigned, following reports that the startup was experiencing financial problems.

Meanwhile, Inflection, which was once valued at $4 billion, lost cofounder Mustafa Suleyman and a chunk of its staff to Microsoft in March.

Cohere is counting on its focus on enterprise and low-cost models to help it carve out a niche in an increasingly competitive AI landscape.

"We're interested in making these models as useful as possible," said Frosst.

"We're interested in a world where every day you use a language model to help you in any of the things you're using a computer with. You don't need AGI for that," he added.

Read the original article on Business Insider

TechCrunch
UK opens office in San Francisco to tackle AI risk
Ahead of the AI safety summit kicking off in Seoul, South Korea later this week, its co-host, the United Kingdom, is expanding its own efforts in the field. The AI Safety Institute, a U.K. body set up in November 2023 with the ambitious goal of assessing and addressing risks in AI platforms, has said it will open a second location in San Francisco. The Bay Area is the home of companies like OpenAI, Anthropic, Google and Meta that are building foundational AI technology.
TechCrunch
OpenAI and Google lay out their competing AI visions
This week had two major events from OpenAI and Google. Hot off OpenAI’s tail, Google’s I/O conference featured a smattering of announcements and integrations for its flagship model, Gemini. This week also saw some major shake-ups at AWS and OpenAI.
TechCrunch
This Week in AI: OpenAI moves away from safety
This week in AI, OpenAI once again dominated the news cycle (despite Google's best efforts) with a product launch, but also, with some palace intrigue. The company unveiled GPT-4o, its most capable generative model yet, and just days later effectively disbanded a team working on the problem of developing controls to prevent "superintelligent" AI systems from going rogue. Reporting -- including ours -- suggests that OpenAI deprioritized the team's safety research in favor of launching new products like the aforementioned GPT-4o, ultimately leading to the resignation of the team's two co-leads, Jan Leike and OpenAI co-founder Ilya Sutskever.
TechCrunch
OpenAI inks deal to train AI on Reddit data
OpenAI has reached a deal with Reddit to use the social news site's data for training AI models. Reddit content will be incorporated into ChatGPT, OpenAI's popular conversational AI, and the companies will work together to bring unspecified new "AI-powered features" to both Reddit users and moderators. OpenAI will also become a Reddit advertising partner.
TechCrunch
Anthropic is expanding to Europe and raising more money
On the heels of OpenAI announcing the latest iteration of its GPT large language model, its biggest rival in generative AI in the U.S. announced an expansion of its own. Anthropic said Monday that Claude, its AI assistant, is now live in Europe with support for "multiple languages," including French, German, Italian and Spanish across Claude.ai, its iOS app and its business plan for teams. The launch comes after Anthropic extended its API to Europe to get developers using and integrating its models.
TechCrunch
Microsoft dodges UK antitrust scrutiny over its Mistral AI stake
Microsoft won't be facing antitrust scrutiny in the U.K. over its recent investment in French AI startup, Mistral AI, with the country's Competition and Markets Authority (CMA) on Friday concluding that the partnership "does not qualify for investigation under the merger provisions of the Enterprise Act 2002." The decision comes three weeks after the CMA revealed a trio of early-stage probes into Amazon and Microsoft's various AI investments and partnerships, including the Redmond-based company's $16 million investment in Mistral AI, an OpenAI rival working on large language models. Shortly after, Microsoft hired the team behind Inflection AI, another OpenAI rival, essentially gutting the startup.
TechCrunch
OpenAI created a team to control 'superintelligent' AI — then let it wither, source says
OpenAI's Superalignment team, responsible for developing ways to govern and steer "superintelligent" AI systems, was promised 20% of the company's compute resources, according to a person from that team. Leike went public with some reasons for his resignation on Friday morning. OpenAI did not immediately return a request for comment about the resources promised and allocated to that team.
TechCrunch
Anthropic hires Instagram co-founder as head of product
Mike Krieger, one of the co-founders of Instagram and, more recently, the co-founder of personalized news app Artifact (which TechCrunch corporate parent Yahoo recently acquired), is joining Anthropic as the company's first chief product officer. As CPO, Krieger will oversee Anthropic's product engineering, management and design efforts, Anthropic says, as the company works to expand its suite of AI apps and bring Claude, its generative AI technology, to a wider audience.
Engadget
Engadget Podcast: The good, the bad and the AI of Google I/O 2024
In this bonus episode, Cherlynn and Devindra dive into the biggest Google I/O 2024 news.
Engadget
OpenAI claims that its free GPT-4o model can talk, laugh, sing and see like a human
The new model accepts any combination of text, audio and images as input and can generate an output in all three formats.
TechCrunch
OpenAI offers a peek behind the curtain of its AI's secret instructions
OpenAI is offering a limited look at the reasoning behind its own models' rules of engagement, whether it's sticking to brand guidelines or declining to make NSFW content. Large language models (LLMs) don't have any naturally occurring limits on what they can or will say.
TechCrunch
Stack Overflow signs deal with OpenAI to supply data to its models
OpenAI is collaborating with Stack Overflow, the Q&A forum for software developers, to improve its generative AI models' performance on programming-related tasks. As a result of the partnership, announced Monday, OpenAI's models, including models served through its ChatGPT chatbot platform, should get better over time at answering programming-related questions, the two companies say. At the same time, Stack Overflow will benefit from OpenAI's expertise in developing new generative AI integrations on the Stack Overflow platform.
TechCrunch
Microsoft and OpenAI launch $2M fund to counter election deepfakes
Microsoft and OpenAI have announced a $2 million fund to combat the growing risks of AI and deepfakes being used to "deceive the voters and undermine democracy." This year will see a record 2 billion people head to the polls in elections spanning some 50 countries, so there are concerns around the influence that AI will have among voters — particularly those in "vulnerable communities" that may be more susceptible to take what they see at face value. The rise of generative AI, including wildly popular chatbots such as ChatGPT, has led to a major new threat landscape involving AI-generated "deepfakes" designed to perpetuate disinformation.
TechCrunch
This Week in AI: Generative AI and the problem of compensating creators
This week in AI, eight prominent U.S. newspapers owned by investment giant Alden Global Capital, including the New York Daily News, Chicago Tribune and Orlando Sentinel, sued OpenAI and Microsoft for copyright infringement relating to the companies' use of generative AI tech. “We’ve spent billions of dollars gathering information and reporting news at our publications, and we can’t allow OpenAI and Microsoft to expand the big tech playbook of stealing our work to build their own businesses at our expense,” Frank Pine, the executive editor overseeing Alden’s newspapers, said in a statement.
TechCrunch
Anthropic launches new iPhone app and premium plan for businesses
Anthropic, one of the world's best-funded generative AI startups with $7.6 billion in the bank, is launching a new paid plan aimed at enterprises, including those in highly regulated industries like healthcare, finance and legal, as well as a new iOS app. Team, the enterprise plan, gives customers higher-priority access to Anthropic's Claude 3 family of generative AI models plus additional admin and user management controls. "Anthropic introduced the Team plan now in response to growing demand from enterprise customers who want to deploy Claude's advanced AI capabilities across their organizations," Scott White, product lead at Anthropic, told TechCrunch.
Engadget
The Morning After: Microsoft’s OpenAI partnership was born from Google AI envy
The biggest news stories this morning: TikTok might be trying to circumvent Apple’s in-app purchase rules, Batman: Arkham Shadow is the first big exclusive VR game for the Quest 3, Rabbit denies claims its R1 virtual assistant is a glorified Android app.
Engadget
Apple has reportedly resumed talks with OpenAI to build a chatbot for the iPhone
Apple has resumed talks with OpenAI, the maker of ChatGPT, to build an AI-powered chatbot into the iPhone, according to a new report.
Yahoo Finance
Markets brace for Nvidia earnings: What to know this week
A crucial earnings report from AI leader Nvidia greets a stock market that hit new records last week.
Yahoo Finance
The coming Supreme Court decisions that could ripple across the business world
The nation's highest court is expected to make rulings that could limit the reach of social media, the ability of federal agencies to crack down on companies, and the power of bankruptcy courts to shield powerful parties from liability.
Yahoo Finance
The big questions JPMorgan investors have for Jamie Dimon
Attendees at JPMorgan's annual investor day Monday will be listening for answers to some key questions. A top concern is how much longer Jamie Dimon plans to run the largest US bank.

News

Life

Entertainment

Finance

Sports

New on Yahoo

An OpenAI rival says its new AI model is not only cheaper to run than GPT-4, but it's also more useful

Cohere bets on enterprise

Recommended Stories

UK opens office in San Francisco to tackle AI risk

OpenAI and Google lay out their competing AI visions

This Week in AI: OpenAI moves away from safety

OpenAI inks deal to train AI on Reddit data

Anthropic is expanding to Europe and raising more money

Microsoft dodges UK antitrust scrutiny over its Mistral AI stake

OpenAI created a team to control 'superintelligent' AI — then let it wither, source says

Anthropic hires Instagram co-founder as head of product

Engadget Podcast: The good, the bad and the AI of Google I/O 2024

OpenAI claims that its free GPT-4o model can talk, laugh, sing and see like a human

OpenAI offers a peek behind the curtain of its AI's secret instructions

Stack Overflow signs deal with OpenAI to supply data to its models

Microsoft and OpenAI launch $2M fund to counter election deepfakes

This Week in AI: Generative AI and the problem of compensating creators

Anthropic launches new iPhone app and premium plan for businesses

The Morning After: Microsoft’s OpenAI partnership was born from Google AI envy

Apple has reportedly resumed talks with OpenAI to build a chatbot for the iPhone

Markets brace for Nvidia earnings: What to know this week

The coming Supreme Court decisions that could ripple across the business world

The big questions JPMorgan investors have for Jamie Dimon