Tomasz Tunguz

The Battle for AI Gravity

Thu, 25 Jul 2024 00:00:00 +0000

During the era of big data, data gravity was the core strategic imperative. Wherever the biggest dataset resided, customers ran their compute workloads that generated all of the profit and revenue growth for the last generation of data companies.

Today, the battle is for AI gravity.

Why? AI requires orders of magnitude more compute than other workloads, so there’s much more money & profit to be made serving customers running them.

Facebook and Google both announced very similar strategies of overinvesting in AI data centers. Google is on a trajectory to invest $50 billion this year. Amazon & Microsoft are also deploying similar tens of billions for the same purpose.

Within AI, the switching costs today are modest. Most large-scale AI products have yet to be built. Many enterprises are in the process of testing. Switching from OpenAI to Claude to Gemini to Mistral is still tractable & some are running comparative evaluations before picking.

There’s ample reason to wait. The proliferation of different model types has created a cornucopia of choice. Open & closed ; small/medium/large ; models built for images or code or text ; all of these are in rapid development.

In addition, the underlying systems to manage AI applications have changed rapidly. Initially, an AI app meant wrapping the AI. Then we began to add routers, mixtures of experts, & small language models. Now we’re realizing the LLM architecture isn’t the best at planning work : reinforcement learning is better & must be integrated.

So AI products aren’t electric motors with one or two moving pieces, but more like gas-powered engines with many moving parts. It’s easy to quote Knuth here : “Premature optimization is the root of all evil.” Better for large enterprises to wait until there’s a reference architecture that’s been proven to work.

Plus, data movement is less expensive than in the previous era. AWS & others have stopped charging to move data. New data formats like Iceberg simplify data movement. Both have decreased switching costs.

We should expect to see increasingly stiff & significant competition for AI gravity. OpenAI announced free fine-tuning for models to entice customers to use their platform. AWS cut prices more than 100 times in its first five years. History will rhyme with AI.

Customers & startups will benefit from this intense competition with better models, cheaper inference costs, & faster innovation as the incumbents spend their massive balance sheets to exert the greatest AI gravity.

Agentic Systems' Sales Cycles

Mon, 22 Jul 2024 00:00:00 +0000

As software startups begin to sell agentic systems, the procurement process will change. Unlike classical software, where the application either meets the criteria (price, integration into other software, particular features) or doesn’t, agentic systems operate on a performance continuum.

Here’s a recent evaluation table for Codestral, Mistral’s open-source code generation AI. All of these benchmarks are machine-generated : HumanEval & HumanEvalFIM are not human testers - but open-source projects that evaluate AI code.¹

This type of evaluation works well for a broad sense of relative performance. But what if a business writes code in a particular language? Or with particular performance characteristics in mind?

What if an AI-powered customer support agent needs to be able to manage very technical telecom queries? Or a marketing AI needs to be culturally sensitive to a particular region?

The generic tests probably won’t work, which translates to slower sales cycles as prospective buyers understand the system’s performance in their own context.
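One way to picture an evaluation in the buyer’s own context is a small harness that runs an agent against the buyer’s cases & reports a pass rate. The sketch below is illustrative only : `EvalCase`, `evaluate`, & the toy `telecom_agent` are hypothetical names, the test cases are invented, & a real evaluation would use far richer scoring than a substring match.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class EvalCase:
    prompt: str    # buyer-specific input, e.g. a technical telecom query
    expected: str  # phrase a correct answer must contain (a crude check)


def evaluate(agent: Callable[[str], str], cases: List[EvalCase]) -> float:
    """Return the fraction of cases whose answer contains the expected phrase."""
    if not cases:
        raise ValueError("need at least one evaluation case")
    passed = sum(
        1 for case in cases
        if case.expected.lower() in agent(case.prompt).lower()
    )
    return passed / len(cases)


# Hypothetical stand-in for a vendor's agent; a real one would call a model.
def telecom_agent(prompt: str) -> str:
    if "APN" in prompt:
        return "Reset the APN settings on the handset."
    return "Please restart your device."


cases = [
    EvalCase("Customer cannot get mobile data after changing APN", "apn"),
    EvalCase("Fiber ONT shows a red LOS light", "fiber"),
]
pass_rate = evaluate(telecom_agent, cases)
print(f"pass rate: {pass_rate:.0%}")
```

The same agent could score well on a generic benchmark & still fail half of a buyer’s own cases, which is the gap a longer sales cycle is spent measuring.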

In addition, agentic systems in the future will operate for longer periods of time without human intervention. The greater the autonomy, the greater the potential for errors. Benchmarks may not be enough; buyers may want to see how the system performs in their own context over time.

Startups - as they always do - will find ways to accelerate the evaluation. They might develop their own standards much the way that OpenAI has, or partner with third parties to offer independent evaluations for particular use cases.

Imagine a modern-day Gartner for Agentic Systems, a company that maintains a diverse pool of human evaluators & computer scientists skilled in the evaluation of various agentic products.

Alternatively, the most sophisticated organizations could create standards that then become broadly adopted. Banks could publish open-source standards for regulator-compliant customer support chatbots.

This purchasing behavior does exist elsewhere. Backtesting is the norm in trading algorithms & marketing optimization. Within the most sophisticated security organizations, security labs exist to test machine learning-based security products and performance before deploying them.

In certain cases, the business need will overwhelm the procurement process. This happens in classic software & it will happen with AI but it’s rarer.

However the problem is solved, agentic systems will evolve the procurement process & startups will need to navigate it.

¹ OpenAI created both of these tests to measure the accuracy of its code generation model & now they’re a standard for evaluating AI code generation models.

The Future of Blockchain Data : Our Investment in Allium

Wed, 17 Jul 2024 00:00:00 +0000

Large scale ETL (extract, transform, load) processes are a critical part of any data pipeline. They are responsible for moving data from one place to another, transforming it into a usable format, and loading it into a destination system.

In the world of blockchain, these processes are even more complex.

In web2, the engineering team building a payment processing system will convey to the analytics team the data schema. In web3, any programmer can create transactions & inject meaning into fields.

How does a web3 analyst or product manager or consumer make sense of data at scale?

As thousands of developers build & trillions of dollars worth of value are stored on blockchains, this problem compounds geometrically. In the early days of web3, this problem was limited to a relatively small set of developers.

Today, 56% of Fortune 500 companies are working on on-chain projects. Stablecoins now process more than twice as much transaction value as Visa every month. Bitcoin ETFs count $63b in assets under management. BlackRock & Fidelity have launched tokenized Treasury products with nearly $1b in assets combined. Stripe now allows its merchants to accept payments in Ethereum, Solana, & Polygon without transaction fees. PayPal’s stablecoin has more than $0.5b in assets.

Understanding what data means across this universe isn’t easy.

In the future, every software will have a web3 component. Major financial institutions will offer a broad suite of tokenized products. Merchants will accept payments in a variety of cryptocurrencies.

Behind each of these products lies a complex data pipeline. Allium is a leader in this space, providing the most robust data solutions in web3. We are excited to support Allium on this journey and look forward to the innovations they will drive in the blockchain space.

Allium ingests & normalizes web3 data & works with leaders in the space including Visa to drive transparency into stablecoins, Stripe to fight fraud in payments, Grayscale to enable broader adoption of web3 in the public market, and Phantom to provide the most secure wallet for users.

Allium collaborated with Visa to measure the growth of stablecoin users, 27.5m monthly actives at the time, as well as isolate bot & human transaction activity.

We are thrilled to announce our investment in Allium, alongside friends at Amplify & Kleiner Perkins.

Punctuated Equilibrium in AI : Is it Better to Be A First Mover or A Last Mover?

Mon, 15 Jul 2024 00:00:00 +0000

Machine learning advances tend to evolve in bursts. Researchers publish a new paper with a newly discovered technique. It launches the industry forward & more researchers rapidly iterate to improve it further.

Progress looks like this - a series of S-curves one after another.

No one knows the time period between the rapid progress or the slope of the curves or how much progress we’ll make during one of these curves.

In the last few years, since the release of the Attention Is All You Need paper, AI has boomed. The rapid pace of iteration has led to the release of model after model.

A collection of human evaluators marks our progress using a score called Elo collected by LMSYS, modeled after chess grandmaster rankings. If we plot Elo by Rank, we see an S-curve.

And if you look very closely at the upper right, which indicates the performance of the most recent models : GPT-4o, Claude 3.5 Sonnet, Gemini Pro, & Llama 3 - if you squint, perhaps you can imagine another elbow, another rapid advance of progress in performance.
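For readers unfamiliar with the Elo score mentioned above, here is a minimal sketch of the classic chess formula applied to head-to-head model comparisons. It is not a description of how LMSYS computes its leaderboard, which may use a different statistical fit ; the K-factor of 32 & the starting ratings of 1200 are assumptions chosen for illustration.

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Probability that A beats B under the standard Elo logistic curve."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update(rating_a: float, rating_b: float, a_won: bool, k: float = 32.0):
    """Return both ratings after one head-to-head comparison."""
    exp_a = expected_score(rating_a, rating_b)
    score_a = 1.0 if a_won else 0.0
    delta = k * (score_a - exp_a)  # the winner gains what the loser gives up
    return rating_a + delta, rating_b - delta


# Two models start level; a human evaluator prefers model A's answer.
new_a, new_b = update(1200.0, 1200.0, a_won=True)
print(new_a, new_b)  # 1216.0 1184.0
```

A 400-point gap corresponds to roughly 10-to-1 odds, so small differences at the top of the chart still mean one model wins only slightly more often than it loses.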

OpenAI categorizes levels of AI similarly to self-driving cars. The five levels are :

Level	Name	Description
1	Chatbots	robots that can chat with you
2	Reasoners	robots that use logic to solve problems
3	Agents	robots that act independently of humans
4	Innovators	robots that create new ideas independently
5	Organizations	robots that replicate the work of organizations

OpenAI has also said it’s close to moving from Level 1 to Level 2. What if that little orange elbow is the beginning of reasoning in AI?

In some software categories, first mover advantage exists. In search, the last mover (Google) won because it benefited from the learnings of all who came before.

AI is characterized by waves of innovation & sudden change, a technological punctuated equilibrium.

Will the winners in this era be those who started early before the rapid advance or those who waited until afterwards?

No SaaS! How AI Agents Will Change Software Pricing

Fri, 12 Jul 2024 00:00:00 +0000

In a world where AI agents are 2.5-3x as productive as humans, which would parallel mechanical robots, how does a software company price?

Building on yesterday’s post, pricing in software companies may change significantly when AI agents become the norm.

The SaaS business model of the last 20 years is a beautiful one. Annual prepaid contracts are free loans to software companies ; seat-based pricing is a tangible metric for pricing ; as a client grows so does this account, producing good net dollar retention.

What does a software seat mean when a human is no longer operating the software?

There are a few alternatives :

Triple the per seat price : If the AI agent is 3x as productive as a human, the software company could charge 3x as much per seat. This would be a significant increase in price, but the value of the software would be much higher. Tripling prices will be a hard sell in a year but perhaps a slow increase over time would achieve this. Companies will need to adjust staffing plans & budgets.
Move to usage-based pricing : Jamin makes the case that AI software will be priced like databases since the AI is using the database directly. Just as databases charge for compute, AI agents will charge for compute. This aligns value well, but may inject unpredictability into the pricing model. It will require changing sales compensation plans & customer contracts, which database companies have navigated successfully. Buyers would need to be educated.¹
Pay for performance : Some AI companies are exploring charging for outcomes. If an AI agent replaces an SDR who is compensated for meetings, then why not charge this way? There are challenges here too. If the company doesn’t use the product in the most optimal way & performance suffers, should the contract shrink in value?

It will take time for both vendors & customers to grasp the implications for both productivity & expense.

But for the first time since Slack started offering billing on active seats, new pricing models provide a strategic option to startups looking to compete with incumbents.

Salesforce made famous the No-Software mantra competing on pricing.

The now-classic seat based model disrupted the perpetual license model. Perhaps usage or performance pricing will be the catalyst for a new era of upstarts displacing incumbents.

Maybe we’ll see a No-SaaS rebel replicate Marc Benioff’s playbook.

¹I imagine both usage-based pricing and pay for performance will be structured as a Two-Part Tariff with some base level of commitment to smooth revenue & cash flows.
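A two-part tariff of the kind described in the footnote can be sketched in a few lines : a fixed commitment plus a per-unit charge on usage above an included allowance. The function name, the prices, & the allowance are all hypothetical ; a "unit" could be compute consumed or an outcome such as a booked meeting.

```python
def two_part_tariff(base_commitment: float, unit_price: float,
                    units_used: float, included_units: float = 0.0) -> float:
    """Invoice = fixed commitment + per-unit charge on usage above the allowance."""
    billable = max(0.0, units_used - included_units)
    return base_commitment + unit_price * billable


# Hypothetical contract: $1,000 per month covers 2,000 units, then $0.50 each.
quiet_month = two_part_tariff(1000.0, 0.5, units_used=1500, included_units=2000)
busy_month = two_part_tariff(1000.0, 0.5, units_used=5000, included_units=2000)
print(quiet_month, busy_month)  # 1000.0 2500.0
```

The commitment puts a floor under revenue in quiet months, while the variable part lets the invoice track the value delivered in busy ones.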

AI Pricing Strategies for SaaS Companies Offering Copilots

Thu, 11 Jul 2024 00:00:00 +0000

Pricing an AI product will be a defining question in software for the next few years. AI products offer productivity gains. But greater productivity may reduce the demand for seats over time, ultimately decreasing the size of software markets.

We can observe the market trends today across some of the larger SaaS companies who offer AI pricing.

Company	Product	Base Price	AI Price	Ratio
GitHub	GitHub Enterprise	21	10	0.48
GitLab	GitLab Duo	19	20	1.05
Google	Workspace Business Plus	18	20	1.11
Loom	Business	12.50	4	0.32
Microsoft	Office 365	45	30	0.67
Salesforce	Einstein 1 Service & Sales Cloud	330	170	0.51
ServiceNow	Pro	100	60	0.6
Zapier	Team	69	0	0
Zendesk	Suite Professional	115	0	0

The table above lists the company ; the product ; the base price per-seat for the enterprise plan if available, otherwise the team plan ; then the price for the AI or co-pilot add-on ; and finally the ratio between the AI price and the base price.

Sometimes the price is hard to compare, but I’ve tried to do my best to create a fair comparison.

Plotting the ratio illustrates the variance in the market today. Google charges more for its AI features than for the base seat, while Loom charges about a 32% premium.

There’s no relationship between a more expensive seat & a greater ratio of the AI add-on. The R-Square is 0.08 : no correlation at all.
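For anyone who wants to rerun the comparison, the sketch below recomputes the ratio column from the table above & the R² of a simple linear fit of ratio against base price. The helper name `r_squared` is mine, & the exact figure depends on which rows & which fit are used, so it may not reproduce the number quoted above ; either way the value is close to zero.

```python
# (company, base price per seat, AI add-on price), copied from the table above
rows = [
    ("GitHub", 21, 10),
    ("GitLab", 19, 20),
    ("Google", 18, 20),
    ("Loom", 12.50, 4),
    ("Microsoft", 45, 30),
    ("Salesforce", 330, 170),
    ("ServiceNow", 100, 60),
    ("Zapier", 69, 0),
    ("Zendesk", 115, 0),
]


def r_squared(xs, ys):
    """Squared Pearson correlation, i.e. R^2 of a one-variable linear fit."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy * sxy / (sxx * syy)


base = [b for _, b, _ in rows]
ratios = [ai / b for _, b, ai in rows]
r2 = r_squared(base, ratios)

for (name, _, _), ratio in zip(rows, ratios):
    print(f"{name:<11}{ratio:.2f}")
print(f"R^2 = {r2:.2f}")
```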

Overall, I’d characterize the ecosystem as iterating. OpenAI and GitHub launched their features at roughly $20-30 per month. This initial pricing has anchored the market at least for now in that range.

Microsoft & ServiceNow have stated AI features increased productivity by approximately 50 percent. If buyers act rationally & reduce headcount by 50%¹ which we know is probably not true, then to maintain the same revenue per customer, price would need to double. We can observe that in three of the companies’ pricing strategy above.

If pricing really does provide information (see the work of Mauboussin), then these companies are pricing in a 40% productivity gain.
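The break-even arithmetic behind this argument is short enough to write down. Revenue per customer is seats times price, so if seats fall by a fraction h, price must rise by a factor of 1 / (1 - h) ; conversely, an add-on priced at a ratio r of the base seat keeps revenue flat up to a seat cut of r / (1 + r). The function names below are mine, & the sketch does not claim to reproduce how the figure above was derived.

```python
def required_price_multiple(seat_reduction: float) -> float:
    """Price multiple that keeps revenue flat after cutting seats by this fraction."""
    if not 0.0 <= seat_reduction < 1.0:
        raise ValueError("seat_reduction must be in [0, 1)")
    return 1.0 / (1.0 - seat_reduction)


def breakeven_seat_reduction(ai_ratio: float) -> float:
    """Seat cut at which base + add-on revenue equals the old base-only revenue."""
    return ai_ratio / (1.0 + ai_ratio)


print(required_price_multiple(0.5))              # 2.0 : halve seats, double price
print(round(breakeven_seat_reduction(1.0), 2))   # 0.5 : an add-on equal to the base price
print(round(breakeven_seat_reduction(0.67), 2))  # 0.4 : a 0.67 ratio, as in the Microsoft row
```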

This is for copilots. Agents, which fully automate work or at least claim to fully automate work, may have more disruptive pricing. Instead of hiring a sales development rep, hire a robot. I’ll write about that in tomorrow’s post.

¹ It’s highly debatable whether this will happen. Most companies will likely leverage efficiency gains into more growth, but let’s consider the downside scenario.

Select avg(Moby Dick) limit 2 sentences

Mon, 08 Jul 2024 00:00:00 +0000

The SQL statement above is a quote from our recent Office Hours with Benn Stancil. It’s not a SQL statement that would work today in a cloud data warehouse. But an LLM would understand it : summarize the book Moby Dick in two sentences.

Sure enough, ChatGPT answers the question :

This pseudocode blends the structured queries of data analysis with the unstructured data contained in a classic novel. This is how Benn views the future of BI.

BI’s Third Form. “Is the output numbers in the future of BI? I don’t think it’s numbers…the theory is what we want ; the characterization & qualitative parts.”

Just as ChatGPT summarized the essence of Moby Dick, AI easily summarizes the themes of 10 customer conversations - or 100.

Benn views the future of BI as the narrative, the story behind the data. Numbers have been the historical source of truth, a proxy to underlying customer behavior. Nobel laureate Shiller wrote about the power of narrative economics, stories used to understand a business & shape the future.

The rest of the Modern Data Stack will transform, too. Data Quality will need to tackle ever greater volumes. If LLMs do average the insight in new underlying unstructured data sources which are potentially massive, what is Data Quality’s new role in the world of AI? ETL, more deterministic in nature, may not see full delegation to AI.

We covered the Databricks/Tabular acquisition & many other topics. Benn’s views on the data ecosystem are unique : a bit irreverent with lots of practical experience underpinning them.

I hope you enjoy the conversation as much as I did. The video is embedded below & podcast link is here.

The First of Your Newsletters

Mon, 01 Jul 2024 00:00:00 +0000

“This is the first of your newsletters that doesn’t align well with what I’ve been seeing in the field.”

After publishing The Four Barriers to AI Adoption, Dave Morse, a reader & a friend who was most recently CRO at Hebbia & VP Sales at Scale AI, sent me this email.

Dave continued :

“The biggest blocker to adoption at AI application companies is user education and limitations of frontier models. Finding use cases that work; steering users away from failure cases. Prompting for use cases that work. Dealing with stochasticity.

This is where agents comprise most good adoption stories I know of. GPTs get into production quickly while search based apps or chatbots have a longer path to wide adoption. Even small startups can push agents into production in <2 weeks.

Most customers will waive or streamline security review, procurement processes, etc. to get their desired solution in house IF the vendor can demo a use case that actually works. They will gate some data access until the vendor has certifications or the business simply pushes IT to approve access. Some companies are closing deals in <90 days with no demo…there is some impressive selling going on.

Most AI companies I know send their initial MSA waiving rights to train on customer data. This can be added later if the business needs it. With issues around data retention and training waived the MSA doesn’t look that different from data analytics companies. Customer legal teams push risk around data leaks or other data issues into higher limit of liability”

One of the gifts of writing in public is receiving emails like this one that offer a different view of what’s working in the field.

Some things stand out to me from Dave’s perspective :

The nuance in selling between GPTs compared to search & chatbots is a great example of how the market is evolving differently in different segments : lumping all AI solutions into the same category is a mistake.

Leveraging the executive sponsorship to bypass procurement processes is consistent with the AI imperative boards & executives have championed.

The AI market isn’t a single market : it reflects the broader software market, with both older & newer segments.

If you have a view on how the AI software market is evolving, I’d love to hear from you.

The Four Barriers to AI Adoption

Fri, 28 Jun 2024 00:00:00 +0000

AI adoption is slower than expected in many spaces. Some of the reasons are straightforward, but others are more subtle.

Most leaders want to inject AI into their business to develop a competitive advantage. There are four challenges.

The first challenge is understanding the technology’s ability. Because the capabilities evolve so quickly, it’s hard to keep up. If PhDs in the domain are rushing to understand the capabilities reading papers every week, how are business leaders meant to grok the state of the art?

Also, because the systems are non-deterministic, they are unpredictable. The pace of innovation, the early understanding of AI internals, & the non-determinism compound to create doubt.

Security is the second challenge. Because of their unpredictable nature & because few have expertise launching these systems, product managers, engineering leaders, & security teams are hesitant to launch both internal & external systems until they develop confidence in data security.

AI security has at least four dimensions : model security, prompt injection, RAG authentication/authorization, & data loss prevention.

Legal is the next barrier to entry. Master service agreements (MSAs) are the contracts that dictate terms of service, data privacy, & service levels between a buyer & a vendor. These agreements’ clauses are well-trodden & known.

AI is new. Should a company allow a vendor to train a model using their data? Whose intellectual property is a fine-tuned model? What happens if a vendor violates a data privacy law? What training data is used that might subject the software buyer to future legal action?

Many legal teams are working to understand those questions.

Procurement is yet another barrier. SOC2, GDPR, ISO27001 & other certifications provide industry standards for security & compliance. But no such standard exists for AI - yet. Bias, fairness, & explainability are all important factors in AI : some are important for public relations, others for compliance.

Selling AI is not just selling software. Many of the processes are new & these barriers introduce friction into the sales process, extending sales cycles.

Over time, these rough edges will be worn smooth through practice. But the first companies selling today will need to persist through these challenges.

2024 Theory GTM Survey

Tue, 25 Jun 2024 00:00:00 +0000

It’s time for the 2024 Annual Theory Go-to-Market Survey. This is a brief 28-question survey.

Our goal is to understand how startups have evolved their sales, marketing, customer success, and cash management over the last four years by comparing these results to those through the go-go years of 2020 and beyond.

We will publish these results and answer questions about them at upcoming Office Hours.

With this data, we should be able to draw some broader conclusions about the shift from growth to efficiency & determine if the buyer behavior changes in the private market parallel those in the public market. In addition, we should be able to draw some initial inferences about the impact of AI on sales teams.

If you complete the survey, I will share with you the anonymized raw data so you can perform your own analyses. If you have questions, just message me on Twitter or send me an email.