The Multimodal Lake House : Partnering with Lance

Remember when you took a family photo & Ghibli-styled it?

Or that vibe coding session, when you pasted a screenshot of the browser so the AI can help you debug some Javascript?

Today, we expect AI to be able to hear, see, & read. This is why multimodal is the future of AI.

Multimodal data means using text, images, video, sound, even three-dimensional shapes with AI.

image

These are magical user experiences. But they aren’t easy to build. Data pipelines must be built to manage larger files. Embeddings need to be extracted from these unstructured files in ways that don’t explode compute costs.

Read more

Fighting for Context

Systems of record are recognizing they cannot “take their survival for granted.”

One strategy is to acquire : the rationale Salesforce gives for the Informatica acquisition.

image

Another strategy is more defensive - hampering access to the data within the systems of record (SOR).

image

Unlike the previous software era where SORs built platforms on top of themselves to develop broader ecosystems (in Salesforce’s case Veeva & Vlocity), the AI shift does seem to be more defensive.

Read more

DuckLake - Subsecond Latency on a Petabyte

DuckLake is one of the most exciting technologies in data.

While data lakes are powerful, the formats that manage them have become notoriously difficult to work with.

“I think one of the things in DuckLake that we managed to do is to cut, I want to say like 15 technologies out of this stack.”

How does it achieve this? Instead of building a custom catalog server, DuckLake uses a simple, elegant idea: a standard database to manage metadata. It uses a database for what it’s good at. This clean architecture allows DuckLake to manage huge data lakes—with millions or even billions of files—across AWS S3 or Google Cloud Storage.

Read more

The Data Decacorn Derby

Databricks seems to be closing the gap on Snowflake faster than expected.

Last week Databricks shared some important updates on their business which allows us to compare the progress of the two companies.

image Quarterly revenue between the two company shows nearly identical slope, two parallel lines. Snowflake recently exceeded $1b in quarterly revenue mark while Databricks just touched $750m and is targeting $925m for the next quarter.

image Snowflake’s revenue growth rate has been on a long glide path to nearly asymptoting at 25% year over year. Databricks saw a resurgence in their growth rate from mid-23 to early 25, from about 50 to 60% - exceptional in a company at this stage.

Read more

The Coming Wave of Acquihires

The Seed Surge of 2021 will lead to a raft of acquihires. image

In 2021 the total number of US software & AI seeds jumped from 2900 to 4300 - a 49% jump. Seeds fell to about 3000 creating a seed tabletop.

Series As moved in lockstep both on the way up and the way down - creating a squeeze.

image

These data form part of a longer term trend of a greater number of seeds but a relatively fixed number of Series As.

Read more

Partnering with Maze Security

Doctors and security research have more in common than you might think.

Doctors defend human bodies against an ever-shifting landscape of viruses & infections. Security researchers do the same thing, but at massive scale—protecting thousands of servers instead of a single patient.

The doctors’ responsibility are to defend a human body from an ever-shifting landscape of potential viruses and infections. Each human body is slightly different. The research around human health evolves all of the time as well as the research around potential infections.

Read more

Patterns Across 5 Years of YC Investing

Where venture capital flows, innovation follows. And for more than a decade, few faucets have been watched more closely than Y Combinator. An analysis of their investment patterns since 2020 doesn’t just reveal the accelerator’s strategy—it provides a map to the entire startup ecosystem’s next chapter.

With Demo Day approaching this week & inspired by Jamesin Seidel’s YC Series A analysis, I wondered how YC investment patterns have changed since 2020.

Read more

The IPO Door is Swinging Open

Who could have predicted that crypto and data center real estate would be the categories swinging the IPO market doors open?

In late 2024, I predicted a thaw in the IPO market. We’re now seeing that forecast come to life with CoreWeave and Circle’s IPOs. Neither company is pure-play software, but their strong performances signal renewed investor appetite for the ragged edge of technology.

image

CoreWeave went public in March 2025, raising $1.5 billion at a $20 billion valuation. The GPU infrastructure company has since soared over 300% on 420% year-over-year revenue growth, reaching nearly $1 billion in Q1 alone.

Read more

Stuck in the Middle of AI Workflows

Whenever I hear about a new startup, I pull out my research playbook. First, I understand the pitch, then find backgrounds of the team, & tally the total raised.1

Over the weekend, I decided to migrate this workflow to use AI tools, & the process taught me something important about how we’re actually integrating AI into our work.

Tools are small programs that expand AI capabilities. ChatGPT might call a web search tool to read a blog post I’d like to summarized. Claude might call the terminal tool to change file permissions in my current directory. Gemini might call a tool to find the latest stock price of the most recent IPO I’ve been following.

Read more

What Level of AI?

Which level do I want to use AI?

I find myself asking this question more & more frequently & I think the answer means at work I’ll be using many AIs - not just one or two.

AI Level Use Case Description
Chat-Based AI Find the best Italian restaurant in the North Beach neighborhood of San Francisco.
In-App AI Find a document or generate an overview paragraph within Notion.
Browser-Based AI Deep research queries, such as estimating the market size of data center construction.
Computer-Based AI Transcribe a video call and upload the notes to an investment memo.
Multiple AI Agents Newer coding agents (e.g., Codex & Jules) work in parallel on the same codebase.

Why are there so many levels? It depends on the context I want the AI to have.

Read more