Venture Capitalist at Theory

11 minute read / Jan 22, 2025 /

Top Themes in Data Transcript

Slide 1

Clearing: While the data world consolidates, capabilities have exploded with AI.

Content:

Slide 2

Clearing: My name is Tomasz Tunguz, founder and general partner at Theory.

Content:

Transition:

Slide 3

Clearing: Every transformation follows a pattern. Today, three powerful movements are reshaping how enterprises work with data.

Content:

Transition:

Slide 4

Clearing: Let’s talk about the great consolidation.

Content:

Transition:

Slide 5

Clearing: Buyers are overwhelmed. I’m hearing more and more of them say, “Don’t sell me another tool!”

Content:

Transition:

Slide 6

Clearing: That MacBook Pro should be called a mainframe pro. It’s just that powerful.

Content:

Transition:

Slide 7

Clearing: Decoupling storage and compute is all about unlocking flexibility.

Content:

Transition:

Slide 8

Clearing: AI is changing the way software and data engineering teams work together.

Content:

Transition:

Slide 9

Clearing: Historically, there’s been a divide between software engineering and AI/ML teams.

Content:

Transition:

Slide 10

Clearing: AI is a core part of many products, and in the future, every software company will be an AI company.

Content:

Transition:

Slide 11

Clearing: In the 24 months after ChatGPT was released, a parameter race was unleashed in which models grew ever larger, culminating most recently with Llama 3.1 at 405 billion parameters.

Content:

Transition:

Slide 12

Clearing: Databricks published its most recent state of data report earlier this year. It shows that small models are the most popular.

Content:

Transition:

Slide 13

Clearing: Plotting MMLU scores, a rough measure of high school equivalency, over time, you can see that small, medium, and large models are converging around 70 to 80% accuracy.

Content:

Transition:

Slide 14

Clearing: In addition, smaller models offer significantly better latency.

Content:

Transition:

Slide 15

Clearing: DocsBot tracks these prices and plots them on a logarithmic chart.
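
A logarithmic axis suits prices that fall by large multiples, because each tenfold drop takes up the same distance on the chart. Here is a minimal sketch in TypeScript of that effect. The `illustrativePrices` values and the per-million-token unit are made up for illustration and are not data from the tracker.

```typescript
// Hypothetical price points, invented purely for illustration.
interface PricePoint {
  label: string;
  usdPerMillionTokens: number;
}

const illustrativePrices: PricePoint[] = [
  { label: "t0", usdPerMillionTokens: 100 },
  { label: "t1", usdPerMillionTokens: 10 },
  { label: "t2", usdPerMillionTokens: 1 },
  { label: "t3", usdPerMillionTokens: 0.1 },
];

// Map a price to its position on a base-10 logarithmic axis.
function logPosition(price: number): number {
  if (price <= 0) {
    throw new RangeError("a logarithmic axis needs strictly positive prices");
  }
  return Math.log(price) / Math.LN10;
}

// Size of the drop between each pair of consecutive points.
function stepSizes(values: number[]): number[] {
  return values.slice(1).map((v, i) => values[i] - v);
}

const linearSteps = stepSizes(
  illustrativePrices.map((p) => p.usdPerMillionTokens)
);
const logSteps = stepSizes(
  illustrativePrices.map((p) => logPosition(p.usdPerMillionTokens))
);

console.log("linear axis steps:", linearSteps);
console.log("log axis steps:", logSteps);
```

With these made-up points, the steps on the linear axis shrink from 90 to about 0.9, so later price cuts are nearly invisible. On the log axis every step is about 1.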

Content:

Transition:

Slide 16

Clearing: Data modeling isn’t just back - it’s become the foundation of reliable AI.

Content:

Transition:

Slide 17

Clearing: Here I created a little TypeScript application that processes the famous FAA data. I did this in 15 minutes.
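
The application itself is not shown in the transcript. As a rough sketch of the kind of small TypeScript program that can be put together quickly over flight records, the example below parses a few rows and averages departure delay per carrier. The column names (`carrier`, `origin`, `dep_delay`) and the sample rows are hypothetical and are not taken from the FAA dataset or the speaker's code.

```typescript
// Hypothetical record shape; the real FAA data has different and more columns.
interface Flight {
  carrier: string;
  origin: string;
  depDelayMinutes: number;
}

// Made-up sample rows, for illustration only.
const sampleCsv = `carrier,origin,dep_delay
AA,SFO,12
AA,JFK,-3
UA,SFO,30
UA,ORD,0`;

// Parse a simple comma-separated string (no quoted fields) into Flight records.
function parseFlights(csv: string): Flight[] {
  const [header, ...rows] = csv.trim().split("\n");
  const columns = header.split(",");
  const indexOf = (name: string): number => {
    const i = columns.indexOf(name);
    if (i < 0) throw new Error(`missing column: ${name}`);
    return i;
  };
  const carrierIdx = indexOf("carrier");
  const originIdx = indexOf("origin");
  const delayIdx = indexOf("dep_delay");
  return rows.map((row) => {
    const cells = row.split(",");
    return {
      carrier: cells[carrierIdx],
      origin: cells[originIdx],
      depDelayMinutes: Number(cells[delayIdx]),
    };
  });
}

// Average departure delay in minutes, keyed by carrier code.
function averageDelayByCarrier(flights: Flight[]): Record<string, number> {
  const totals: Record<string, { sum: number; count: number }> = {};
  for (const f of flights) {
    const t = totals[f.carrier] ?? { sum: 0, count: 0 };
    t.sum += f.depDelayMinutes;
    t.count += 1;
    totals[f.carrier] = t;
  }
  const averages: Record<string, number> = {};
  for (const carrier of Object.keys(totals)) {
    averages[carrier] = totals[carrier].sum / totals[carrier].count;
  }
  return averages;
}

const averages = averageDelayByCarrier(parseFlights(sampleCsv));
console.log(averages);
```

A real version would read the data from a file or a database and handle quoted fields, but the parse-then-aggregate shape would stay the same.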

Content:

Transition:

Slide 18

Clearing: Many other organizations, the leading ones in particular, are starting to use AI in a pretty meaningful way.

Content:

Transition:

Slide 19

Clearing: Data governance isn’t about control anymore - it’s about enablement.

Content:

Transition:

Slide 20

Clearing: The business intelligence ecosystem has been a pendulum oscillating between centralized and decentralized control.

Content:

Transition:

Slide 21

Clearing: I believe data pipelines are the backbone of any modern AI system.

Content:

Transition:

Slide 22

Clearing: This slide really captures the essence of why intelligent data pipelines are so vital.

Content:

Slide 23

Clearing: Every transformation follows a pattern. Today, three powerful movements are reshaping how enterprises work with data.

Content:

Transition:

