Deep Papers Artwork

Deep Papers

Deep Papers is a podcast series featuring deep dives on today’s most important AI papers and research. Hosted by Arize AI founders and engineers, each episode profiles the people and techniques behind cutting-edge breakthroughs in machine learning.

Show More

Podcasting since 2023 • 51 episodes

Deep Papers

Latest Episodes

The Illusion of Thinking: What the Apple AI Paper Says About LLM Reasoning

This week we discuss The Illusion of Thinking, a new paper from researchers at Apple that challenges today’s evaluation methods and introduces a new benchmark: synthetic puzzles with controllable complexity and clean logic. Their fi...

June 20, 2025 • 30:35

Deep Papers Artwork

Accurate KV Cache Quantization with Outlier Tokens Tracing

We discuss Accurate KV Cache Quantization with Outlier Tokens Tracing, a deep dive into improving the efficiency of LLM inference. The authors enhance KV Cache quantization, a technique for reducing memory and compute costs during inference, by...

June 04, 2025 • 25:11

Deep Papers Artwork

Scalable Chain of Thoughts via Elastic Reasoning

In this week's episode, we talk about Elastic Reasoning, a novel framework designed to enhance the efficiency and scalability of large reasoning models by explicitly separating the reasoning process into two distinct phases: thinking a...

May 16, 2025 • 28:54

Deep Papers Artwork

Sleep-time Compute: Beyond Inference Scaling at Test-time

What if your LLM could think ahead—preparing answers before questions are even asked?In this week's paper read, we dive into a

May 02, 2025 • 30:24

Deep Papers Artwork

LibreEval: The Largest Open Source Benchmark for RAG Hallucination Detection

For this week's paper read, we dive into our own research.We wanted to create a replicable, evolving dataset that can keep pace with model training so that you always know you're testing with data your model has never seen before. We al...

April 18, 2025 • 27:19

Deep Papers Artwork

See All Episodes

Contributors