Deep Papers

Deep Papers is a podcast series featuring deep dives on today’s most important AI papers and research. Hosted by Arize AI founders and engineers, each episode profiles the people and techniques behind cutting-edge breakthroughs in machine learning.

All Episodes

Deep Papers

Phi-2 Model

February 02, 2024 • Arize AI

We dive into Phi-2 and some of the major differences and use cases for a small language model (SLM) versus an LLM.

With only 2.7 billion parameters, Phi-2 surpasses the performance of Mistral and Llama-2 models at 7B and 13B parameters on various aggregated benchmarks. Notably, it achieves better performance compared to 25x larger Llama-2-70B model on multi-step reasoning tasks, i.e., coding and math. Furthermore, Phi-2 matches or outperforms the recently-announced Google Gemini Nano 2, despite being smaller in size.

Find the transcript and live recording: https://arize.com/blog/phi-2-model

To learn more about ML observability, join the Arize AI Slack community or get the latest on our LinkedIn and Twitter.

People on this episode

Deep Papers

Deep Papers

Phi-2 Model

People on this episode

Aparna Dhinakaran

Brian Burns

Jason Lopatecki