Jacob Leverich on Efficiency, Elegance, and the Joy of Not Grepping Log Files at 2 AM
This week, Frank sat down with Dr. Jacob Leverich, a Stanford PhD, cofounder of Observe, and veteran of the Google MapReduce team and Splunk. Jacob’s journey, from tinkering with video game code as a kid to innovating at the cutting edge of distributed systems and energy efficiency, is as inspiring as it is informative.

Key Takeaways
Early Tech Roots: Hear how curiosity with QBasic and classic PCs (think IBM PC/XT and Commodore) put Jacob on a path to high-impact data engineering.
MapReduce, Dremel, & the Rise of Big Data: Jacob pulls back the curtain on working with some of the most influential data processing tools at Google and how these systems shifted the entire data landscape (hello, BigQuery!).
Building Efficient Systems: It’s not just about scale. Energy efficiency and performance optimization are the unsung heroes of today’s data infrastructure. Jacob explains why making things “just work” isn’t enough anymore.
The Realities of Ops & Observability: Remember the days of grepping logs at 2 AM? There’s a better way. Jacob shares how platforms like Observe help teams consolidate, visualize, and act on operational data, turning chaos into actionable insight.
Bridging Data & Ops: The lines between data observability and traditional ops are blurring, and Jacob’s unique experience shows how best practices from data warehousing are finally making ops smoother (and less sleepless).
Power Concerns & the Future: As data grows, so does energy consumption in data centers. Find out why optimization isn’t just good for performance; it’s key to sustainability.

Timestamps
00:00 Interview with Jacob Leverich
05:59 Journey into Game Programming
06:43 "Pursuing Fast Video Game Code"
10:23 Data Processing and Power Efficiency
16:11 Snowflake's Transformative Database Approach
19:18 Journey to Data Management Industry
21:37 Data Products: Solving Core Challenges
27:07 Early Web Log Analysis Techniques
28:57 Consolidating Data for Efficiency
33:23 Specialized Tools and Context Switching
35:43 Unique Dual-Expertise in Tech
38:58 User-Centric Business Strategies
42:13 IP Data Analysis in Cloud
47:23 Electricity Transport Upsets Local Farms
48:25 Shift to Parallel Computing
52:10 Hardware Specialization & Software Optimization
57:32 "Stay Data Driven"
--------
58:10
István Mészáros on Going from CERN to Startup & The Cat That Launched a Thousand Queries
Welcome to another insightful episode of Data Driven! Today, we're diving into the world of warehouse-native analytics with our special guest, István Mészáros, cofounder of Mitzu. Join us as we explore how Mitzu empowers startups and enterprises with a new approach to data analytics. From his beginnings as a CERN physicist to becoming an open-source evangelist and finally a startup founder, István shares his unique journey through the data industry.

We'll discuss the motivation behind Mitzu's distinct branding, reminiscent of Hello Kitty, and why standing out in today's crowded market is crucial. István also reveals the challenges and strategies of building a data company in Europe, and how Mitzu simplifies analytics by offering a self-service solution without the high costs associated with existing market leaders.

Timestamps
00:00 Introducing István Mészáros
05:30 Shifting Open Source to SaaS
07:46 Lava-Themed Compliance Solutions Brand
10:27 Tech Branding and Hello Kitty Insights
13:46 Optimizing Conversion in Data-Heavy Travel
16:31 Self-Service Analytics Tool Needed
19:17 Automated Product Analytics Tool
23:20 "Budget Constraints and DIY Solutions"
28:17 Freelancer's Efficient Data Solutions
29:08 Open Source Tool Productization Plan
33:13 Navigating Freelance and Startup Challenges
37:19 Transitioning to Data Engineering
42:25 Instant Feedback in Hobbies
43:46 Embracing Feedback in Business Transformation
49:13 "Hoping AI Takes Over Hiring"
51:58 Visit Site for Info & Contact
55:22 "Parenting Boys with Earbuds"
57:25 "Data Driven: Quantum Podcast Relaunch"
--------
58:18
Barr Moses on How Data Observability Can Save Your Company Millions
On this episode of Data Driven, we welcome Barr Moses, CEO and co-founder of Monte Carlo, as she delves into the fascinating world of data observability. Join hosts Frank La Vigne and Andy Leonard as they explore how reliable data is crucial for making sound business decisions in today's tech-driven world. Learn why a simple schema change at Unity resulted in a $100 million loss and how Monte Carlo is developing cutting-edge solutions to prevent similar disasters. From discussions on ensuring data integrity to the intriguing potential of AI in anomaly detection, Barr Moses shares insights that might just redefine your understanding of data's role in business. Tune in for a podcast that not only uncovers the nuances of data reliability but also touches on the quirky side of tech, like why, according to Google, you should never use superglue to fix slipping cheese on your pizza.

Moments
00:00 Monte Carlo: Data Reliability Innovator
05:45 "Data & AI Observability Engineering"
09:42 Data Industry's Growing Importance
12:00 Cereal Supply Chain Data Optimization
16:03 Data Observability and Lineage
19:29 GenAI Uncertainties and Latency Concerns
23:17 "Human Oversight in AI Accuracy"
24:12 Data Observability and Human Role
28:01 Adapting to Customer Language
33:29 Data and Security Management Alignment
35:20 Data Reliability and Observability Challenges
38:17 Automated Code Analysis Tool Launch
42:29 Data-Inspired Childhood
44:12 Passionate About Impactful Work
48:52 LinkedIn Security Concerns Highlighted
53:19 "Data Observability Insights"
--------
54:15
Sanjay Annadate on Data Driven Digital Transformation
In this episode, Sanjay joins Frank for a deep dive into the heart of digital transformation and AI-powered automation. Here are some of the key takeaways:

Digital Transformation Evolution: Sanjay reflects on his nearly three-decade journey witnessing the digital shift from infancy to the AI-driven present. He outlines the critical components of digital transformation, including cloud adoption and data prioritization, noting significant changes in business focus over recent years.
Microsoft's Role: Sanjay provides insights into Microsoft's strategic investments in digital transformation technologies, emphasizing their pivotal role in influencing market trends and industry-specific capabilities.
AI-Powered Enhancements: From the widespread adoption of Copilot to the burgeoning concept of agentic AI, Sanjay discusses how AI tools are not replacing but augmenting the productivity of data engineers, offering a glimpse into the future of business processes.
Edge of Innovation: We explore how Microsoft Fabric and other technologies are simplifying complex architectures, allowing businesses to leverage multi-cloud strategies effectively, keeping them at the forefront of innovation.
Real-Life Impact: Sanjay shares compelling examples, like reducing sales briefing preparation time from four days to two minutes, showcasing the transformative power of AI in real business scenarios.

Whether you're a data engineer, business leader, or just someone fascinated by the data-driven world, this episode is packed with valuable insights.

Moments
00:00 Three Decades of Digital Transformation
05:27 Microsoft's Digital Transformation Dominance
09:37 Microsoft's Cloud Integration Advantage
13:22 Red Hat AI's Open Source Approach
15:33 Microsoft Fabric's Multi-Cloud Integration Strategy
20:01 "Custom Solutions for Complex Queries"
21:39 Content Creation Efficiency Unlocked
26:38 Sales Role Dependency Reduction Tool
30:06 Agentic AI and Workflow Transformation
33:29 "Beyond Basic Automation"
35:05 AI's Impact on Business Expansion
39:58 Data-Driven Problem Solving Impact
41:58 Reading Trends in Data Innovation
--------
45:07
Trevor Schulze on How CIOs Can Drive AI Strategy
In this episode, Andy Leonard and Frank La Vigne are thrilled to be joined by Trevor Schulze, the Chief Information Officer at Alteryx. Trevor brings an unparalleled perspective on digital transformation, drawing from his impressive tenure at industry giants such as Micron, Cisco, and RingCentral.

Timestamps
00:00 "Data Driven: AI & CIO Insights"
04:32 CIO's Role in AI Evolution
06:50 CIO's Evolving Role with AI
11:43 "Embracing Data Democratization"
16:24 Democratizing Data Access
19:33 "AI Investment and Optimization Cycle"
20:55 AI Enhances Tool Configuration Guidance
24:42 Breaking Free from Vendor Lock-In
27:41 "Unleashing Shadow AI and Technical Debt"
31:53 Digital Performance Essential for All Industries
34:01 Data Privacy Concerns in AI Use
37:30 AI Democratization Challenges for Enterprises
42:15 AI Transforming Business Processes
43:55 Data-Driven Career Journey
47:13 "Building Trust in Data Analytics"
52:34 Building Trust in Future Tech
Data Driven: the podcast where we explore the emerging field of Data Science. We bring the best minds in Data, Software Engineering, Machine Learning, and Artificial Intelligence right to you every Tuesday.
The field of data science mashes up the worlds of statistics, database architecture, and software engineering. The Harvard Business Review has labelled Data Scientist "the sexiest job of the 21st century." A quick search of job sites reveals that this field is in high demand.
In a world where Data is the new Oil, Data Science the new Refineries, consider this Car Talk for the Data Age. Every week we bring the best minds in this emerging field straight to you. Our goal is to educate and inspire our listeners so that they can be prepared to thrive in a Data Driven world.