This is a special episode of The Data Engineering Show revisiting the best bits from three different fascinating episode
In The Data Engineering Show, Ryanne Dolan from LinkedIn joins the Bros to discuss LinkedIn's Hoptimator project.
Andy Pavlo, Associate Professor at Carnegie Mellon University, delves into database internals and optimization.
Too often expensive resources and manhours are spent on dashboards no one uses, resulting in zero ROI.
Principles essential for data quality, cost optimization, and data modeling, as adopted by the world's leading companies
Data engineering should be less about the stack and more about best practices.
Joe Hellerstein and Joseph Gonzalez inspired generations of database enthusiasts and are now on the show
Megan Lieu about her approach to data advocacy as well as the power of notebooks
Every data team should have at least one data engineer with a software engineering background.
Vin Vashishta, the guy we all love to follow, has never seen a dashboard with positive ROI.
Joe Reis and Matt Housley joined the bros for some much-needed ranting, priceless data advice, and good laughs.
As people in the data industry go, Bill Inmon is among the top, often seen as the godfather of the data warehouse.
Meenal Iyer, VP Data at Momentive.ai, talks about enforcing collaboration in large organizations
When it comes to data management, have we come a long way since the early 2000s?
How good you are at Spark or Flink ≠ how good you are at data engineering. Zach Wilson explains.
How ZipRecruiter and Yotpo build resilient self-service products that keep customers happy and engineers calm
Barr Moses explains how to make sure your data is accurate in a world where so many different teams are accessing it
Amplitude's cutting-edge data stack and how it processes 5 Trillion real-time events while dealing with mutable data
80% of the code that you write doesn’t work on the first try. But knowing which 80% is not working is the real challenge
Sudeep Kumar, Principal Engineer at Salesforce considers the shift to Clickhouse as one of his biggest accomplishments
Max walks the Bros through his recipe for a smart data-driven company, and the genesis of Airflow, Superset & Presto.
According to Yoav Shmaria, VP R&D Platform at Similarweb, the best way to manage data warehouse costs is tagging
Klarna is one of the leading fintech companies in the world, valued at $45B.
An episode about Eventbrite’s data stack modernization process, and how you get engineers to adopt new technologies
How the data platform evolved as Slack grew from a startup to an IPOed and then acquired company.
Should data engineering AND BI be handled by the same people?
Why would you create ugly data? According to Jens Larsson, don’t even go near raw data.
Ananth Packkildurai is Principal Software Engineer at Zendesk and runs one of the strongest newsletters in data
Gong manages hundreds of thousands of videoconferences and millions of emails PER DAY, which add up to hundreds of TBs.
Bolt engineers are in the midst of designing a new next-gen data platform
Scaling a data platform to support 1.5T events per day requires complicated technical migrations
It’s the mother of all development projects. You use it daily. And so do 65M developers around the world.
How does a tech stack that always needs to be at the forefront of technology look like?
How Vimeo handles Data Ops to deal with massive scale?
How does Substack's data platform support 500K paying subscribers?
Steven Moy thoroughly explains Yelp’s data architecture under the hood and how it evolved over the past ten years.
Canva is one of the hottest, if not the hottest, graphic design platforms out there.
Appsflyer deals not only with 120 billion events per day, but does so while growing quickly as a company