Sudeep Kumar, Principal Engineer at Salesforce considers the shift to Clickhouse as one of his biggest accomplishments
How the data platform evolved as Slack grew from a startup to an IPOed and then acquired company.
Steven Moy thoroughly explains Yelp’s data architecture under the hood and how it evolved over the past ten years.
This guide will provide you with the fundamental knowledge necessary to handle semi-structured data effectively.
AWS re:invent 2022 was all about building the anticipation and delivering on expectations of us technologists.
In our recent ‘Big Data Analytics for Gaming Workshop’ we let the audience do the talking, here’s a summary of the talk.
As people in the data industry go, Bill Inmon is among the top, often seen as the godfather of the data warehouse.
How does a tech stack that always needs to be at the forefront of technology look like?
In this post, we look at factors to consider when building a data warehouse.
Barr Moses explains how to make sure your data is accurate in a world where so many different teams are accessing it
This is a special episode of The Data Engineering Show revisiting the best bits from three different fascinating episode
When it comes to data management, have we come a long way since the early 2000s?
It’s the mother of all development projects. You use it daily. And so do 65M developers around the world.
Explore the significant differences between ELT and ETL data integration processes and find the best option for you.
Demand from engineering teams has skyrocketed since Firebolt emerged from stealth last year
Upstart cloud data warehouse sees rapid growth in 2021, plans to double its workforce
Why would you create ugly data? According to Jens Larsson, don’t even go near raw data.
Amplitude's cutting-edge data stack and how it processes 5 Trillion real-time events while dealing with mutable data
Appsflyer deals not only with 120 billion events per day, but does so while growing quickly as a company
Bolt engineers are in the midst of designing a new next-gen data platform
An episode about Eventbrite’s data stack modernization process, and how you get engineers to adopt new technologies
IQVIA deep dive into maximizing impact of BI solutions for faster and more informed decision-making in healthcare.
Klarna is one of the leading fintech companies in the world, valued at $45B.
According to Yoav Shmaria, VP R&D Platform at Similarweb, the best way to manage data warehouse costs is tagging
How does Substack's data platform support 500K paying subscribers?
How Vimeo handles Data Ops to deal with massive scale?
Ananth Packkildurai is Principal Software Engineer at Zendesk and runs one of the strongest newsletters in data
How ZipRecruiter and Yotpo build resilient self-service products that keep customers happy and engineers calm
Too often expensive resources and manhours are spent on dashboards no one uses, resulting in zero ROI.
Gong manages hundreds of thousands of videoconferences and millions of emails PER DAY, which add up to hundreds of TBs.
Scaling a data platform to support 1.5T events per day requires complicated technical migrations
Canva is one of the hottest, if not the hottest, graphic design platforms out there.
Joe Reis and Matt Housley joined the bros for some much-needed ranting, priceless data advice, and good laughs.
Data engineering should be less about the stack and more about best practices.
Meenal Iyer, VP Data at Momentive.ai, talks about enforcing collaboration in large organizations
80% of the code that you write doesn’t work on the first try. But knowing which 80% is not working is the real challenge
Principles essential for data quality, cost optimization, and data modeling, as adopted by the world's leading companies
Megan Lieu about her approach to data advocacy as well as the power of notebooks
Is Postgres truly the right engine for analytics?
Joe Hellerstein and Joseph Gonzalez inspired generations of database enthusiasts and are now on the show
How to ingest, store and query JSON data, for example, is a consistent question on the minds of customers.
Max walks the Bros through his recipe for a smart data-driven company, and the genesis of Airflow, Superset & Presto.
In The Data Engineering Show, Ryanne Dolan from LinkedIn joins the Bros to discuss LinkedIn's Hoptimator project.
Should data engineering AND BI be handled by the same people?
Every data team should have at least one data engineer with a software engineering background.
Andy Pavlo, Associate Professor at Carnegie Mellon University, delves into database internals and optimization.
Vin Vashishta, the guy we all love to follow, has never seen a dashboard with positive ROI.
How good you are at Spark or Flink ≠ how good you are at data engineering. Zach Wilson explains.