IngestThis!

Data Engineering, Science, and Architecture Content

Home for data professionals. Articles, tutorials, and resources for Data Engineers, Scientists, Analysts, and Architects.

Guest submissions are welcome. Pitch your idea by emailing alex@ingestthis.com or join the devNursery Slack community.

Read the Blog →
Must Reads

Data Lakehouses & Agentic Analytics

Authoritative guides to the modern data ecosystem — curated from Dremio's engineering blog.

Semantic Layer

The Semantic Layer: Definitive Guide

A comprehensive guide to the Semantic Layer — how it creates a single source of truth for metrics, powers headless BI, and makes AI agents answer business questions accurately.

Read on Dremio.com →
Apache Polaris

Apache Polaris: The Catalog Standard for Lakehouses and AI

How Apache Polaris is emerging as the universal Iceberg catalog standard, enabling multi-engine interoperability and governed AI access across the lakehouse ecosystem.

Read on Dremio.com →
Table Formats

What Are Table Formats and Why Were They Needed?

The origin story of open table formats — the problems with Hive, why Apache Iceberg, Delta Lake, and Hudi were created, and what they unlock for modern data platforms.

Read on Dremio.com →
Dremio

What Is Dremio?

A clear-eyed breakdown of what Dremio is, how its semantic layer, query federation, Reflections, and Apache Arrow Flight power the Intelligent Lakehouse Platform.

Read on Dremio.com →
Apache Iceberg

What Apache Iceberg Native Actually Means

Not all 'Iceberg support' is equal. This piece breaks down what it means to be genuinely Apache Iceberg native versus bolt-on, and why it matters for your lakehouse.

Read on Dremio.com →
Open Source

Open Source and the Data Lakehouse

How the Apache Software Foundation's open-source projects — Iceberg, Arrow, Parquet, Polaris — form the modular foundation of the modern open data lakehouse.

Read on Dremio.com →
Agentic AI

What Is Agentic Analytics?

Agentic AI is reshaping how organizations interact with data. This guide explains agentic analytics, the role of the semantic layer, and why query performance matters for AI agents.

Read on Dremio.com →
Data Lakehouse

Definitive Guide to the Data Lakehouse

The complete, authoritative guide to the Data Lakehouse architecture — what it is, why it supersedes the data warehouse + data lake combination, and how to build one.

Read on Dremio.com →
AI & Performance

How Dremio Keeps Agentic Analytics Fast Without Manual Tuning

How Dremio's layered autonomous performance architecture — Reflections, caching, vectorized execution — handles unpredictable AI agent query patterns at interactive speed.

Read on Dremio.com →

Connect with Alex