IngestThis Logo
BLOG
COMMUNITY
PODCAST

Tag: Compaction

2025-09-16 • Alex Merced

The Endgame — Building an Autonomous Optimization Pipeline for Apache Iceberg

Learn how to automate compaction, snapshot expiration, and layout optimization in Apache Iceberg using metadata-driven t...

2025-09-09 • Alex Merced

Managing Large-Scale Optimizations — Parallelism, Checkpointing, and Fail Recovery

Learn how to scale Apache Iceberg table optimizations across large datasets using parallelism, checkpointing, and fail r...

2025-09-02 • Alex Merced

Hidden Pitfalls — Compaction and Partition Evolution in Apache Iceberg

Partition evolution in Apache Iceberg is a powerful feature, but if not managed carefully, it can introduce fragmentatio...

2025-08-26 • Alex Merced

Using Iceberg Metadata Tables to Determine When Compaction Is Needed

Discover how to use Apache Iceberg's metadata tables to proactively detect small files, bloated manifests, and table fra...

2025-08-19 • Alex Merced

Designing the Ideal Cadence for Compaction and Snapshot Expiration

Learn how to design an effective schedule for compaction and snapshot expiration in Apache Iceberg to balance cost, perf...

2025-07-29 • Alex Merced

Optimizing Compaction for Streaming Workloads in Apache Iceberg

Learn how to design fast, incremental compaction strategies in Apache Iceberg to support high-throughput streaming pipel...

Categories

data engineering
oltp
database
data
frontend
data lakehouse
Data Engineering
Data Lakehouse
Javascript
Data Architecture
Data Analytics
Devops
Data Modeling
DevOps
python
sql
rust
AI
Apache Iceberg
copyright 2022 by Alex Merced of alexmercedcoder.dev