IngestThis Logo
BLOG
COMMUNITY
PODCAST

Tag: data engineering

2025-12-29 • Alex Merced

2025 Year in Review Apache Iceberg, Polaris, Parquet, and Arrow

A look back at key developments in Apache Iceberg, Polaris, Parquet, and Arrow in 2025....

2025-01-06 • Alex Merced

RAG Isn’t a Modeling Problem. It’s a Data Engineering Problem.

Why retrieval-augmented generation systems fail in enterprises—and what to do about it....

2025-01-02 • Alex Merced

Building Pangolin - My Holiday Break, an AI IDE, and a Lakehouse Catalog for the Curious

A personal story of how I built Pangolin Catalog over a holiday break using an AI-powered IDE....

2024-11-15 • Alex Merced

Deep Dive into Dremio's File-based Auto Ingestion into Apache Iceberg Tables

Auto ingesting data from JSON, CSV, and Parquet files into Apache Iceberg Tables...

2024-11-05 • Alex Merced

Dremio, Apache Iceberg and their role in AI-Ready Data

The Role of Dremio and Apache Iceberg in AI-Ready Data...

2024-10-31 • Alex Merced

Hands-on with Apache Iceberg & Dremio on Your Laptop within 10 Minutes

How to get hands-on with Apache Iceberg...

2024-10-30 • Alex Merced

Data Modeling - Entities and Events

How to Model Events and Entities...

2024-10-21 • Alex Merced

All About Parquet Part 01 - An Introduction

All about the Apache Parquet File Format...

2024-10-21 • Alex Merced

All About Parquet Part 02 - Parquet's Columnar Storage Model

All about the Apache Parquet File Format...

2024-10-21 • Alex Merced

All About Parquet Part 03 - Parquet File Structure | Pages, Row Groups, and Columns

All about the Apache Parquet File Format...

2024-10-21 • Alex Merced

All About Parquet Part 04 - Schema Evolution in Parquet

All about the Apache Parquet File Format...

2024-10-21 • Alex Merced

All About Parquet Part 05 - Compression Techniques in Parquet

All about the Apache Parquet File Format...

2024-10-21 • Alex Merced

All About Parquet Part 06 - Encoding in Parquet | Optimizing for Storage

All about the Apache Parquet File Format...

2024-10-21 • Alex Merced

All About Parquet Part 07 - Metadata in Parquet | Improving Data Efficiency

All about the Apache Parquet File Format...

2024-10-21 • Alex Merced

All About Parquet Part 08 - Reading and Writing Parquet Files in Python

All about the Apache Parquet File Format...

2024-10-21 • Alex Merced

All About Parquet Part 09 - Parquet in Data Lake Architectures

All about the Apache Parquet File Format...

2024-10-21 • Alex Merced

All About Parquet Part 10 - Performance Tuning and Best Practices with Parquet

All about the Apache Parquet File Format...

2024-10-18 • Alex Merced

A Guide to dbt Macros - Purpose, Benefits, and Usage

Learning about dbt Macros...

2024-10-16 • Alex Merced

Data Lakehouse Roundup 1 - News and Insights on the Lakehouse

What's Going on in the Data Lakehouse Space...

2024-10-15 • Alex Merced

Getting Started with Data Analytics Using PyArrow in Python

Learning to work with PyArrow to run analytics...

Categories

data engineering
oltp
database
data
frontend
data lakehouse
Data Engineering
Data Lakehouse
Javascript
Data Architecture
Data Analytics
Devops
Data Modeling
DevOps
python
sql
rust
AI
Apache Iceberg
copyright 2022 by Alex Merced of alexmercedcoder.dev