“This is the way”

“This is the way”

分类： LakeHouse

Apache Paimon

LakeHouse

Apache Paimon PK 表的 data distribution

本人初次接触 Paimon，以下内容仅是自己的…

Smith
2026年3月29日

Apache Parquet

LakeHouse

Arrow-rs Parquet Reader 浅析

在千篇一律 arrow-cpp 系的 Parquet Reade…

Smith
2026年1月11日
2 条评论

Apache-Iceberg

LakeHouse

Apache Iceberg Delete File 解析

Iceberg 默认使用 Copy On Write 技术，也…

Smith
2025年6月22日
1 条评论

Apache-Iceberg

LakeHouse

Apache Iceberg 概念梳理

在学习 Iceberg 源码前，我们需要搞清楚 I…

Smith
2025年5月1日
2 条评论

Apache Parquet

LakeHouse

Apache Parquet Bloom Filter

Bloom Filter 只能处理 =，IN 谓词。什么…

Smith
2024年11月23日
1 条评论

Apache Parquet

LakeHouse

Apache Parquet ZoneMap 过滤支持小记

前置背景 ZoneMap Min-max 过滤也叫 ZoneM…

Smith
2024年11月23日

Apache Polaris

LakeHouse

Apache Polaris 从入门到精通

Iceberg Rest Catalog 在介绍 Polaris 之…

Smith
2024年10月29日

orc-vs-parquet

LakeHouse

ORC vs Parquet，孰强孰弱？

2024 年的今天，从事实上看，Parquet 貌似…

Smith
2024年8月10日
4 条评论

LakeHouse

Apache ORC 加密解析

Apache ORC 支持对列进行加密，且会对该列…

Smith
2024年7月7日

rle-images

LakeHouse

RLE 编码在 Apache ORC 中的实现

最近刚学习了 Zigzag（浅谈 Apache ORC 之…

Smith
2024年6月8日
2 条评论