Parquet - Gillespedia

Parquet pops up a lot in context with [[CSV]]. It's a *column-oriented* data storage format. Apparently associated with [[Hadoop]]. It has data compression built-in, and optimized reading columns. [[Python]] [[Pandas]] can work with Parquet easily. Given that it's compressed by default, it's not as dead-simple to open and look at as its [[plaintext]] alternative. **** # More ## Source - https://en.wikipedia.org/wiki/Apache_Parquet