Apache Parquet

Software Screenshot:
Apache Parquet
Software Details:
Version: 2.3.1 updated
Upload Date: 9 Feb 16
Distribution Type: Freeware
Downloads: 39

Rating: nan/5 (Total Votes: 0)

Apache Parquet is a "columnar" data storage format that was specifically created for the Apache Hadoop family of projects.

Parquet is recommended to be used with large data, mainly because it uses a complex data compression system, relying on a series of optimized record shredding and re-assembly algorithms.

This allows data to be broken down, organized in a nested format, and reassembled whenever queried.

The Parquet format can also be used outside the Hadoop ecosystem, being specifically designed to be as agnostic as possible, working with any type of data processing framework and data storage model.

What is new in this release:

  • Rename packages and maven coordinates to org.apache
  • Add encoding stats to ColumnMetaData
  • Streaming thrift API
  • New logical types

What is new in version 2.3.0:

  • Rename packages and maven coordinates to org.apache
  • Add encoding stats to ColumnMetaData
  • Streaming thrift API
  • New logical types

Limitations:

  • The project is still under development in the Apache Incubator repository and might change drastically from version to version.

Similar Software

Mongous
Mongous

28 Feb 15

django-mssql
django-mssql

13 May 15

ArangoDB-Python
ArangoDB-Python

13 May 15

PyMySQL
PyMySQL

18 Jul 16

Other Software of Developer Apache Software Foundation

Apache ACE
Apache ACE

13 Apr 15

Apache Tajo
Apache Tajo

10 Feb 16

Apache MyFaces
Apache MyFaces

12 May 15

Apache Turbine
Apache Turbine

9 Feb 16

Comments to Apache Parquet

Comments not found
Add Comment
Turn on images!