Apache Parquet

Apache Parquet 2.3.1 updated

Apache Parquet is a columnar data storage format created for the Apache Hadoop family of projects. It is recommended for large datasets, mainly because it uses a sophisticated data compression system, relying on a series of...

Apache Accumulo

Apache Accumulo 1.7.0 updated

Apache Accumulo combines several technologies: it follows the design of Google's BigTable and is built on top of Apache Hadoop, Thrift and ZooKeeper. Compared to Google's BigTable system, Accumulo features a few improvements of its own. These include cell-level access restrictions,...

Apache Slider

Apache Slider 0.80.0 updated

Apache Slider targets Hadoop environments and is based on YARN, Hadoop's next-generation resource management layer introduced with MapReduce 2.0. Slider can be used to create YARN-compliant applications that interact with the underlying Hadoop cluster or database...

Apache Ambari

Apache Ambari 2.1.2 updated

Apache Ambari allows system administrators to install, manage and monitor Hadoop clusters. Metrics are collected via Ganglia, while email alerts are sent out via Nagios. Ambari is basically a web-based administration GUI for Hadoop, a tool that...

Apache Tajo

Apache Tajo 0.11.0 updated

A data warehouse system is a database used for reporting and data analysis. Apache Tajo is an implementation of this concept, specifically tailored to work with Hadoop, a distributed storage and processing framework developed largely at Yahoo and currently managed by the Apache...

Apache Gora

Apache Gora 0.6.1 updated

Apache Gora aims to provide a common ground on which developers can create tools that handle large quantities of information, regardless of the underlying data store. Gora is written in Java and was created as an alternative to other Java ORMs...

Apache Empire-db

Apache Empire-db 2.4.4 updated

Apache Empire-db functions like a Java ORM, but it provides extra support for basic data persistence methods. This means better-targeted SQL statements that yield more efficient queries. Empire-db is easy to use, not requiring...