Apache Derby

Apache Derby 10.12.1.1 updated

Apache Derby is one of the first database engines the Apache Software foundation hosted under its umbrella before big names like Yahoo's Hadoop, Facebook's Cassandra, or IBM's CouchDB were also donated to it. Coded from the beginning to...

Apache Ambari

Apache Ambari 2.1.2 updated

Apache Ambari allows database administrators to install, manage and monitor Hadoop clusters. Collecting metrics is done via Ganglia, while email alerts are sent out via Nagios. Amabari is basically a Web-based administration GUI for Hadoop, a tool that...

Apache HBase

Apache HBase 1.2.0 updated

Apache HBase was modeled after Google's internal BigTable database system and was built atop the famous Apache Hadoop data processing framework and distributed computing engine. Because it was built on Hadoop, some of its cool features are also inherited...

Apache OpenJPA

Apache OpenJPA 2.4.0 / 1.2.3 updated

Apache OpenJPA comes in two separate branches, each production level material.The differences between the two are the standards they implement. The 1.x branch follows the JSR-220 Enterprise Java Beans 3.0 specificationm while the 2.x branch was modeled...

Apache Slider

Apache Slider 0.80.0 updated

Apache Slider targets Hadoop environments and is based on the database's next generation MapReduce 2.0 standard, also known as YARN. Slider can be used to create YARN-compliant applications that interact with the underlying Hadoop database or database...

Apache Empire-db

Apache Empire-db 2.4.4 updated

Apache Empire-db functions like a Java ORM but it provides extra support for basic data persistence methods. This implicitly means better targeted SQL statements that provide better and more efficient query results. Empire-db is easy to use, not requiring...

Apache Parquet

Apache Parquet 2.3.1 updated

Apache Parquet is a "columnar" data storage format that was specifically created for the Apache Hadoop family of projects. Parquet is recommended to be used with large data, mainly because it uses a complex data compression system, relying on a series of...

Apache Crunch

Apache Crunch 0.13.0 updated

A pipeline is a concatenation of operations to perform a specific job, arranged so that the output of each element is the input of the next.Apache Crunch provides an easier method of dealing with Apache Hadoop MapReduce pipelines.Crunch simplifies this...

Apache Gora

Apache Gora 0.6.1 updated

Apache Gora aims at providing a common ground on which developers can create tools that can handle large quantities of information, regardless of their underlying storage unit. Gora is written in Java and was created as an alternative to other Java ORMs...