Apache Gora

Apache Gora 0.6.1 updated

Apache Gora aims at providing a common ground on which developers can create tools that can handle large quantities of information, regardless of their underlying storage unit. Gora is written in Java and was created as an alternative to other Java ORMs...

Apache HBase

Apache HBase 1.2.0 updated

Apache HBase was modeled after Google's internal BigTable database system and was built atop the famous Apache Hadoop data processing framework and distributed computing engine. Because it was built on Hadoop, some of its cool features are also inherited...

Apache JDO is a Java API that provides easy & well documented methods for storing, accessing and manipulating persistent data.This kind of data is stored using classic POJOs (simple Java objects) and will allow developers to store some of their...

Apache MRUnit lets developers write unit tests that can be used in detecting problems with MapReduce jobs before their run on the database itself.By unit testing Hadoop's MapReduce jobs, developer can avoid useless resource consumption, a good habit to...

Apache OpenJPA

Apache OpenJPA 2.4.0 / 1.2.3 updated

Apache OpenJPA comes in two separate branches, each production level material.The differences between the two are the standards they implement. The 1.x branch follows the JSR-220 Enterprise Java Beans 3.0 specificationm while the 2.x branch was modeled...

Apache Parquet

Apache Parquet 2.3.1 updated

Apache Parquet is a "columnar" data storage format that was specifically created for the Apache Hadoop family of projects. Parquet is recommended to be used with large data, mainly because it uses a complex data compression system, relying on a series of...

Apache Slider

Apache Slider 0.80.0 updated

Apache Slider targets Hadoop environments and is based on the database's next generation MapReduce 2.0 standard, also known as YARN. Slider can be used to create YARN-compliant applications that interact with the underlying Hadoop database or database...

Apache Sqoop

Apache Sqoop 1.4.6 / 1.99.6 updated

Apache Sqoop is a must-have tool for every database administrator, letting them easily move data between the Hadoop NoSQL database to more classic database systems like PostgreSQL, MSSQL, MariaDB, or MySQL - a.k.a. relational databases.Sqoop basically...

Apache Tajo

Apache Tajo 0.11.0 updated

A data warehouse system is a database used for reporting and data analysis. Apache Tajo is the implementation of this concept, but specifically tailored to work with Hadoop, a powerful NoSQL database developed by Yahoo and currently managed by the Apache...