Apache MRUnit lets developers write unit tests that detect problems in MapReduce jobs before they run on the Hadoop cluster itself. By unit testing Hadoop's MapReduce jobs, developers can avoid wasting resources, a good habit to...

Apache Sqoop

Apache Sqoop 1.4.6 / 1.99.6 updated

Apache Sqoop is a must-have tool for every database administrator, letting them easily move data between Hadoop and more classic database systems like PostgreSQL, MSSQL, MariaDB, or MySQL, a.k.a. relational databases. Sqoop basically...

Apache CouchDB was started by former IBM developer Damien Katz and later donated to the Apache Software Foundation. Compared to other databases, CouchDB is still very young, but this has not stopped it from gathering quite a following in its short lifespan. The...

Apache Accumulo

Apache Accumulo 1.7.0 updated

Apache Accumulo is a mashup of various technologies, from Google's BigTable design to Apache's Hadoop, Thrift, and ZooKeeper. Compared to Google's BigTable system, Accumulo features a few improvements of its own. These include table cell-based access restrictions,...

Apache JDO is a Java API that provides easy, well-documented methods for storing, accessing, and manipulating persistent data. This kind of data is stored using classic POJOs (plain old Java objects) and will allow developers to store some of their...

Apache Tajo

Apache Tajo 0.11.0 updated

A data warehouse system is a database used for reporting and data analysis. Apache Tajo is an implementation of this concept, specifically tailored to work with Hadoop, a distributed storage and processing framework developed largely at Yahoo and currently managed by the Apache...

Apache Cassandra

Apache Cassandra 3.5.0 / 2.2.6 updated

Apache Cassandra was initially built by Facebook for its own servers and services, using principles developed by Google and Amazon for BigTable and Dynamo. The project was later donated to the Apache Foundation, which continues to develop it...